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Introduction 


The  36th  annual  Sanibel  Symposium,  orga¬ 
nized  by  the  faculty,  students,  and  staff  of 
the  Quantum  Theory  Project  of  the  University  of 
Florida,  was  held  on  February  24-March  2,  1996. 
The  meeting  was  again  held  at  the  Ponce  de  Leon 
Conference  Center  in  St.  Augustine,  Florida.  About 
320  participants  gathered  for  eight  days  of  lectures 
and  informal  discussions. 

The  format  of  the  symposium  adopted  for  the 
past  few  years  was  followed  again  this  year  with  a 
compact  eight-day  schedule  with  an  integrated 
program  of  quantum  biology,  quantum  chemistry, 
and  condensed  matter  physics.  The  topics  of  the 
sessions  covered  by  these  proceedings  included 
Developmental  Tools  and  Methodologies  for  Com¬ 
putational  Chemical  Physics,  Density  Functional 
Theory  and  Applications,  Quantum-Classical  Dy¬ 
namics  for  Extended  Systems,  Simulations  of 
Nanocrystals,  Clusters  and  Molecules,  Novel 
Method  in  Electronic  Structure  Theory,  Potential 
Energy  Surfaces  for  Excited  States,  Industrial  Ap¬ 
plications  of  Quantum  Chemistry,  Time-Depen¬ 
dent  Dynamics  for  Many-Electron  Systems,  Meth¬ 
ods  for  Big  Molecules,  Molecular  Self-Assembly 
and  Design,  and  Medium  Effects  on  Embedded 
Molecules. 

The  articles  have  been  subjected  to  the  ordinary 
refereeing  procedures  of  the  International  Journal  of 
Quantum  Chemistry.  The  articles  presented  in  the 
sessions  on  quantum  biology  and  associated  poster 
sessions  are  published  in  a  separate  volume  of  the 
International  Journal  of  Quantum  Chemistry. 

The  organizers  acknowledge  the  following 
sponsors  for  their  support  of  the  1996  Sanibel  Sym¬ 
posium; 

■  U.S.  Army  Research  Office  and  U.S.  Army 


Edgewood  RD  &  E  Center  through  Grant  No. 
DAAH04-96-1-PHCF.  (The  views,  opinions, 
and/or  findings  contained  in  this  report  are 
those  of  the  authorfs)  and  should  not  be 
construed  as  an  official  Department  of  the 
Army  position,  policy,  or  decision,  unless  so 
designated  by  other  documentation.) 

-  The  Office  of  Naval  Research  through  Grant 
No.  N00014-96-1-0197.  (This  work  relates  to 
Department  of  Navy  Grant  No.  N00014-96-1- 
0197  issued  by  the  Office  of  Naval  Research. 
The  United  States  Government  has  a 
royalty-free  license  throughout  the  world  in 
all  copyrightable  material  contained  herein.) 

■  IBM  Corporation 

•  FlyperCube,  Inc. 

-  Sun  Microsystems  Computer  Corporation 

•  The  University  of  Florida 

Very  special  thanks  go  to  the  staff  of  the  Quan¬ 
tum  Theory  Project  of  the  University  of  Florida  for 
handling  the  numerous  administrative,  clerical, 
and  practical  details.  The  organizers  are  proud  to 
recognize  the  contributions  of  Mrs.  Judy  Parker, 
Ms.  Leann  Golemo,  Ms.  Sandra  Weakland,  and  Mr. 
Sullivan  Beck.  All  the  graduate  students  of  the 
Quantum  Theory  Project  who  served  as  “gofers” 
are  gratefully  recognized  for  their  contributions  to 
the  1996  Sanibel  Symposium. 

N.  Y.  Ohrn 
J.  R.  Sabin 
M.  C.  Zerner 
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The  Chairman’s  Introductory  Remark 


JEAN-LOUIS  CALAIS 

1932-1995 


Before  introducing  our  speakers,  let  me  re¬ 
mind  you  of  one  of  the  Sanibel  Pioneers, 
Jean-Louis  Calais  of  Uppsala  University.  He  died 
suddenly  in  Uppsala  on  May  30,  1995  at  the  age  of 
62,  while  still  approaching  the  highest  peaks  of  his 
scientific  career. 

When  the  first  Sanibel  Symposium  took  place  at 
Sanibel  Island,  Jean-Louis  was  a  very  young  scien¬ 
tist,  but  he  played  a  major  role  in  organizing  that 
symposium.  Jean-Louis  was  one  of  Per-Olov 
Lowdin's  first  graduate  students  and  coworkers 
and  thus  one  of  the  founding  members  of  the 
Uppsala  Quantum  Chemistry  Group  and  the 
Quantum  Theory  Project  (QTP)  at  the  University 
of  Florida,  where  he  became  an  adjunct  professor, 
Jean-Louis  made  fundamental  contributions  to 
the  methods  of  quantum  chemistry,  to  solid-state 
and  polymer  physics  and  chemistry,  electron  nu¬ 
clear  dynamics  in  the  hydrogen  molecule  and  ex¬ 
tended  systems,  and  much  more. 


Jean-Louis'  students  and  coworkers  came  from 
all  over  the  world.  For  many  years  he  was  instru¬ 
mental  in  organizing  and  lecturing  at  the  Interna¬ 
tional  Summer  School  in  Quantum  Chemistry  held 
in  Scandinavia  and  the  corresponding  Winter  Insti¬ 
tutes  at  Gainesville,  and  he  was  the  Uppsala  editor 
of  the  International  Journal  of  Quantum  Chemistry. 

Jean-Louis  was  a  Scholar  in  the  true  sense  of  the 
word.  During  the  last  few  years,  I  had  the  privi¬ 
lege  to  witness  his  success  in  establishing — or 
rather  reestablishing — human  and  scientific  con¬ 
tacts  with  colleagues  and  students  in  the  Baltic 
Countries,  especially  at  the  University  of  Tartu.  It 
was  no  surprise  that  he  succeeded  so  well.  He  was 
almost  an  expert  in  Baltic  history  and  culture,  and 
had  started  to  learn  the  languages.  Above  all,  he 
was  almost  unique  in  generosity,  kindness,  and 
friendship. 

Goran  Bergson 


International  Journal  of  Quantum  Chemistry;  Quantum  Chemistry  Symposium  30.  1215  (1996) 
©  1 996  John  Wiley  &  Sons,  Inc. 


CCC  0360-8832  /  96  /  071 21 5-01 


Partial  Waves  in  the  Nonspherical  Case 


T.  LEVITINA*  AND  E.  J.  BRANDAS 

Quantum  Chemistry  Group  for  Research  in  Atomic,  Molecular  and  Solid  State  Physics,  Uppsala 
University,  Box  518,  S-751  20  Uppsala,  Sweden 

Received  February  28,  1996;  revised  manuscript  received  April  2,  1996;  accepted  May  8,  1996 


ABSTRACT  - - 

Many  quantum  mechanical  problems  are  separable  in  one  or  several  of  the  standard 
classifications  of  general  coordinate  systems.  The  associated  solution  is  most  conveniently 
evaluated  using  expansions  in  Lame  wave  functions.  We  give  here  some  preliminary 
numerical  results  displaying  the  continuous  distortion  of  an  s-,  p-,  and  d-wave  spherical 
harmonic.  The  results  indicate  rather  surprising  features  which  may  be  useful  in  general 
quantum  mechanical  contexts.  ©  1996  John  Wiley  &  Sons,  Inc. 


Introduction 

When  separating  the  variables  in  terms  of 
ellipsoidal  coordinates,  particular  solu¬ 
tions  of  the  Helmholtz  equation  are  formed  as  pro¬ 
ducts  of  the  so-called  angle  and  radial  ellipsoidal 
(or  Lame)  wave  functions.  For  comparison,  in  the 
case  of  a  sphere,  the  angle  functions  are  the  Legen¬ 
dre  polynomials  and  the  associated  functions, 
while  the  radial  ones  are  the  Bessel  and  Hankel 
functions. 

Consider  the  ellipsoidal  coordinate  system  given 
by  x^/(a^  H-  17)  -I-  y^/(b^  +  r])  +  z^/ic^  -h  17)  =  1, 
where  the  basical  ellipsoid  has  the  semiaxes  a  > 

*  Permanent  address:  Department  of  Numerical  Methods, 
Computing  Center  of  Russian  Academy  of  Sciences,  Vavilova 
str.  40,  117967  Moscow  GSP-1,  Russia. 


b  >  c.  Going  over  to  dimensionless  variables  ^  = 
(77  +  a^)/ia^  -  b^),  one  writes 


-  b^)^  ^  (a^  -  b^){^-l) 


Here,  we  have  set  —  c^)/{a^  —  b^l,  i.e., 

p^  >  1.  This  equation  for  each  point  ix,y,z), 
xyz  A  0,  has  exactly  three  different  roots:  g 

f  =  (0;  1),  ^2  ^  h  =  (1;  ^  I3  = 

( p^;  00).  If  ^  =  li,  the  surface  defined  above  is  an 
ellipsoid;  for  1=^2/  it  is  a  hyperboloid  of  one 
sheet;  and  for  ^  =  ^3,  it  is  a  hyperboloid  of  two 
sheets. 

The  above  three  families  of  confocal  surfaces 
form  an  orthogonal  curvilinear  coordinate  sys- 
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tern  related  to  Cartesian  coordinates  within  each 
octant  (e.g.,  x  >  0,  y  >  0,  z  >  0)  by  one-to-one 
correspondence: 

y"  =  -(a^-  -  1X^2  -  1)(^3  -  1)/ 

2^  =  (fl^  -  b^)ii,  -  -  p2)(^3  -  p2)/ 

[(  -  l)p^].  (1) 


One  can  find  the  detailed  consideration  of  the 
variables  separation  in  ellipsoidal  coordinates  in, 
e.g.,  [1,  2]. 

As  a  result,  we  obtain  a  system  of  three  equa¬ 
tions  known  as  the  Lame  wave  equations: 


+  ‘o)Ai  =  0,  e /i,  (2.a) 


-  ^  ")A2  =  0,  ^2  €E  4,  (2.b) 


rfAj 

^^3 


+  a))A^  =  0,  ^3^/3,  (2.c) 


where  h  and  /  are  the  separation  constants,  = 
-  IX^  -  p\  qU,  h,  I,  w)  =  hp^  -  Ip^  + 
and  (o^  =  k'^ia^  —  b^),  k  standing  for  the 
wave  number.  Hereafter,  we  consider  the  first  two 
equations,  i.e.,  those  determining  the  angular  part 
of  a  particular  solution  to  the  Helmholtz  equation. 


Results  and  Discussion 

Owing  to  the  symmetry  of  the  problem,  a  par¬ 
ticular  solution  must  be  either  odd  or  even  with 
respect  to  each  coordinate  plane.  Since  coordinate 
planes  correspond  to  the  ends  of  the  intervals  of 


ellipsoidal  coordinate  variations,  we  have 

lim  Ai(^^)=0or  lim  (yTT^A'iC ^i)]  =  0, 

f  =0or  ^  lim  ^(\//( A'X^j))  =  0, 

lim  A2(^2)  =  0or 
f2-.l+0 

lim  (V-/(^2)  A'2(^2))  =  0, 
lim  A2(^2)  =  0or 

(2-^P^-O 

lim  U-fi  ^2)  ^2))  =  0,  (3) 


besides  the  boundary  conditions  for  1  -  0 
and  for  ^2  1  +  d/  which  must  be  identical. 

No  reliable  methods  for  the  evaluation  of  ellip¬ 
soidal  wave  functions  existed  until  [3,  4]  appeared 
and,  therefore,  the  application  of  these  functions 
had  been  delayed.  This  is  because  the  angle  Lame 
wave  functions  compose  the  eigenfunctions  of  a 
two-parameter  self-adjoint  Sturm- Liouville  prob¬ 
lem  (see,  e.g.,  [5]),  i.e.,  on  separating  variables,  we 
arrive  at  a  couple  of  ODEs  (2.a,b),  each  depending 
on  its  own  argument  and  being  subject  to  its  own 
boundary  conditions.  The  equations  are  linked  to¬ 
gether  only  by  a  couple  of  separation  constants, 
which  enter  each  equation.  One  must,  hence,  find 
such  a  couple  of  values  for  the  separation  con¬ 
stants  such  that  both  equations  have  nontrivial 
solutions  satisfying  the  corresponding  boundary 
conditions.  The  eigenvalue  of  the  problem  is  such 
a  couple  of  values,  and  the  product  of  the  corre¬ 
sponding  solutions  to  the  system  gives  its  eigen¬ 
function. 

One  can  find  a  detailed  analytical  investigation 
of  the  Lame  wave  functions  in  [6]  and  the  articles 
cited  therein.  The  Lame  wave  functions  depend 
continuously  on  the  parameters  p  and  ca,  and  in 
the  case  o)  =  0,  they  coincide  with  the  well-known 
Lame  polynomials  (see  [2]).  Eigenfunctions  associ¬ 
ated  with  different  eigenvalues  are  orthogonal  with 
respect  to  the  inner  product: 


(*,*)  =  //  {*  X  *)cr(^j, 

where  D  =  /^  X  4,  ^2)  =  (iz  -  ^1) 

The  methods  proposed  in  [3,  4]  provide  an 
efficient  and  economic  way  to  calculate  the  Lame 
wave  functions  at  a  rather  wide  range  of  parame- 
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ter  variations.  This  technique  exploits  a  general¬ 
ized  Priifer  transformation,  which  has  also  been 
successfully  used  for  solving  various  similar  prob¬ 
lems  (see  [7,  8]).  The  merit  of  the  above  technique 
can  be  illustrated  from  [9],  where  the  solution  of 
an  acoustic  wave-scattering  problem  by  a  general 
ellipsoid  has  been  expanded  in  terms  of  the  Lame 
wave  functions  and  multiple  results  of  the  calcula¬ 
tions  have  been  plotted.  When  expanding  func¬ 
tions  defined  over  a  three-dimensional  space,  one 
faces  the  necessity  to  prolong  the  partial  solutions 
expressed  in  Lame  wave  functions  to  the  hole 
space,  since  (1)  defines  the  relations  between  ellip¬ 
soidal  and  Cartesian  coordinates  only  within  an 
octant.  The  prolongation  of  the  partial  solutions 
defined  in  one  octant  into  the  others  has  also  been 
considered  in  [9]. 

Here,  we  present  the  examples  of  angular  distri¬ 
bution  densities  which  have  been  calculated  as 
squared  eigenfunctions  of  the  above  two-parame¬ 
ter  Sturm-Liouville  problem,  normalized  accord¬ 


ing  to  the  condition 

f  I  =  1/8 

JJji 

(here,  we  take  into  account  the  above  comment 
that  n  covers  only  a  part  of  an  ellipsoid  enclosed 
in  an  octant).  Each  surface  is  plotted  as  a  function 
of  spherical  angles  0  and  (p,  where  6  is  counted 
off  X,  and  the  connection  between  the  angle  ellip¬ 
soidal  coordinates  and  the  spherical  ones  is  given 
by 

^1^2  =  P^COS^e, 

(^]  -  1)(^2  -  1)  =  “(P^  “  Dsin^PcosV, 

(^1  -  P^)(^2  -  P^)  =  sinV 

(for  more  details,  see  [9]). 

Given  a  set  of  boundary  conditions  (3)  and  an 
eigenvalue  multi-index,  i.e.,  a  couple  of  integers 


FIGURE  1 .  pco-diagram  for  the  s-like  density.  For  more  details,  see  text. 
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specifying  the  number  of  internal  zeros  of  solu¬ 
tions  to  (2.a)  and  (2.b),  the  parameters  p  and  w 
uniquely  determine  the  problem.  Hence,  we  can 
vary  the  semiaxes  of  the  basical  ellipsoid  and  the 
wave  number  k  and  obtain  the  same  solution 
provided  that  the  values  of  p  and  co  do  not  change. 

In  Figure  1,  we  give  the  pco-diagram  for  the 
s-like  state  density,  i.e.,  the  dependence  on  the 
parameters  p  and  w.  The  value  of  p  remains 
constant  in  each  line  ( p^  =  1.01,  1.1,  and  2.0,  re¬ 
spectively),  while  the  value  of  co  is  constant  in 
each  column  (co^  =  0.1,  1.0,  5.0,  10.0).  One  can  see 
that  for  small  co  the  shapes  of  partial  angular 
distributions  is  almost  independent  of  the  geome¬ 
try  of  the  problem  and  the  orbitals  are  close  to  the 
spherical  case.  In  the  case  co^  =  0,  all  the  orbitals 
of  the  same  type  (i.e.,  both  the  same  boundary 
condition  set  and  the  same  multi-index)  coincide 
regardless  of  the  value  of  p.  This  fact  can  be  easily 
proved  for  the  basical  state  orbitals  under  all  pos¬ 
sible  sets  of  boundary  conditions  (3);  it  also  seems 


to  be  true  for  all  harmonics.  Thus,  the  orbitals  can 
be  naturally  classified  according  to  the  conven¬ 
tional  classification  in  the  spherical  case,  in  spite  of 
the  fact  that  the  higher  the  value  of  co  (or  P)  the 
more  is  the  difference  among  the  shapes. 

In  Figure  2,  one  can  see  the  changes  in  the  p- 
and  rf-like  orbitals,  which  appear  as  co  increases 
from  left  to  right  (co^  =  0.0,  5.0,  10.0,  20.0,  50.0), 
with  p  remaining  constant  (  =  2.0).  The  original 

shapes  should  be  obvious  to  the  reader. 

In  practice,  much  more  complicated  Lame  wave 
functions  can  be  calculated;  see,  e.g,,  [3].  Here,  we 
have  only  displayed  the  continuous  distortion  of 
some  simple  partial  wave  distributions,  as  viewed 
and  comprehended  via  so-called  pco-diagrams.  It  is 
interesting  to  note,  however,  that  the  distortions 
occurring  in  Figures  1  and  2  can  be  obtained  in 
several  ways:  For  instance,  a  specific  distributional 
shape  in  the  pct>-diagram  may  be  obtained  for 
certain  values  of  the  semiaxes  a,  b,  and  c  at  given 
P.  Alternatively,  the  same  angular  distribution 


FIGURE  2.  pto-diagrams  for  the  p-  and  d-like  orbitals.  Note  that  only  co  changes  here. 
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can  also  be  maintained  via  different  values  of  P  if 
p  and  o)  remain  constant — a  flexibility  not  given 
by  standard  choices  of  coordinate  system.  This 
result  could  be  of  potential  interest,  suggesting 
useful  applications  in  a  more  general  quantum 
chemical  context. 
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ABSTRACT _ _ _ 

The  Feynman  path  integral  method  is  applied  to  the  many-electron  problem.  We  first 
give  new  closure  relations  in  terms  of  ordinary  complex  and  real  numbers,  which  could 
be  derived  from  an  arbitrary  complete  set  of  state  vectors.  Then,  in  the  path  integral 
form,  the  partition  function  of  the  system  and  the  ensemble  average  of  energy  are 
explicitly  expressed  in  terms  of  these  closure  relations.  It  is  impossible  to  evaluate  the 
path  integral  by  direct  numerical  integrations  because  of  its  huge  amoimt  of  integration 
variables.  Therefore,  we  develop  an  algorithm  by  the  Monte  Carlo  method  with  constraints 
corresponding  to  the  normalization  condition  of  states  to  calculate  the  required  integral. 
Finally,  the  ensemble  average  of  energy  for  the  hydrogen  molecule  is  explicitly  evaluated 
by  the  quantum  Monte  Carlo  method  and  results  are  compared  with  the  result  obtained 
by  the  ordinary  full  configuration  interaction  (Cl)  method.  ©  1996  John  Wiley  &  Sons,  Inc. 


Introduction 

The  path  integral  introduced  by  Feynman  [1] 
has  been  applied  to  many  problems  in  quan¬ 
tum  mechanics.  The  quantization  by  the  path 
integral  is  different  from  the  usual  canonical  pro¬ 
cedures,  namely,  the  physical  quantity  can  be  han¬ 
dled  as  a  commuting  variable  similar  to  a  classical 
one.  Thus  the  physical  processes  can  be  under- 

*  To  whom  correspondence  should  be  addressed. 


Stood  more  clearly  than  in  ordinary  quantum  me¬ 
chanics.  On  the  other  hand,  almost  all  the  studies 
of  quantum  chemistry  have  been  done  by  dealing 
with  the  Schrodinger  equation.  The  wave  fimction 
of  the  many-electron  system  is  usually  written  in 
terms  of  the  Slater  determinant  composed  of  one- 
body  functions,  where  the  wave  nature  of  elec¬ 
trons  is  in  the  orbital  fimction  and  the  Fermion 
character  of  electrons  is  represented  by  the  anti¬ 
symmetric  property  of  the  determinant.  When  we 
apply  the  path  integral  method  directly  to  the 
electron  field  described  by  the  anticommuting  field 
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operator,  we  must  deal  with  the  corresponding 
anticommutable  c-numbers,  the  Grassmann  num¬ 
bers.  However,  the  properties  of  the  Grassmann 
numbers  are,  in  general,  not  so  well  known  [2,  3], 
so  that,  in  the  field  of  quantum  chemistry,  the  path 
integral  method  has  not  been  widely  applied  to 
the  many-electron  problems  for  a  long  time. 

Recently,  the  path  integral  method  has  been 
applied  to  the  Huckel  model  [4]  and  the  many- 
electron  problems  [5].  We  have  also  developed  the 
path  integral  method  without  using  the  Grass¬ 
mann  number  to  estimate  the  electron  correlation 
in  molecules  [6].  In  that  work,  we  have  derived  the 
new  closure  relation  in  terms  of  the  linear  combi¬ 
nation  of  atomic  orbital  (LCAO)  coefficients  of  the 
molecular  orbital  and  have  constructed  a  path  in¬ 
tegral  form  of  some  physical  quantities  using  the 
LCAO  closure  relation.  In  this  treatment,  there 
appears  no  Grassmann  numbers  corresponding  to 
the  electron  field  operator  but  the  c-numbers  cor¬ 
responding  to  LCAO  coefficients. 

The  essence  of  the  path  integral  is  a  sum  of 
histories,  and  the  path  integral  is  usually  ex¬ 
pressed  as  a  multidimensional  integral  with  a  huge 
number  of  variables.  Therefore,  we  cannot  directly 
carry  out  this  integration  by  ordinary  ways  and 
will  be  forced  to  use  statistical  methods,  such  as 
the  Monte  Carlo  method.  The  ordinary  quantum 
Monte  Carlo  method  is  a  very  powerful  tool  in 
order  to  estimate  thermal  and  quantal  fluctuations 
of  interacting  quantum  spin  systems  and  strongly 
correlated  electron  systems  [7].  This  method  is 
based  on  the  generalized  Trotter  formula,  and  the 
quantum  fluctuation  is  expressed  in  the  ordinary 
path  integral  form.  Therefore,  we  can  apply  the 
Monte  Carlo  method  to  evaluate  the  path  integral, 
though  the  path  summation  scheme  needs  a  large 
amount  of  machine  time  even  in  high-speed  com¬ 
puters. 

In  this  study,  we  present  the  new  closure  rela¬ 
tions  which  could  be,  in  principle,  derived  from  an 
arbitrary  complete  set  of  state  vectors,  and  apply 
the  closure  relation  to  construct  the  partition  func¬ 
tion  in  the  path  integral  form.  Here,  we  use  the 
closure  relation  corresponding  to  the  multiconfigu- 
rational  configuration  interaction  (Cl)  state  and 
show  that  it  is  very  useful  in  the  path  integral 
method  in  both  theoretical  treatments  and  practi¬ 
cal  calculations.  To  reduce  a  huge  amount  of  inte¬ 
gral  variables,  we  introduce  the  static  approxima¬ 
tion  and  discuss  the  physical  meaning  of  the 
approximation.  To  evaluate  the  required  integral, 
we  develop  an  algorithm  by  the  Monte  Carlo 


method  with  the  normalization  condition  of  states. 
Finally,  we  apply  this  method  to  the  hydrogen 
molecule,  evaluate  the  ensemble  average  of  the 
energy,  and  compare  it  with  the  result  obtained  by 
the  ordinary  full  Cl  method. 


Closure  Relation 

In  the  usual  path  integral  method,  a  transition 
amplitude  <x^,  t->  is  expressed  by  a  summa¬ 

tion  of  the  action  along  all  possible  paths  from 
ix/,  to  |x^,  ty).  In  the  coordinate  representation, 
all  position  eigenstates  1  x )  can  appear  as  interme¬ 
diate  states  in  the  path  from  |xy,  to  \xptj). 
Thus,  the  following  closure  relation  is  applied: 

l  =  jd^x\x)(xl  (1) 

Even  if  we  have  a  nonorthonormal  but  complete 
set,  {|r>},  we  could  generally  orthonormalize  them 
by  the  proper  unitary  transformation  as  follows: 

M 

l^>  -  L  (2) 

(k\k')  =  (3) 

where  M  is  a  dimension  of  the  vector  space  {|r>} 

and  is  an  element  of  an  unitary  matrix.  Here¬ 
after,  we  develop  our  theory  in  terms  of  the  or¬ 
thonormal  set,  {|A:)},  for  mathematical  convenience. 

The  state  \k}  could  be  either  the  single-particle 
state  or  the  many-electron  configurational  one. 
However,  it  is  the  essential  point  in  our  formula¬ 
tion  that  the  basis  set  satisfies  the  following  clo¬ 
sure  relations: 

M 

1=  L\k)(k\,  (4) 

that  is,  we  assume  that  the  orthonormal  set,  {|/:)}, 
spans  the  Hilbert  space.  We  construct  a  general¬ 
ized  state  vector  by  the  linear  combination  of  the 
above  state  vectors,  as 

M 

ic>  =  i:  (5) 

k=i 

Here,  the  coefficient  is  an  arbitrary  complex 
number.  If  we  choose  the  atomic  orbital  as  |A:), 
then  |C>  corresponds  to  the  molecular  orbital,  and 
if  \k)  is  the  many-electron  configuration,  it  be¬ 
comes  the  Cl  state. 
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Here,  let  us  assume  that  are  variables  and 
change  these  parameters  arbitrarily.  Then  resulting 
1C>  is  transformed  into  an  arbitrary  state  vector.  It 
is  important  to  notice  that  the  space  sparmed  by 
the  state  vector,  1C)  generated  by  changing  Cj. 
arbitrarily  could  include  every  state  in  the  Hilbert 
space,  namely,  the  set  {1C>}  is  an  overcomplete  set. 
Thus,  we  could  construct  the  closure  relation: 


terval,  j8  into  L  infinitesimal  time  slices,  i.e.,  /3  = 
Lt,  and  inserting  the  closure  relation  among  seg¬ 
ments,  then  Z  becomes 

Z  =  lim  i  f  dix[a,C]Z[C*,C],  (10) 

1,^00  ^  J 

where 


M  fir  /  ^  \ 

1  =  f  n  ^%^exp  -  E  IQP  ICXCI.  (6) 

■'  it=l  iTTl  \  t  =  i  j 

The  derivation  of  the  closure  relation  is  given  in 
Appendix  A.  Each  variables  covers  the  whole  com¬ 
plex  plane.  However,  the  Gaussian  weight  func¬ 
tion  effectively  restricts  a  region  of  the  integration. 
By  suitable  variable  transformations,  we  can  de¬ 
rive  another  form  of  the  closure  relation: 

M  dC?  dC,  I  ^  \ 

i  =  icxci. 

■'  Jr=l  iTTt  =  l  / 

(7) 


The  derivation  of  this  equation  from  Eq.  (6)  is 
given  in  Appendix  B.  The  delta  function  ensures 
the  normalization  condition  of  the  state  vectors. 
Thus,  in  this  case,  each  variable  varies  only  on  the 
surface  of  the  unit  sphere  in  the  complex  space. 

Next,  we  restrict  C^,  in  Eq.  (5)  to  be  real.  Then, 
we  could  get  the  real-number  closure  relation  with 
the  real  Gaussian  weight  function, 

^  dC,  I  \ 

l  =  2fn7^exp  -  EQMICXCI.  (8) 

The  real  delta  function  closure  relation  can  be  also 
derived; 


1 


M  \ 

E  Q^-l  ICXCl. 

jt=l  / 

(9) 


These  real  closure  relations  are  derived  in  Ap¬ 
pendix  C. 


diJi[C*,C] 

i  it  dC^(l)dC^il)  Ji 


-nn  ,, 

l=l  k=l  ^TTZ  /  =  1 


nw[c*(/),c(/)]. 


(11) 


Z[C*,C]  =  n<C(/)k^^"/HC(Z  -  1)>,  (12) 

;=i 


and 


H  (13) 

W[C*0),C(0]  is  the  Gaussian  weight  function  or 
the  delta  function  corresponding  to  Eq.  (6)  or  (7), 
respectively,  and  is  the  normalization  factor 
appearing  in  the  denominator  of  the  closure  rela¬ 
tion.  The  trace  operation  in  the  partition  function 
imposes  a  periodic  boundary  condition  upon  lC(/)> 
as 


|C(0)>  =  |C(L)>.  (14) 

The  ensemble  average  of  any  operator  O  can  be 
calculated  as 


<o> 


Tr(e^^^O) 

Tre“^" 

lim  |/ dix[C*,C]Z[C\C] 

1  ^  <C(/)10|C(/-1)>'| 
<C(/)iC(/ -  1)>  / 

x|/dMC*,C]Z[CXC]|  .  (15) 


Path  Integral  Formulation 

Consider  a  partition  function,  Z  =  Tr 
where  Tr  implies  a  trace  operation,  and  ^  = 
GcgT)"’  with  the  Boltzmann  constant  kg  and  a 
temperature  T.  Let  us  divide  the  temperature  in- 


Note  that,  in  the  above  expression,  the  normaliza¬ 
tion  factor  H  has  disappeared.  In  general,  we  could 
express  the  ensemble  average  of  any  physical 
quantity  in  terms  of  the  logarithmic  derivative  of 
the  partition  function  with  respect  to  suitable  pa¬ 
rameters.  For  example,  the  ensemble  average  of 
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the  Hamiltonian,  i.e.,  the  average  energy,  ( H )  is  When  we  adopt  the  delta  function  as  the  weight 
given  by  function,  Eq.  (19)  becomes 


<H>  = 


Trc-^« 

d 


I  r  ^  dCi  dC.  I  M 

<H>3ta,ic  =  /  n  ^  .  5  £  IQI  -  1 


Thus  substituting  Eq.  (10)  into  (16),  we  get 
<H>  =  lim  f  dfji[C*,C]Z[C*,C] 

30  I  J 


Xe-'^<ci«ic><c|H|C>  I  n 


^  rfQ  dc, 


kA  iTTi 


XS  EICJ^-I  e-/3<c|H|C>  _  (22) 


1  ^  <C(/)|H|C(Z  -  1)>  ^ 
<c(/)ic(/ - 1)>  / 

x|jrf;,[C*,C]Z[C%C]|  .  (17) 

It  is  impossible  to  evaluate  Eq.  (17)  by  direct  nu¬ 
merical  integration  because  of  its  huge  amount  of 
integration  variables  generated  by  slicing  (i  and 
the  presence  of  the  two-body  interactions.  There¬ 
fore,  we  should  approximate  the  above  integral  or 
should  adopt  other  integration  methods  such  as 
the  Monte  Carlo  method. 

Now,  we  introduce  the  static  approximation, 
where  all  the  integral  variables  are  independent  of 
the  temperature  (the  imaginary  time)  as, 

{C,*,C,}  =  {C?(/),C,(/)}.  (18) 

Therefore,  we  have  an  approximate  energy; 


The  approximation  in  Eq.  (21)  seems  to  be  very 
rough;  however,  in  the  zero-temperature  limit  we 
can  give  Eq.  (22)  a  positive  meaning  as  shown  in 
the  following. 

Now,  the  set  {|A:)}  is  complete,  so  we  could,  in 
principle,  construct  the  eigenstates  of  the  Hamilto¬ 
nian  as 


(23) 

1! 

n 

(24) 

where  £„  is  the  nth  energy  eigenvalue.  Thus,  for 
the  numerator  of  Eq.  (22),  we  have  an  inequality  as 


.  dCt  dC,  I  M 


X  (25) 


<H>3tatic  =  lim  I  /"  d|JL^^^^AC*,C] 

L->oc 


XZ3.a.ic[C*,C] 


<C|H|0 

(CIO 


x(/ ^Astatic[C*,C]Z,,,,,[C*,C]|  \ 


where  the  measure  of  integral  and  the  static  Boltz¬ 
mann  factor  are  given  by 

M 

rfA.taHc[CNC]  =  n-  '  .  'w[C*,C],  (20) 

k=l  2771 


Here,  the  equality  holds  when  (3  is  infinity,  i.e.,  in 
the  case  of  the  zero-temperature  limit,  where  only 
the  ground  state  contributes  to  the  integral.  We 
also  have  another  inequality  for  the  denominator 
of  Eq.  (22)  as, 

M  Hr  I  M  \ 

/  n  -^—^5  £  |C,P  -  1 

J  iTTt  I 

(26) 

Therefore,  we  obtain  a  required  result  as, 

lim  <  H  >s,atic  =  £o  ■  (27) 


^staticfC  /C]  =  ((C|e  For  the  finite  temperature  calculations,  we  must 

j_  I  <C|H|C>  \  use  the  exact  expression,  Eq.  (17).  The  approxima- 

«  <C|C>  exp  /(2|r)  r  expression,  Eq.  (22),  works  practically  for  the 

'  I  /  /  zero-temperature  calculations. 
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Monte  Carlo  Method 

An  estimation  of  Eq.  (17)  requires  the  numerical 
calculation  of  two  multidimensional  integrals  over 
M  X  L  variables  with  large  enough  L,  but  we 
cannot  directly  carry  out  this  integration  by  ordi¬ 
nary  ways.  Therefore  we  apply  the  Monte  Carlo 
method  to  do  it. 

We  have  two  kinds  of  closure  relations,  Eqs.  (6) 
and  (7)  with  the  weight  functions  of  the  Gauss-type 
function  and  the  delta  fimction,  respectively; 


exp|-  D  ICj^j, 

(28) 

/  M  \ 

B  EIC/-1  . 

\k=l  1 

(29) 

2.  Choose  randomly  k,  k'  (k  ¥=  k'),  and  S  (0  <  6 
<  Itt). 

3.  Generate  new  variables  by  the  following  or¬ 
thogonal  transformation  of  the  selected  vari¬ 
ables: 


c^a  + 1) 

cpii  +  1) 


COS  6 

—  sin  6 

'c*ii  +  iy 

sin  6 

cos  0 

CpO  +  1)'' 

' 

) 

cos  6 

—  sin  6 

Cj.(/  +  1) 

sin  0 

cos  0 

C;t.(/+1)  ■ 

,  (31) 


.  (32) 


For  the  former,  we  must  generate  random  num¬ 
bers  distributed  from  -oo  to  <»,  and  the  physical 
meaning  of  the  amplitude  is  not  so  clear.  On  the 
other  hand,  the  distribution  of  random  numbers  in 
Eq.  (29)  is  restricted  on  the  surface  of  the  unit 
sphere.  Thus  an  elegant  algorithm  enables  us  to 
generate  them  under  the  restriction  of  the  delta 
function,  which  ensures  the  normalization  condi¬ 
tion  of  the  state.  Therefore,  we  adopt  the  delta 
function  weight  in  the  practical  calculation. 

The  algorithm  of  generating  random  numbers 
with  the  constraint  of  the  delta  function,  which 
generate  the  set  of  the  next  generation,  {Cj^il  +  1)} 
from  the  original  set,  {C^(/)},  is  as  follows: 

1.  Assume  that  [QQ)}  (1  <  /  <  L)  is  normalized 
and  consider  it  as  the  set  of  variables  of  the 
next  generation,  [Cj^Q  +  1)}: 

{C,(l  +  l))^{C,{l)}.  (30) 


Resulting  {€,^(1  +  1)}  is  also  normalized. 

4.  Repeat  procedures  2  and  3  a  suitable  number 
of  times  and  finally  obtain  the  orthonormal 
set  of  the  next  generation,  {C^^d  +1)}. 

This  procedure  generates  the  normalized  variables 
of  the  further  generations  in  succession. 


Numerical  Examples — Hydrogen 
Molecule 

We  calculate  here  the  average  energy,  <H>,  of 
the  hydrogen  molecule  as  numerical  examples.  We 
do  not  intend  to  reproduce  the  experimental  re¬ 
sults  but  concentrate  on  how  our  method  works  by 
comparing  our  results  with  ones  obtained  by  the 
ordinary  full  Cl  method. 

We  carry  out  numerical  calculations  under  the 
static  approximation  using  Eq.  (22),  because  an 


TABLE  I 


Energy  of  hydrogen  molecule  (a.u.):  temperature  vs.  energy. 

Temperature,  p  ^ 

Energy  (STO-3G) 

Energy  (4-31 G) 

0.1 

1 

10 

100 

(HF  approx.) 

(Full  Cl) 

-1.8211376557  ±  0.0000002212 
-1.8211396153  ±  0.0000001303 
-1.8211250591  ±  0.0000045046 
-1.8210604210  ±  0.0000685900 
-1.7991487410 
-1.8211407332 

-1.8668794694  ±  0.0000000519 
-1.8666971526  ±  0.0000021232 
-1.8667716435  ±  0.0000231532 
- 1 .8642273943  ±  0.0002478408 
-1.8105620770 
- 1 .8670253484 
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estimation  of  Eq.  (17)  requires  a  large  amount  of 
machine  time  in  the  Monte  Carlo  simulation  due 
to  two  multidimensional  integrals  over  M  X  L 
variables  with  large  enough  L.  Moreover,  we  are 
interested  in  the  electronic  structure  of  molecules 
and  have  already  a  complete  set  of  orthonormal 
state  vectors  satisfying  Eq.  (23),  that  is,  the  Cl  basis 
set.  Therefore,  even  if  we  restrict  ourselves  to  the 
static  approximation,  we  could  evaluate  physical 
quantities  in  the  zero-temperature  limit. 

The  calculation  is  carried  out  with  the  STO-3G 
and  4-31G  basis  sets  at  the  internuclear  distance  of 
1.46  a.u.  The  Monte  Carlo  integration  was  done 
changing  the  temperature,  from  0.1  to  100  K. 
The  number  of  samplings  is  100  for  each  calcula¬ 
tions.  Calculations  are  executed  on  the  S-4/1000 
(FUJITSU  Co.  Ltd.).  The  temperature  dependence 
of  the  energy  is  presented  in  Table  I.  Note  that  our 
calculation  is  based  on  Eq.  (22)  and  the  calculated 
energy  tends  to  the  full  Cl  ground-state  energy. 
We  might  expect  that  results  for  lower  tem¬ 
perature  are  more  accurate  than  that  for  higher 
ones.  However,  the  calculated  energy  does  not 
necessarily  approach  to  Ef '  monotonously  due  to 
the  sampling  procedure  we  adopted  in  the  Monte 
Carlo  integration.  As  the  temperature  is  lower,  the 
energy  is  closer  to  £,5-''  on  the  calculations  using 
the  4-31G  basis  set.  While,  the  results  of  the  STO-3G 
basis  set  shows  that  the  energy  for  1  K  is  closer  to 
Ef  than  that  for  0.1  K.  In  spite  of  these  results,  we 
could  expect  that  the  energy  for  low  temperature 
is  close  to  £f'. 


Concluding  Remarks 

Fermion  path  integrals  are  usually  based  on  the 
closure  relation  for  the  Fermion  coherent  state 
which  is  expressed  in  terms  of  the  Grassmann 
numbers  [8],  auxiliary  fields  [9],  or  the  bond  order 
[5].  Since  the  standard  computation  of  the  quan¬ 
tum  chemistry  is  based  upon  the  LCAO-MO-CI 
method,  it  is  very  convenient  to  express  the  path 
integrals  in  terms  of  the  related  LCAO  coefficients, 
which  enable  us  to  utilize  conventional  techniques 
and  ideas  of  the  molecular  calculation. 

We  have  derived  the  general  closure  relation 
from  the  complete  set  of  state  vectors  and  have 
shown  that  it  is  possible  to  carry  out  the  ab  initio 
calculation  by  using  the  complete  set  of  the  many- 
electron  configurational  state  in  the  zero  tempera¬ 


ture.  We  have  restricted  ourselves  to  the  zero-tem¬ 
perature  calculation  in  this  study,  but  we  can,  in 
principle,  evaluate  physical  quantities  for  finite 
temperatures.  Moreover,  as  we  could  formulate 
the  finite  temperature  Green's  function  in  the  path 
integral  form,  we  could  calculate  dynamical  quan¬ 
tities  in  the  finite  temperature.  Static  and  dynamic 
quantities  at  finite  temperature  will  be  calculated 
in  separate  studies. 

In  the  path  integral  method,  all  the  independent 
paths  of  particles  must  be  taken  into  account,  but 
"the  classical  path"  is  determined  by  optimizing 
the  action.  If  we  interpret  particle  as  molecule,  this 
image  of  the  classical  path  in  the  path  integral  may 
correspond  to  the  reaction  path.  Therefore,  the 
path  integral  method  of  both  electrons  and  nuclei 
enables  us  to  directly  investigate  the  chemical  re¬ 
action  by  the  ab  initio  calculation.  In  usual  chemi¬ 
cal  reaction  theories  based  on  the  orbital  theory  or 
the  collision  theory,  it  is  very  hard  to  take  simulta¬ 
neously  into  account  the  temperature  and  the 
dynamics.  While,  our  theory  could  treat  the  dy¬ 
namical  processes  of  systems  mentioned  above  and 
includes  explicitly  the  temeprature  which  plays  an 
essential  role  in  the  chemical  reaction.  In  the  fu¬ 
ture,  we  will  discuss  the  chemical  reaction  from  a 
path  integral  point  of  view. 

Now,  the  ab  initio  calculations  of  molecules  are 
usually  carried  out  by  the  LCAO-MO-CI  method. 
The  Cl  method,  however,  requires  a  huge  amount 
of  computer  memory.  On  the  other  hand,  for  the 
zero-temperature  calculations,  the  path  integral 
Monte  Carlo  method  described  in  this  work,  re¬ 
quires  memories  comparable  to  or  less  than  the 
full  Cl  method.  If  we  use  the  closure  relation  in 
terms  of  the  LCAO  coefficients  [6]  instead  of  the 
Cl  relation,  the  memory  required  is  much  less  than 
the  full  Cl  method,  though  we  meet  with  the 
negative  sign  problem  [7].  Therefore,  it  would  be 
possible  to  calculate  large  molecules  by  the  path 
integral  Monte  Carlo  method. 

Since,  in  the  Monte  Carlo  method,  the  comput¬ 
ing  time  is  everything,  we  need  high-speed  com¬ 
puters  rather  than  large-storage  ones.  The  high¬ 
speed  computing  can  be  achieved  by  parallel  com¬ 
puting,  and  the  Monte  Carlo  integration  can  be 
easily  executed  by  parallel  computers.  Thus,  the 
massively  parallel  computers  enable  us  to  calcu¬ 
late  large  molecules  in  realistic  computational 
times.  Therefore,  we  think  that  the  path  integral 
Monte  Carlo  method  is  very  powerful  and  applica¬ 
ble  to  a  large  area  of  chemistry. 
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Appendix  A:  Derivation  of  Closure 
Relation 

Now  we  consider  an  operator  &  in  the  follow¬ 
ing  form: 

1  ^  dCT  dC, 

Yl-^^—^W[C\C]\CXCl  (33) 
k=\  Itti 

where  W[C*,  C]  is  an  unknown  weight  function 
and  yf  the  normalization  factor.  In  order  to  make 
(f  be  the  resolution  of  unity,  we  must  find  a 
suitable  weight  function.  Substituting  Eq.  (5)  into 
(33),  we  have 


1 

./f 


w  dCt  dC^ 

)C=1  Zttz 


W[C*,C] 


M 

In  this  equation,  if  we  could  construct  W[C*,C] 
satisfying  the  following  relation: 

M  dcf  dC, 

f  n-^W[C%C]QC,*  (35) 

J  Itti  ^  " 


where  A  is  a  real  constant,  then  Eq.  (34)  is  reduced 
to  the  closure  relation  of  the  set  {\k)}.  This  means 
that  the  square  terms,  ICj^'s,  with  appropriate 
W[C*,  C]  could  be  integrated  to  give  finite  values, 
and  on  the  other  hand,  the  cross  terms, 
with  W[C*,C]  to  vanish.  Thus,  we  could  easily 
show  that  one  of  the  simplest  forms  for  W[C^^C] 
is  the  Gaussian  function: 


M 

-  E  ic/ 

k=\ 

Therefore,  the  explicit  form  of  a  novel  closure 
relation  is 


W[C*,  C]  =  exp 


Gaussian  weight  function  in  Eq.  (6)  as 


M 

-  E  icy 

k=l 


=  j  dxx  '6|l  — X  '  EICa-I^I^  (38) 

Here  we  have  used  a  fact  that  the  summation  in 
the  exponent  gives  a  positive  value.  While,  in  the 
integrand  of  Eq.  (6),  we  note  the  following  relation: 

|C>  =  X  X  |Cx-'>.  (39) 

Then,  we  could  derive  the  required  relation  as 
follows: 


/  n 

•'  A:=l 


M  dCt  dC, 


lui 


M 


: — exp  -  E  ICaI  |C><C| 


k  =  1 


=  jn'-^fdx 

J  k=i  2m  ^0 

M  \ 

X6|l  -x-i  EICaI" 

k=\ 


e-^\Cx-X(C\ 


,  M  dCt  dCf, 

M 


/  dx 

•'n 


2:iQr-l  ICXCI 


M\j  n 

k  =  l 


k=l 

^  dCt  dCj, 


liri 


M 


5  j:k\  -u\o(a 


k  =  l 


(40) 


Appendix  C:  Real-Number  Closure 
Relation 


M  dC^'  dC  I  ^  \ 

1  =  f  n-f— 

J  2t7Z  \  jt=l  / 

The  proof  is  quite  elementary. 

Appendix  B:  Another  Closure  Relation 

Here,  we  derive  the  delta  function  closure  rela¬ 
tion  from  the  Gaussian  one.  We  can  rewrite  the 


By  following  the  derivation  in  the  Appendix  A, 
we  could  easily  obtain  the  real  Gaussian  weight 
closure  relation  as 

M  dC  I  M  \ 

l  =  2f  ri  ~  E  Ca  |C><C|.  (41) 

J  A=1  VTT  \  A=1  / 

Notice  here  that  the  factor,  2,  appears  in  the  above 
expression.  The  origin  of  the  factor  is  directly  de¬ 
rived  from  the  complex  number  closure  relation. 
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Eq.  (6).  By  separating  a  complex  integral  variable 
into  real  and  imaginary  parts, 

Q  =  +  ^Vk'  (42) 


we  can  rewrite  the  measure  of  integral  as 

dCj,  dx^  dyi^ 

IttI  ypTT  }pjr 


(43) 


transformation  as  in  the  Appendix  B, 


1  =  2r 


M/n 


M 


Lc^-ijicxci. 

k=\ 


(46) 


References 


Thus  nonvanishing  integrals  are  calculated  as 


/ 


dCt  dC^ 


^  -  Cf  Cl. 


277  i 

I 


r  dXu  dVu 

/  =  1,  (44) 

•'  ^|7T  VTT 


dCt  dC, 

liri 


CtC,e 


■etc. 


f  dx^.  dy^.  _  2_„2 
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Thus,  Eq.  (45)  shows  that  the  real  and  the  imagi¬ 
nary  parts  of  the  integral  variable  equally  con¬ 
tribute  to  the  integral.  This  is  the  origin  of  the 
factor. 

Finally,  we  give  the  real  delta  function  closure 
relation.  This  is  obtained  by  the  same  variable 


1.  R.  P.  Feynman  and  A.  R.  Hibbs,  Quantum  Mechanics  and  Path 
Integrals  (McGraw-Hill,  New  York,  1965). 

2.  L.  H.  Ryder,  Quantum  Field  Theory  (Cambridge  University 
Press,  London,  1984). 

3.  F.  A.  Berezin,  Method  in  Second  Quantization  (Academic,  New 
York,  1966). 

4.  H.  Nagao,  K.  Nishikawa  and  S.  Aono,  Chem.  Phys.  Lett.  190 
97  (1992);  Ibid.,  215,  5  (1993). 

5.  H.  Kawabe,  K.  Nishikawa,  and  S.  Aono,  Int.  J.  Quantum 
Chem.  51,  265  (1994). 

6.  H.  Kawabe,  H.  Nagao,  and  K.  Nishikawa,  Int.  J.  Quantum 
Chem.,  to  appear. 

7.  M.  Suzuki,  Ed.,  Quantum  Monte  Carlo  Methods  in  Condensed 
Matter  Physics  (World  Scientific,  Singapore,  1993). 

8.  Y.  Ohnuki  and  T.  Kashiwa,  Prog.  Theor.  Phys.  60,  548  (1978). 

9.  J.  W.  Negele  and  H.  Orland,  Quantum  Many-Particle  Systems 
(Addison  Wesley,  New  York,  1987). 


1230 


QUANTUM  CHEMISTRY  SYMPOSIUM  NO.  30 


Efficient  Numerical  Method  for  Finding 
the  Initial  Response  of  Quantum 
Processes  to  Changes  in  the  Potential 


MARK  J.  HAGMANN 

Department  of  Electrical  and  Computer  Engineering,  Florida  International  University, 

Miami,  Florida  33199 

Received  February  26,  1996;  revised  manuscript  received  July  31,  1996;  accepted  August  2,  1996 


ABSTRACT  _ _ _ _ 

The  time-dependent  Schrodinger  equation  is  solved  for  an  electron  with  potentials  of  the 
form  ffx)  +  f2(r,  t)U(t),  where  LI(0  is  the  unit  step  function.  We  use  a  product 
formulation,  solving  for  F(r,  t)  where  the  wave  fimction  ^(r,  0  =  F(r,  in 

which  is  the  solution  for  f  <  0.  A  simple  implementation  of  the  product 

formulation  that  does  not  use  absorbing  boundary  conditions  and  is  explicit,  without 
using  split  operators,  is  applied  in  two  examples.  The  first  example  pertains  to  resonances 
in  tunneling  with  square  barriers  when  the  barrier  height  varies  sinusoidally  with  time. 
The  initial  response  of  quantum  tunneling  to  oscillations  in  the  barrier  height  shows  a 
buildup  for  oscillations  at  the  resonance  frequency  and  an  off-resonance  response  that 
diminishes  to  approach  the  steady-state  solution  after  several  cycles.  The  second  example 
is  the  initial  response  of  a  hydrogen  atom  to  an  intense  applied  electric  field.  In  both 
examples  the  response  is  delayed,  and  for  tunneling  particles  the  delay  is  approximately 
equal  to  the  semiclassical  value  for  the  duration  of  barrier  traversal,  defined  as  the  time 
for  traversing  the  inverted  barrier.  ©  1996  John  Wiley  &  Sons,  Inc. 


Introduction 

Various  methods  have  been  used  to  solve  the 
time-dependent  Schrodinger  equation  for 
time-varying  potentials.  Floquet  methods  have 
been  used  to  determine  the  steady-state  response 
to  a  potential  varying  periodically  with  time  [1,  2]. 


Transient  solutions  have  been  obtained  using  finite 
differences  with  an  alternating-direction  implicit 
propagation  scheme  [3],  finite  differences  with  a 
split-operator  propagator  [4],  finite  differences  with 
a  fourth-order  predictor-corrector  [5],  a  split-oper¬ 
ator  spectral  technique  with  fast  Fourier  trans¬ 
forms  [6],  finite-element  methods  [7],  and  quadra¬ 
tures  with  an  expansion  in  Vokolov  states  [8].  It  is 
the  objective  of  this  study  to  describe  a  product 
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formulation  for  transient  solutions  that  is  particu¬ 
larly  simple  to  implement  and  to  illustrate  its  use 
with  several  examples. 


Analysis 

We  begin  the  present  analysis  with  the  dipole 
approximation  [9]  of  the  time-dependent  Schrodi- 
nger  equation  for  an  electron  in  a  static  potential 
plus  a  radiation  field, 

d"^  h} 

(1) 


where  V^ir)  is  the  static  potential  and  E(r,  t)  is  the 
electric  field  intensity  of  the  radiation  field.  We 
define  the  function  F(r,  t)  such  that  the  wave  func¬ 
tion  t)  —  F(r,  where 

is  the  solution  without  the  radiation  field.  Substi¬ 
tuting  into  Eq.  (1)  leads  to  the  following: 


SF 

dt 


ih  ih 

--v^f  +  — - Vf  + 
Im  m<b 


ier  •  E(r,  t)F 

li  * 
(2) 


For  problems  with  only  one  spatial  dimension, 
when  the  electric  field  E(x,  0  is  in  the  x  direction, 
Eq.  (2)  may  be  written  as  follows: 

dF  ih  d^F  ih  dF  iexE{x,  t)F 

=  — - r  -j- - -f-  - . 

dt  2m  dx  dx  dx  h 

(3) 


We  set  E(x,  t)  =  E^sm((ot)U(t),  where  Eq  is  a 
constant  and  U(t)  is  the  unit  step  function  (1  for 
f  >  0,  else  0).  The  following  finite-difference  ap¬ 
proximation  of  Eq.  (3)  is  obtained  by  using  the 
midpoint  method  [10]  to  approximate  the  time 
derivative  of  F(x,  t): 


ih  At 


m(Ax) 


ih  At 


-h 


m  A  X 


1 

dx 


2  /  A  f/  A  xeE, 

+ - ^ - sin(a)/At)Lf(/)Fq^^,  (4) 


where  F^  ^  =  F(J  •  Ax,  /  •  At).  The  product  formu¬ 
lation,  which  was  used  for  the  examples  presented 
in  this  study,  consists  of  the  following  three  steps: 
First,  analytical  or  numerical  methods  are  used  to 
solve  the  time-independent  Schrodinger  equation 
to  determine  (I>(x).  Next  Eq.  (4),  or  the  correspond¬ 
ing  equation  for  two-  or  three-dimensional  prob¬ 
lems,  is  solved  by  time  stepping  from  f  =  0  where 
Fq  Q  =  1.  Finally,  the  wave  function  ^(x,  t)  is  de¬ 
termined  from  the  definition  4[^(x,  t)  = 
Fix,  t)^ix)e~^^^^^. 

For  all  of  the  examples  studied  thus  far,  we 
have  found  that  it  is  sufficient  to  limit  the  spatial 
increment  so  that  |A:|  A  x  does  not  exceed  a  value  of 
0.2  at  any  point  in  the  grid,  where  \k\  is  the 
magnitude  of  the  wave  vector.  If  the  radiation 
field  is  not  too  strong  when  compared  with  the 
static  potential,  then  F(x,  t)  varies  more  slowly  in 
space  than  does  ^(x,  f),  and  so  it  is  possible  to  use 
values  of  Ax  with  the  product  formulation  that 
are  somewhat  larger  than  this  recommended  limit. 
Our  numerical  studies  have  also  shown  that  there 
are  six  criteria  which  are  necessary  (but  not  suffi¬ 
cient)  conditions  limiting  the  maximum  size  of  the 
time  step  Af.  The  first  three  criteria  may  be  under¬ 
stood  from  physical  considerations:  (1)  The  ratio 
A  x/ A  t  must  exceed  the  velocities  that  would  be 
anticipated  under  the  conditions  of  the  problem. 
(2)  The  ratio  A  x/ A  t  should  exceed  the  semiclassi- 
cal  value  for  barrier  traversal  [11]  at  all  points 
where  tunneling  occurs.  (3)  It  is  necessary  that 
At  Itt/oj  so  that  the  time  dependence  of  the 
radiation  field  is  sampled  sufficiently.  The  remain¬ 
ing  three  criteria  for  numerical  stability  limit  the 
coefficients  on  the  right-hand  side  of  Eq.  (4).  (4) 
From  the  first  coefficient.  At  <  m( A  xY/h.  (5)  From 
the  second  coefficient.  At  <  m  AxT>/(ErfT>/dx)  for 
all  points  on  the  grid.  (6)  From  the  third  coeffi¬ 
cient,  At  <  h/(2exE^)  for  all  points  on  the  grid. 
While  each  of  these  six  criteria  must  be  satisfied, 
the  fourth  is  especially  pertinent  because  numeri¬ 
cal  solutions  diverge  sharply  at  the  point  where 
this  criterion  is  violated.  The  sixth  criterion  is  only 
relevant  for  extremely  intense  fields.  While  these 
six  criteria  are  not  claimed  to  be  sufficient  condi¬ 
tions,  in  all  of  the  examples  studied  thus  far  we 
have  found  that  when  they  are  satisfied  the  solu¬ 
tion  is  stable  for  a  short  interval  of  time  ( 1-5  fs). 
A  reduction  in  the  size  of  the  time  step  is  generally 
required  for  stability  when  simulating  much  longer 
durations.  We  have  also  solved  Eq.  (3)  using  the 
modified  midpoint  method  with  Richardson's  de- 
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ferred  approach  to  extrapolate  to  zero  step  size 
with  rational  functions  (Bulirsch-Stoer  technique) 
[12].  However,  the  latter  method  was  found  to 
require  considerably  greater  computer  time,  and  it 
does  not  appear  to  provide  any  noticeable  im¬ 
provement  in  accuracy  or  stability  when  all  of  the 
above  criteria  are  satisfied. 

When  solving  directly  for  the  wave  function, 
others  have  used  various  absorbing  boundary  con¬ 
ditions  [13]  and  empirically  adjusted  "windowing" 
and  "gobbling"  [14].  Split-operator  formulations, 
requiring  matrix  methods,  have  been  used  to  im¬ 
prove  the  numerical  stability  [3-6].  The  product 
formulation  is  much  simpler  to  implement.  At 
each  time  step  Eq.  (4)  is  used  to  update  the  values 
of  the  function  F  at  all  interior  points  of  the  grid. 
A  separate  procedure  must  be  used  for  the  points 
at  the  ends  of  the  grid  because  Eq.  (4)  requires 
values  of  the  function  F  at  increments  of  ±Ax 
from  the  point  that  is  updated.  We  have  found 
that  for  a  given  value  of  A  x,  the  numerical  stabil¬ 
ity  is  greatest  if  (1)  linear  extrapolation  is  used 

(F^  =  2F2  j+i  —  ^3,  J+1'  ^N,  J+1  ~ 

Fa,_2,  ;+i)  when  the  increment  is  relatively  small; 
(2)  continuity  is  imposed  (Fj  =  ^2,  ;  +  u  /+i 
=  F^,_i  |+■^)  for  intermediate  values  of  the  incre¬ 
ment;  and  (3)  the  end  points  are  not  updated  when 
the  increment  is  relatively  large.  Solutions  ob¬ 
tained  by  using  Eq.  (4)  in  the  manner  described  are 
not  unconditionally  stable,  but  they  have  been 
shown  to  be  stable  over  the  range  of  durations 
simulated  in  the  examples. 


iXumerical  Example:  Sinusoidal 
Variation  of  a  Square  Barrier 

Biittiker  and  Landauer  [15]  used  analytical 
methods  to  show  that  when  a  time-dependent  po¬ 
tential  is  added  to  a  square  barrier  ("modulation" 
of  the  barrier)  tunneling  particles  absorb  and  emit 
modulation  quanta.  We  have  used  Floquet  meth¬ 
ods  [2]  and  other  procedures  [16]  to  determine  the 
steady-state  solution  when  the  barrier  height  varies 
sinusoidally  with  time.  These  solutions  show  that 
there  are  resonances  with  a  marked  increase  in  the 
transmission.  For  square  barriers,  these  resonances 
occur  when  modulation  quanta  have  sufficient  en¬ 
ergy  to  take  tunneling  particles  above  the  barrier, 
and  the  length  of  the  barrier  is  an  integral  multiple 
of  one-half  the  (now  real)  de  Broglie  wavelength 
[2], 


FIGURE  1 .  Ratio  of  transmission  to  the  static  value  as 
a  function  of  wavelength  for  1 0-eV  electrons  with  an 
1 1-eV  1-nm  square  barrier.  The  radiation  field  has  power 
flux  densities  of  0.2,  0.4,  0.6,  0.8,  and  1 .0  x  10^^  W  /  m®, 
from  bottom  to  top  curve,  respectively. 


Figure  1  shows  the  ratio  of  the  transmission 
coefficient  with  barrier  modulation  to  the  static 
value  as  a  function  of  the  wavelength  for  10-eV 
electrons  with  a  square  barrier  having  a  height  of 
11  eV  and  a  length  of  1  nm.  Floquet  methods  were 
used  as  previously  described  [2]  to  determine  the 
steady-state  solution  of  Eq.  (1)  for  a  radiation  field 
with  E(x,  f)  =  Ej  sm(wf).  Three  resonances  are 
evident  in  this  figure  (A  =  0.905,  0.506,  and 
0.296  ^im),  for  which  the  barrier  length  equals  5, 
§,  and  I  of  the  de  Broglie  wavelength  for  particles 
absorbing  one  quantum.  It  may  be  seen  in  Figure  1 
that  the  increase  in  transmission  caused  by  the 
radiation  field  is  proportional  to  the  power  flux 
density,  which  is  consistent  with  the  mechanism 
being  the  absorption  of  modulation  quanta. 

Figures  2-5  show  the  transient  response  of  10-eV 
electrons  with  an  11-eV  1-nm  square  barrier  to  a 
radiation  field  with  a  power  flux  density  of  10^^ 
W /m^.  Each  calculation  was  made  using  450  points 
on  the  axis,  with  a  prebarrier  length  of  0.5  nm,  the 
1  nm  barrier,  and  a  1.5  nm  postbarrier  length. 
Thus,  A X  =  0.00667  nm  so  that  kAx  =  0.108  and 
fCAx  =  0.0342,  where  k  =  (ImEy^yh  and  K  = 
[2miV  —  E)]^/Vft,  where  V  is  the  height  of  the 
square  barrier.  The  calculations  were  repeated  for 
other  numbers  of  intervals,  and  with  225  points 
(Ax  =  0.0133  nm)  the  maximum  difference  be¬ 
tween  the  probability  density  and  that  for  450 
points  is  0.68%  at  all  grid  locations  for  the  12-fs 
simulated  duration.  However,  these  errors  in- 
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crease  precipitously  for  A  x  >  0.03  nm  at  which 
point  kAx  -  The  calculations  were  made  for 
radiation  at  two  different  wavelengths.  Figures  2 
and  3  are  for  the  dominant  resonance  seen  in 
Figure  1  (A  =  0.905  ^tm),  for  which  the  barrier 


length  is  one-half  of  the  de  Broglie  wavelength. 
Figures  4  and  5  are  for  the  off-resonance  wave¬ 
length  of  1.50  fim,  which  exceeds  the  limit  for 
which  the  absorption  of  a  single  quantum  can  take 
a  tunneling  electron  above  the  barrier  (1.24  /rm). 


FIGURE  2.  Probability  density  for  10-eV  electrons  with  an  11 -eV  1-nm  square  barrier.  For  time  t  >  0  the  barrier  height 
is  modulated  sinusoidally  at  A  =  0.905  fim  by  a  radiation  field  with  a  power  flux  density  of  10^^  W/m^. 


FIGURE  3.  Ratio  of  the  probability  density  to  the  static  value  for  10-eV  electrons  with  an  1 1-eV  1-nm  square  barrier. 
For  time  f  >  0  the  barrier  height  is  modulated  sinusoidally  at  A  =  0.905  um  by  a  radiation  field  with  a  power  flux  density 
oflO'^W/m^.  ^  ^ 
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The  probability  density  is  shown  in  Figures  m(A  giving  a  limit  of  0.384  attoseconds  (as). 

2  and  4,  and  the  ratio  of  the  probability  density  to  Numerical  solutions  converge  for  simulated  dura- 

the  static  value  (FF*)  is  shown  in  Figures  3  and  5.  tions  of  1  fs  or  less  when  Af  =  0.384  as,  and 

For  this  example,  the  most  stringent  of  the  six  diverge  sharply  when  the  time  step  exceeds  this 

criteria  for  the  time  step  is  the  fourth,  that  At  <  value.  To  obtain  stability  for  a  simulated  duration 


3  15 

time,  fs 


X,  nm 

FIGURE  4.  Probability  density  for  10-eV  electrons  with  an  11 -eV  1-nm  square  barrier.  For  time  f  >  0  the  barrier  height 
is  modulated  sinusoidally  at  A  =  1.50  /am  by  a  radiation  field  with  a  power  flux  density  of  10''^  W/m^. 


3  15 

time,  fs 


X,  nm 

FIGURE  5.  Ratio  of  the  probability  density  to  the  static  value  for  10  eV  electrons  with  an  1 1-eV  1-nm  square  barrier. 
For  time  f  >  0,  the  barrier  height  is  modulated  sinusoidally  at  A  =  1 .50  /am  by  a  radiation  field  with  a  power  flux  density 
of  10^2  W/m^. 
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of  12  fs  in  Figures  2-5,  it  was  necessary  to  use 
time  steps  of  0.002  as  with  A  =  0.905  /xm  and  0.01 
as  with  A  =  1.50  /xm.  When  the  calculations  were 
repeated  using  one-tenth  of  these  time  increments, 
all  values  of  probability  density  were  found  to  be 
unchanged  to  within  1  part  in  10^  for  the  full  12  fs. 
In  this  example  the  end  points  of  the  grid  were 
updated  by  imposing  continuity  = 

^2,  /H-U  ^N,J+\  ^  /+i)- 

The  semiclassical  value  of  the  tunneling  time, 
defined  as  the  time  for  traversing  the  inverted 
barrier  [11],  is  =  1.69  fs  for  the  static  square 
barrier  in  this  example.  It  may  be  seen  in  Figures  2 
and  3  that  there  is  no  response  to  the  radiation 
field  until  after  There  is  a  monotonic  increase 
in  the  magnitude  of  the  consecutive  peaks  in  the 
transmission  during  the  12  fs  simulation  to  ap¬ 
proach  the  steady-state  solution  at  resonance.  At 
A  =  0.905  /xm  (Figs.  2  and  3)  the  height  of  the 
modulated  barrier  is  maximum  at  0.75,  3.77,  6.79, 
and  9.81  fs  and  minimum  at  2.26,  5.28,  8.30,  and 
11.32  fs.  Each  decrease  in  the  barrier  height  causes 
an  increase  in  the  transmission,  and  each  increase 
in  the  barrier  height  causes  a  decrease  in  the  trans¬ 
mission.  However,  there  is  a  delay  of  approxi¬ 
mately  T^^/2  between  each  extremum  in  the  bar¬ 
rier  height  and  the  corresponding  extremum  in  the 
probability  density  at  the  end  of  the  barrier. 

At  A  =  1.50  /xm  (Figs.  4  and  5)  the  height  of  the 
modulated  barrier  is  maximum  at  1.25,  6.25,  and 
11.26  fs,  and  minimum  at  3.75  and  8.76  fs.  It  may 
be  seen  in  Figures  4  and  5  that,  as  with  A  = 
0.905  /xm,  there  is  no  response  to  the  radiation 
field  until  after  However,  with  A  1.50  /xm 
extrema  in  the  transmission  are  irregularly  spaced, 
corresponding  approximately  to  wavelengths  from 
1.0  to  1.2  /xm.  Furthermore,  the  response  dimin¬ 
ishes  to  approach  the  steady-state  solution  for  this 
off-resonance  wavelength  after  several  periods  of 
oscillation.  It  is  known  [17]  that  the  steady-state 
response  of  both  classical  and  quantum  systems  to 
a  periodic  perturbation  is  approached  only  for 
durations  t  such  that  cot  »  277,  and  this  may  be 
understood  by  noting  the  breadth  of  the  spectrum 
for  a  sinusoidal  pulse.  We  attribute  the  monotonic 
increase  in  the  magnitude  of  the  consecutive  peaks 
at  resonance  (Figs.  2  and  3)  and  the  decrease  off 
resonance  (Figs.  4  and  5)  to  increased  definition  of 
the  frequency  of  the  radiation  field  with  elapsed 
time.  For  comparison.  Figure  6  shows  the  square  of 
the  magnitude  of  the  Fourier  transform  of  a  sine 
wave  as  a  function  of  the  frequency  for  pulses 
having  durations  of  0.5,  0.8,  1.5,  and  3.0  periods. 


FIGURE  6.  Square  of  the  magnitude  of  the  Fourier 
transform  of  a  sine  wave  as  a  function  of  the  frequency 
for  durations  of  0.5,  0.8,  1.5,  and  3.0  periods. 


rVuiiierical  Example:  Response  of  a 
Hydrogen  Atom  to  a  Laser 

Pulse  compression  for  the  generation  of  ex¬ 
tremely  short  optical  pulses  has  received  con¬ 
siderable  emphasis  in  laser  science.  High-order 
harmonics  have  been  measured  [18]  when  neon  is 
irradiated  with  intense  femtosecond  optical  pulses, 
and  it  appears  this  effect  may  lead  to  subfemtosec¬ 
ond  pulses  [19].  We  have  simulated  the  response 
of  a  hydrogen  atom  to  intense  fields  as  a  prelimi¬ 
nary  to  modeling  these  phenomena.  First,  the  ef¬ 
fect  of  a  static  electric  field  on  the  ground  state  of  a 
hydrogen  atom  was  determined  by  solving  the 
time-independent  Schrodinger  equation  using  sep¬ 
aration  of  variables  in  parabolic  coordinates  [20], 
with  care  to  allow  for  the  three  turning  points  in 
the  solution. 

In  solving  the  time-dependent  Schrodinger 
equation  to  determine  the  transient  response  by  a 
hydrogen  atom,  we  use  cylindrical  coordinates 
with  the  2  axis  parallel  to  the  applied  electric  field. 
The  solution  is  independent  of  the  polar  angle 
because,  due  to  the  orientation  of  the  applied  field, 
the  azimuthal  symmetry  in  the  ground  state  is  not 
altered  by  the  applied  field.  For  the  product  for¬ 
mulation  we  define  the  function  F(r,  z,  0  such 
that  the  wave  function  T^(r,  z,  t)  =  F(r,  z,  t) 
T>(r,  where  T>(r,  is  the  solution 
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without  the  radiation  field,  with 

<D(r,z)  =  — . — -  (5) 

V'^^o 


where  is  the  radius  of  the  first  Bohr  orbit. 
Substituting  into  Eq.  (2)  results  in  the  following 
partial  differential  equation  for  the  function 
F(r,  z,  f): 


<?F 


ifi 

'  d^F 

I 

f  1  2r  ^ 

1 

2  m 

-A - 

dr^ 

F 

o 

1  (N 

+  z^j 

1  dr 

Uq'Jy^  +  Z^ 


fczEi  ,  ^ 

+ - -sin  (otUit)F.  (6) 

n 


The  following  finite-difference  approximation  of 
Eq.  (6)  is  obtained  by  using  the  midpoint  method 
[10]  to  approximate  the  time  derivative  of 
F(r,  z,  f): 
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for  the  function  F  by  time  stepping  from  f  =  0 
where  F,  ^  =  1  in  the  same  manner  as  Eq.  (4)  was 

used  in  the  example  with  a  square  barrier.  How¬ 
ever,  for  this  example  the  stability  is  greatest  when 
the  points  on  the  boundary  of  the  grid  are  updated 
by  means  of  linear  extrapolation. 

Values  of  Ar  =  Az  =  0.024  nm,  and  At  =  1.0  as 
were  used  in  this  set  of  simulations.  Criteria  limit¬ 
ing  the  maximum  size  of  the  time  step  Af  in  Eq. 
(7)  have  been  determined  as  they  were  for  Eq.  (4). 
For  this  example  the  most  stringent  criteria  are 
that  At  <  m{ArY/h  and  Af  <  m(Az)Vft,  which 
are  analogous  to  those  in  the  first  example,  giving 
a  limit  of  5.0  as.  Numerical  solutions  diverge 
sharply  when  the  time  step  exceeds  this  value,  but 
converge  to  within  1  part  in  10^  for  At  =  1.0  as  in 
the  range  of  simulated  durations  for  this  example. 
Figure  7  shows  the  initial  and  steady-state  re¬ 
sponse  of  a  hydrogen  atom  to  a  step  function  pulse 
having  an  amplitude  of  1  V/nm  for  f  >  0.  The 
ratio  of  the  probability  density  to  the  unperturbed 
value  is  shown  at  0.04,  0.08,  0.10,  and  0.12  fs,  and 
in  steady  state  (dashed  curve).  Figure  8  shows  the 
ratio  of  the  probability  density  at  0.12  fs  as  a 
function  of  r  and  z.  It  may  be  seen  in  Figure  7  that 
there  is  a  delay  of  approximately  0.05  fs,  which 
may  be  attributed  to  the  uncertainty  principle, 
before  the  atom  begins  to  respond  to  the  applied 
field.  Other  simulations  made  using  optical  fre¬ 
quencies  show  that  this  delay  is  relatively  insensi¬ 
tive  to  the  wavelength.  These  results  suggest  that 
delays  present  in  the  transient  response  must  be 


where  F;  ^  ^  =  F(  I  •  A  r,  /  ■  A  z,  K  •  A  t).  For  points 
on  the  z  axis  (1  =  0)  the  equation  is  modified  by 
deleting  the  term  approximating  the  first  deriva¬ 
tive  in  respect  to  r.  Equation  (7)  is  used  to  solve 


FIGURE  7.  Ratio  of  the  probability  density  for  the 
electron  in  a  hydrogen  atom  with  an  applied  field  of 
1  V/nm  to  the  unperturbed  value  as  a  function  of  the 
axial  coordinate.  The  dashed  curve  is  the  static  solution, 
and  the  solid  curves  are  for  a  step  function  pulse. 
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FIGURE  8.  Ratio  of  the  probability  density  for  the  electron  in  a  hydrogen  atom  to  the  unperturbed  value  at  0.12  fs  after 
the  start  of  a  step  function  pulse  having  an  amplitude  of  1  V  /  nm. 


considered  when  modeling  the  response  of  atoms 
to  intense  fields. 


Discussion 

The  simple  implementation  of  the  product  for¬ 
mulation  illustrated  in  this  study  does  not  use 
absorbing  boundary  conditions  and  is  explicit, 
without  using  split  operators.  This  implementation 
is  not  unconditionally  stable,  but  the  solutions  are 
stable  over  the  range  of  simulated  durations  used 
in  the  examples. 

The  first  numerical  example,  for  sinusoidal  vari¬ 
ation  of  the  height  of  a  square  barrier,  offers 
promise  as  a  new  approach  to  the  unsolved  prob¬ 
lem  [21]  of  evaluating  the  duration  of  quantum 
tunneling.  A  variety  of  different  theoretical  ap¬ 
proaches  have  been  used  to  examine  tunneling 
times  [21],  but  all  of  these  are  based  on  steady-state 
solutions  with  the  exception  of  transient  solutions 
for  wave  packets  [22].  While  these  transient  solu¬ 
tions  show  a  delay  between  the  arrival  of  the 
incident  wave  packet  and  the  departure  of  the 


transmitted  packet,  they  have  been  criticized  [23] 
because  there  is  no  valid  basis  for  comparing  any 
part  (e.g.,  peak  or  centroid)  of  the  two  different 
packets.  The  interpretation  is  further  complicated 
because  the  distribution  of  energies  in  a  wave 
packet  would  smear  the  effects  of  tunneling  times. 
The  calculations  in  the  first  numerical  example  do 
not  use  such  a  distribution  of  energies.  While  the 
result  is  by  no  means  unanimous  [21],  12  analyti¬ 
cal  methods  result  in  the  semiclassical  expression 
for  the  tunneling  time  [11]  which  is  the  classical 
time  for  traversing  the  inverted  barrier.  The  results 
in  the  first  example  are  consistent  with  the  semi¬ 
classical  expression  for  tunneling  time. 
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ABSTRACT _ _ _ 

An  algorithm  for  calculating  excitation  energies  and  transition  moments  in  the  random- 
phase  approximation  (RPA)  of  the  polarization  propagator  is  presented.  The  algorithm 
includes  direct  solution  of  the  RPA  eigenvalue  problem  and  direct  evaluation  of  products 
of  superoperator  Hamiltonian  matrices  with  excitation  vectors.  Given  sufficient  memory, 
only  one  integral  evaluation  step  per  iteration  is  needed.  Illustrative  calculations  on  the 
excitation  energies  and  oscillator  strengths  of  ethylene  are  presented.  ©  1996  John  Wiley 
&  Sons,  Inc, 


Introduction 

The  random-phase  approximation  (RPA)  of  the 
polarization  propagator  is  a  clearly  defined 
method  for  calculating  excitation  energies  and 
transition  moments  [1,  2].  It  is  a  convenient  point 
of  reference  for  more  ambitious  models  that  con¬ 
sider  electron  correlation  [3].  Efficient  implementa¬ 
tion  of  the  RPA  in  accessible  and  transparent  pro¬ 
grams  can  facilitate  comparisons  with  other  meth¬ 
ods  that  are  in  common  use.  To  this  end,  we 
present  the  following  report  on  algorithms,  codes, 
and  an  illustrative  calculation  on  ethylene. 

RPA  excitation  energies  and  transition  moments 
obtain  from  solutions  of  the  generalized  eigen- 


^To  whom  correspondence  should  be  addressed. 


value  problem: 


(q^lHq) 

(qlHq)  . 
(1) 

Single  excitation  operators  in  the  Hartree-Fock 
orbital  basis  are  included  in  the  space  and  their 
de-excitation  counterparts  are  found  in  q.  The  su¬ 
peroperator  metric,  (r\s),  and  the  action  of  the 
superoperator  Hamiltonian,  H,  are  defined  in  the 
usual  manner  [4]: 

(r|s)  =  <0|r'*'s  -  sr‘^10)  (2) 

Hr  =  rH-  Hr,  (3) 


(q^lq^)  (q^lq) 
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(qlqh  (qlq) 
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(q'lHq^) 

(qiHqh 
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where  H  is  the  second  quantized  Hamiltonian  and 
the  reference  state  |0)  is  the  Hartree-Fock  determi- 
nantal  wave  function. 

More  detailed  discussions  of  the  RPA  have  been 
presented  elsewhere  [2,  3];  only  essential  aspects 
need  be  considered  here.  The  eigenvalue  problem 
presented  in  Eq.  (1)  is  usually  written  as 


1 

0 

z 

A 

B 

Z 

0 

-1 

Y 

B 

A 

Y 

An  equivalent  formulation  takes  the  form  of  a 
nonsymmetric  eigenvalue  equation: 


z 

A 

B 

Z 

Y 

-B 

-A 

Y 

A  Direct  Algorithm 

Let  z, k  (a,  b,  c)  denote  occupied  (virtual) 
Hartree-Fock  orbitals,  while  Greek  indices  stand 
for  basis  fimctions.  X  stands  for  Y  or  Z,  the  and 
q  components,  respectively,  of  the  eigenvectors.  C 
is  the  matrix  of  Hartree-Fock  eigenvectors.  Fi¬ 
nally,  we  let  D  stand  for  matrices  with  the  same 
dimensions  as  the  A  or  B  matrices  of  Eqs.  (4)-(6). 
Primes  indicate  transformation  to  the  primitive 
basis. 

The  core  of  a  direct  JRPA  algorithm  is  evaluation 
of  products  of  the  H  matrix  with  trial  vectors. 
Such  a  contraction  is 


For  every  eigenvector  obtained  from  the  last  equa¬ 
tion,  there  corresponds  another  solution  with  an 
eigenvalue  equal  in  magnitude  but  opposite  in 
sign: 


Y 

A 

B 

Y 

Z 

-B 

~A. 

Z 

Several  so-called  direct  algorithms  have  been 
proposed  [5-11].  The  term  direct  must  be  under¬ 
stood  in  two  senses.  The  first  refers  to  direct  calcu¬ 
lation  of  excitation  energies  and  transition  mo¬ 
ments  without  actual  recourse  to  matrix  elements 
of  H.  Instead,  products  of  the  latter  matrix  with 
trial  vectors  are  assembled  (directly)  and  reduced 
eigenvalue  problems  (or  linear  equations)  are 
solved  [12].  This  approach  has  been  applied  exten¬ 
sively  in  configuration  interaction  calculations.  The 
second  sense  of  the  word  direct  implies  that  elec¬ 
tron  repulsion  integrals  are  not  stored  but  are 
reevaluated  as  needed.  Here,  we  discuss  an  algo¬ 
rithm  that  is  direct  in  both  senses  [13]. 

Direct  solution  of  eigenvalue  problems  and  lin¬ 
ear  equations  with  the  paired  eigenvalue  structure 
characteristic  of  the  RPA  has  been  discussed  re¬ 
cently  [5].  Direct  treatments  of  electron  repulsion 
integrals  for  RPA  response  properties  [6,  7]  and 
excitation  energies  [8,  9]  have  emphasized 
arithmetic  scaling,  where  N  is  the  number  of  basis 
functions.  Two  other  implementations  stress  the 
formal  convenience  of  so-called  one-index-trans¬ 
formed  integrals  arising  from  evaluation  of  the  Hq 
and  Hq^  commutators  [10,  11].  Integral  transfor¬ 
mations  necessarily  scale  as  however.  The 
utility  of  these  intermediates  remains  formal,  rather 
than  computational.  We  therefore  present  an  alter¬ 
native  algorithm  that  retains  scaling. 


(DX),«=  (7) 

ib 

In  terms  of  basis  functions, 

(DX),«  =  £c^,.Q,(DX);..,  (8) 

where 

(Dx):c.=  (9) 

per 

and 

(10) 

jb 


The  contraction  of  Eq.  (9)  requires  generation  of 
primitive  electron  repulsion  integrals  only  and  is 
the  most  demanding  step. 

Spin-projected  formulas  facilitate  separate  con¬ 
sideration  of  singlet  and  triplet  final  states  for  a 
closed-shell  reference  state.  Let  the  A  matrix  in  the 
canonical  orbital  basis  be  decomposed  into  its  ze¬ 
roth-  and  first-order  parts  such  that 


and 


A  =  A®  +  A'). 

(11) 

iglet  case. 

(12) 

1 

CN 

II 

(13) 

=  -2(ia\jb)  +  (ib\ja). 

(14) 

where  electron  repulsion  integrals  are  expressed  in 
Mulliken  notation. 
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An  examination  of  the  permutational  symmetry 
properties  of  ±  B)  shows  how  these  matrices 
are  more  suitable  intermediates  for  direct  algo¬ 
rithms  than  A  and  B.  In  the  canonical  orbital  basis, 

( +  B)iajb  =  -(ij\ab)  +  (ib\ja)  (15) 

and 

( -  B)ia,jb  =  4:(ia\jb)  -  [(ij\ab)  +  (ib\ja)]. 

(16) 


acts  of  ±  B  with  Z  may  be  generated  in  this 
procedure,  consideration  of  the  eigenvalue  prob¬ 
lem  in  the  Tamm-Dancoff  approximation  [TDA  or 
single  excitation  configuration  interaction  (Cl)], 

AZ  =  £Z,  (21) 

is  accomplished  straightforwardly  by  addition  of 
intermediates,  i.e., 

A^^^Z  =  ^(A(i>  -h  B)Z  +  -  B)Z.  (22) 


Analogous  expressions  in  the  primitive  basis  are 
-h  BYfJivpa  =  -(/juplva)  +  (ixalvp)  (17) 

and 

-  BYfXppcr  =  4(  pLv\ pa) 

-[(pplva)  4-  (pia\vp)].  (18) 


An  alternative  approach,  suggested  in  Ref.  [16], 
diagonalizes  the  product 

M  =  (A  -  B)(A  -H  B).  (23) 

Two  integral  evaluation  steps  instead  of  one  are 
necessary  to  perform  the  contractions 

(A  +  B)X  =  F  (24) 


(A^^^  +  By  is  antisymmetric  with  respect  to  permu¬ 
tation  of  indices  p  and  a  and  (A^^^  —  B)'  is  sym¬ 
metric  with  respect  to  the  same  permutation.  The 
same  properties  apply  to  permutations  of  /x  and  v. 
While  X'  is  a  square  matrix  that  is  neither  symmet¬ 
ric  nor  antisymmetric,  contractions  with  (A^^^  -h  B)' 
and  (A^^^  ~  B)'  of  the  type  represented  by  Eq.  (9) 
can  be  done  with  the  antisymmetric  and  symmet¬ 
ric  parts  of  X',  respectively.  Only  lower  triangles  of 
these  parts  of  X'  are  needed.  The  expression  for 
(A^^^  -  By  is  identical  to  the  contribution  to  the 
closed-shell  Fock  matrix  suggested  by  Raffenetti 
[14];  the  expression  for  (A^^^  +  B)'  corresponds  to  a 
second  Raffenetti  combination  used  for  open-shell 
calculations.  For  triplet  final  states,  (A^^^  -h  B)'  re¬ 
mains  the  same,  but  for  (A^^^  —  B)'  one  finds 

(A^^^  -  B)iajb  =  -[(ij\ab)  4-  (ib\ja)].  (19) 

In  the  primitive  basis,  the  matrix  elements  are 

( A^^^  -  BYpii'po- =  -(  pip\va)  -  (  pia\vp),  (20) 

and  thus  correspond  to  the  Raffenetti  format  re¬ 
quired  for  complex  Hartree-Fock  calculations. 

All  summations  pertaining  to  Eq.  (9)  can  be 
accomplished  using  the  Gaussian  94  [15]  routine 
FoFDir,  which  performs  direct  contractions  of  vari¬ 
ous  types  of  Raffenetti  combinations  with  density 
matrices.  These  contractions  can  be  executed  for  a 
set  of  vectors  through  a  single  call  to  FoFDir  per 
iteration,  provided  there  is  enough  memory  to 
store  all  the  necessary  matrices.  Singlet  and  triplet 
transitions  are  treated  separately.  Because  prod- 


and 


(A  -  B)F  =  M.  (25) 

This  method  is  used  in  the  TURBOMOLE  [8,  9] 
RPA  code. 

For  the  present  direct  algorithm,  intermediates 
from  Eq.  (10)  must  be  contracted  with  the 
Raffenetti  combinations  according  to  Eq.  (9).  Maxi¬ 
mum  performance  is  achieved  when  as  many 
contractions  as  possible  (depending  on  memory 
available)  are  done  in  one  call  to  the  appropriate 
routine.  The  vectors  may  pertain  to  different  irre¬ 
ducible  representations.  After  performing  these 
contractions,  construction  of  new  vectors,  and  or- 
thogonalizations,  errors  are  introduced  which  vio¬ 
late  symmetry  restrictions  in  the  Hartree-Fock  or¬ 
bital  basis.  These  small  errors  are  converted  to 
zeros  and  the  new  vectors  are  renormalized.  Solu¬ 
tions  of  the  TDA  are  used  as  initial  guesses  for  the 
Z  components  of  the  RPA  guess  vectors. 

Eigenvalues  and  eigenvectors  are  obtained 
through  a  generalization  of  Davidson's  method  for 
symmetric  eigenvalue  problems  [12].  This  proce¬ 
dure  consists  of  the  following  steps. 


1.  A  set  of  guess  vectors  is  chosen  such  that 


[zt  Y+], 


=  S, 


P‘1’ 


(26) 


For  example,  Z  components  of  the  guess  vec¬ 
tors  may  be  TDA  eigenvectors,  while  Y  com¬ 
ponents  are  zero  vectors.  The  vectors  are 
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then  transformed  to  the  primitive  basis  ac¬ 
cording  to  Eq.  (10). 

2.  Contractions  represented  by  Eq.  (9)  generate 

±  By  X'  in  the  primitive  basis.  These  in¬ 
termediates  are  transformed  back  to  the 
Hartree-Fock  basis  through  Eq.  (8). 
contributions  are  then  added  to  produce  AZ, 
AY,  BZ,  and  BY. 

3.  The  projected  H  matrix  is  assembled  from 
the  previous  intermediates  according  to 

H,,  =  Z;(AZ)u  +  Zj(BY)u  ~  y;(BZ). 

-  y/'(AY)..  (27) 

4.  Eigenvalues,  e,  and  eigenvectors,  W,  of  the 
projected  H  matrix  are  obtained  using  rou¬ 
tines  appropriate  for  small  matrices. 

5.  Eigenvectors,  W,  having  the  largest  overlaps 
with  previous  guesses  or  solutions  are  se¬ 
lected. 

6.  Residual  vectors,  R,  are  built,  such  that 


A 

B 

Wz' 

.  -B 

-A 

Wy_ 

“  e 

Wy 

where  e  is  the  eigenvalue  pertaining  to  W. 

7.  Additional  expansion  vectors,  S,  are  con¬ 
structed  such  that 

Sp  =  Rp/(e  ±  (29) 

where  the  +  sign  is  used  for  the  Y  compo¬ 
nents  of  S  and  the  -  sign  is  used  for  the  Z 
components. 

8.  The  new  vectors  are  orthogonalized  to  the 
old  vectors  and  to  each  other.  In  the  absence 
of  convergence  of  the  eigenvectors  and  eigen¬ 
values,  the  algorithm  returns  to  step  2  with 
an  enlarged  vector  space.  These  iterations 
may  continue  until  a  maximum  number  of 
vectors  is  reached,  whereupon  the  process 
returns  to  step  1. 

9.  After  achieving  convergence,  the  solutions 
are  renormalized  such  that 

Z^Z  -  Y'  Y  =  1.  (30) 

Transition  moments  are  properly  normalized 
in  this  manner. 

Because  this  procedure  solves  a  nonsymmetric 
eigenvalue  problem,  there  is  always  the  danger  of 


obtaining  complex  solutions.  In  the  present  appli¬ 
cations,  where  TDA  guesses  closely  resemble  the 
RPA  solutions,  this  problem  has  not  been  in  evi¬ 
dence.  Iterations  on  all  the  vectors  are  performed 
simultaneously. 


Excitation  Energies  of  Ethylene 

Excitation  energies  of  ethylene  were  calculated 
using  the  algorithm  presented  here.  In  order  to 
make  the  results  comparable  with  those  published 
previously  [17,  18],  we  used  the  geometry  opti¬ 
mized  at  the  MP2(Full)/6-31G*  level.  For  the  RPA 
calculations  themselves,  we  used  the  following 
basis  sets:  6-311(2  +  ,2  -h  )G'^*  from  Ref.  [17],  aug¬ 
mented,  correlation-consistent  valence  double  ^ 
(aug-cc-pVDZ)  [19]  and  two  additional  basis  sets 
derived  from  aug-cc-pVDZ.  The  latter  we  denote 
as  aug2-cc-pVDZ  and  aug3-cc-pVDZ;  these  in¬ 
clude  one  and  two  additional  sets  of  diffuse  s,  p, 
and  d  atomic  orbitals  (AOs),  respectively.  Expo¬ 
nents  of  the  additional  diffuse  functions  were  ob¬ 
tained  by  dividing  those  from  aug-cc-pVDZ  by 
3.32.  The  performance  of  the  algorithm  and  the 
code  is  illustrated  by  the  timings  given  in  Table  I. 
In  each  of  these  calculations,  15  singlet  states  were 
computed  and  the  convergence  threshold  on  the 
norm  of  the  residual  vector  was  set  to  10 For 
the  aug-cc-pVDZ  basis  set,  direct  contractions  were 
done  in  one  pass  and  the  RPA  CPU  time  is  about 
two  times  that  of  the  self-consistent  field  (SCF). 
With  the  larger  basis  sets,  two  passes  were  made 
to  test  the  code,  but  this  choice  was  not  necessary 
to  execute  the  calculations.  The  timings  indicate 
that  RPA  calculations  are  no  less  feasible  than  the 
SCF  step. 

Tables  II-V  compare  TDA  and  RPA  excitation 
energies  and  oscillator  strengths  for  C2H4  with 
reliable  experimental  data.  RPA  results  obtained 
with  the  6-311(2  +  ,2  +  )G*'^  basis  set  do  not  dif¬ 
fer  much  from  TDA  values  for  transitions  assigned 
as  Rydberg  states  (see  entries  1,  2,  4“-12, 14,  and  15 


TABLE  I _ 

SCF  and  RPA  CPU  time  (sec). 


Basis  Set 

Number 
of  AOs 

SCF 

RPA 

Iterations 

aug-cc-pVDZ 

82 

185.6 

378.0 

7 

aug2-cc-pVDZ 

116 

491.6 

1231.7 

5 

aug3-cc-pVDZ 

150 

2161.8 

8036.0 

8 
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TABLE  II _ 

6-31 1  (2  +  ,2  +  )G**  C2H4  excitation  energies  (eV). 


Transition 

TDA 

f 

RPA 

f 

Expt. 

ie3„ 

7.13 

.0913 

7.11 

.0865 

7.11  [20] 

2eig 

7.71 

.0000 

7.70 

.0000 

7.80  [22] 

3B,„’ 

7.74 

.5127 

7.39 

.4152 

7.60  [20] 

4629 

7.86 

.0000 

7.85 

.0000 

6/t, 

8.09 

.0000 

8.07 

.0000 

8.29  [20] 

6  63^ 

8.63 

.0061 

8.63 

.0058 

8.62  [20] 

7  A, 

8.77 

.0000 

8.77 

.0000 

8e3„ 

8.93 

.0108 

8.93 

.0099 

8.90  [22] 

9.09 

.0000 

9.08 

.0000 

10  6, „ 

9.09 

.0544 

9.04 

.0205 

11  63„ 

9.11 

.1003 

9.10 

.0942 

12629 

9.18 

.0000 

9.18 

.0000 

13  6,9 

9.28 

.0000 

9.23 

.0000 

14  629 

9.56 

.0000 

9.56 

.0000 

15  A„ 

9.65 

.0000 

9.65 

.0000 

in  Table  II)  [20,  21].  However, 

a  significant  red 

shift  is 

observed 

for  the 

valence 

state.  A 

smaller  shift  occurs  for  the  valence 
transition  to  the  state  (entry  13  in  Table  II). 

RPA  vertical  excitation  energies  obtained  with 
the  6-311(2  +  ,2  +  )G**  basis  set  agree  very  well 
with  established  experimental  results  for  ^83,,, 

^Bj^,  and  ’B3,,.  Every  transition  is  described 
by  more  than  one  excited  configuration.  D  orbit¬ 
als  are  unimportant  in  describing  transitions 
calculated  with  the  aug-cc-pVDZ  and 
6-311(2  -f  ,2  -I-  )G**  basis  sets,  but  they  can  be- 


TABLE  III _ 

aug-cc-pVDZC2H4  excitation  energies  (eV). 


Transition 

TDA 

f 

RPA 

f 

Expt. 

1 63a 

7.15 

.0930 

7.13 

.0875 

7.11  [20] 

7.75 

.0000 

7.74 

.0000 

7.80  [22] 

3e,„ 

7.73 

.5166 

7.38 

.4082 

7.60  [20] 

4629 

7.92 

.0000 

7.92 

.0000 

5  a/ 

8.80 

.0000 

8.74 

.0000 

8.29  [20] 

6  A„ 

9.00 

.0000 

8.99 

.0000 

7  6,3 

9.21 

.0000 

9.15 

.0000 

3  63U 

9.23 

.0181 

9.24 

.0169 

8.90  [22] 

9  623 

10.28 

.0000 

10.17 

.0000 

10  ej 

10.47 

.0000 

10.45 

.0000 

11639 

10.48 

.0000 

10.47 

.0000 

12623 

10.50 

.0000 

10.48 

.0000 

1363, 

10.52 

.1729 

10.50 

.1711 

14  6, „ 

10.60 

.0800 

10.52 

.0376 

I5e3„ 

10.87 

.1511 

10.83 

.1171 

TABLE  IV _ 

aug2-cc-pVDZ  C2H4  excitation  energies  (eV). 


Transition 

TDA 

f 

RPA 

f 

Expt. 

1 63^ 

7.12 

.0903 

7.10 

.0851 

7.11  [20] 

2  6,3 

7.70 

.0000 

7.69 

.0000 

7.80  [22] 

3  6„ 

7.71 

.4932 

7.37 

.3996 

7.60  [20] 

4629 

7.85 

.0000 

7.84 

.0000 

5a; 

6  63U 

8.08 

.0000 

8.06 

.0000 

8.29  [20] 

8.60 

.0017 

8.59 

.0016 

8.62  [20] 

7  A, 

8.76 

.0000 

8.76 

.0000 

3  63U 

8.88 

.0030 

8.88 

.0046 

8.90  [22] 

9  63^, 

8.98 

.1084 

8.97 

.0987 

10  6,3 

9.04 

.0000 

9.03 

.0000 

116,; 

9.06 

.0483 

9.01 

.0187 

12623 

9.12 

.0000 

9.11 

.0000 

13  6,3 

9.21 

.0000 

9.16 

.0000 

14  62U 

9.29 

.0855 

9.28 

.0774 

15  A, 

9.55 

.0000 

9.55 

.0000 

come  significant  for  some 
basis  sets  are  used. 

states  when  the  other 

The  first  ^^3^^ 

transition  is 

chiefly 

a  7T  ^  3s 

transition. 

The  order  of  the  next  two  states,  ^Bj^,  and  'Bj„, 
which  were  almost  degenerate  in  the  TDA  calcula¬ 
tions,  is  changed  in  RPA  calculations.  The  gap 
between  the  states  widens,  in  agreement  with  ex¬ 
periment.  This  change  is  due  to  the  significant 
lowering  of  the  excitation  energy  in  the  RPA 
calculations.  While  the  ’Bj,,  state  clearly  has 
Rydberg  character  (tt  ->  3py),  the  'Bj„  state  seems 


TABLE  V _ 

aug3-cc-pVDZ  C2H4  excitation  energies  (eV). 


Transition 

TDA 

f 

RPA 

f 

Expt. 

1 63U 

7.12 

.0902 

7.10 

.0851 

7.11  [20] 

2  6,9 

7.70 

.0000 

7.69 

.0000 

7.80  [22] 

3  6,„ 

7.71 

.4929 

7.37 

.3996 

7.60  [20] 

4623 

7.85 

.0000 

7.84 

.0000 

BA, 

8.08 

.0000 

8.06 

.0000 

8.29  [20] 

6  63U 

8.55 

.0023 

8.55 

.0025 

8.62  [20] 

7  63„ 

8.72 

.0586 

8.72 

.0550 

8  62„ 

8.74 

.0334 

8.74 

.0306 

9  A, 

8.76 

.0000 

8.76 

.0000 

10  63^ 

8.87 

.0169 

8.87 

.0150 

8.90  [22] 

11  6,„ 

9.00 

.0360 

8.97 

.0147 

12  6,3 

9.01 

.0000 

9.00 

.0000 

13623 

9.08 

.0000 

9.08 

.0000 

14  A3' 

9.15 

.0000 

9.14 

.0000 

15  6,3 

9.21 

.0000 

9.16 

.0000 

INTERNATIONAL  JOURNAL  OF  OUANTUM  CHEMISTRY 


1245 


ZAKRZEWSKI,  DOLGOUNITCHEVA,  AND  ORTIZ 


to  be  a  mixture  of  valence  (tt  ^  77*)  and  Rydberg 
(tt  3p^)  transitions,  the  contribution  of  the  latter 
decreasing  in  the  RPA. 

There  is  another  pair  of  neighboring  transitions, 
and  (transitions  9  and  10,  Table  II),  the 
order  of  which  changes  in  the  RPA  calculations. 
These  transitions  are  described  as  77  np^ 
and  77  np^ 

Results  obtained  with  aug-cc-pVDZ  are  less  sat¬ 
isfactory.  The  first  three  transitions  are  in  quite 
good  agreement  with  experiment  (both  TDA  and 
RPA  results),  while  the  higher  TDA  excitation  en¬ 
ergies  are  not  acceptable.  Augmentation  of  the 
aug-cc-pVDZ  basis  set  with  diffuse  functions 
changes  the  results  of  TDA  calculations  dramati¬ 
cally,  bringing  the  excitation  energies  into  close 
agreement  with  both  experimental  and 
6-311(2  +  ,2  +  )G**  data.  TDA  energies  of  the  first 
seven  transitions  calculated  with  aug2-cc-pVDZ 
differ  only  by  0.01-0.03  eV  from  those  calculated 
with  the  6-311(2  -h  ,2  +  )G**  basis  set.  Changes 
between  aug2-cc-pVDZ  and  6-311(2  +  ,2  -h  )G*^' 
calculations  begin  with  the  ninth  final  state,  which 
is  in  the  latter  basis.  The  energy  of 

transition  is  somewhat  lower  with  aug2-cc-pVDZ. 
A  greater  lowering  occurs  for  ^B^^^  [state  11  in 
6-311(2  +  ,2  -h  )G*'^  at  9.11  eV],  the  energy  of 
which  approaches  that  of  the  preceding  ^B^^^  state 
in  Table  IV. 

The  ^B2^  and  transitions  (12  and  13  in 
Tables  II  and  IV)  have  lower  energies  with  the 
larger  basis.  The  most  significant  feature  of  this 
range  of  the  theoretical  spectrum  is  that  a  transi¬ 
tion  of  different  symmetry  (9.29  eV,  ^B2,,)  appears 
before  which,  in  its  turn,  occurs  at  lower 
energy  (9.55  vs.  9.65  eV),  thus  pushing  the  ^B2g 
transition  out  of  the  calculated  15  states.  CB2g 
probably  would  be  the  next  one,  had  the  number 
of  computed  states  been  larger.) 

Continued  augmentation  of  the  aug-cc-pVDZ 
basis  set  with  additional  diffuse  functions  has  no 
influence  on  the  energies  of  the  first  5  transitions 
(Table  V).  Both  TDA  and  RPA  values  coincide 
with  those  of  aug2-cc-pVDZ  calculations.  The  en¬ 
ergy  of  the  sixth  Cb^J  transition  is  0.05  eV  lower. 
Transitions  ^B3„  and  ^B2,,  occur  at  8.72  and  8.74  eV, 
while  the  energy  (8.76  eV)  remains  the  same 
(see  Tables  II,  IV,  and  V).  The  order  of  transitions 
at  higher  energies  is  changed  again,  however.  A 
new  A^  transition  appears  at  9.14  eV,  leaving  the 
B2,  (9.29  in  aug2-cc-pVDZ,  Table  IV)  and  (9.55 
in  aug2-cc-pVDZ,  Table  IV)  states  beyond  the 
limit  of  15  calculated  transitions.  Unlike  the 


6-311(2  -f-  ,2  -h  )G**  results,  pertinent  molecular 
orbitals  obtained  with  aug2-cc-pVDZ  and  aug3- 
cc-pVDZ  do  contain  appreciable  contributions  from 
d  orbitals.  Only  diffuse  d  functions  with  exponents 
near  0.045  and  0.014  have  nonnegligible  coeffi¬ 
cients  in  those  virtual  molecular  orbitals  to  which 
transitions  occur.  The  ^B2„  transition,  which  is  ab¬ 
sent  in  calculations  with  two  smaller  basis  sets,  is 
described  as  an  excitation  to  virtual  orbitals  with 
pure  d  character. 

Inclusion  of  additional  diffuse  functions  leads  to 
the  appearance  of  new  transitions  at  lower  ener¬ 
gies,  the  increase  in  the  number  of  configurations 
describing  a  given  transition,  and  lower  excitation 
energies  for  many  transitions.  The  first  six  singlet 
states  are  treated  consistently  in  Tables  II,  IV,  and 
V  and  are  not  likely  to  be  affected  by  additional 
diffuse  functions. 


Conclusions 

An  algorithm  that  is  direct  in  both  senses  dis¬ 
cussed  in  the  Introduction  has  been  presented.  It 
retains  arithmetic  scaling  and  is  comparable  in 
difficulty  with  direct  SCF  methods,  therefore.  In¬ 
termediates  dependent  on  electron  repulsion  inte¬ 
grals  take  advantage  of  efficient  techniques  for  the 
evaluation  of  Raffenetti  lists.  Just  one  call  to  the 
corresponding  routines  for  performing  direct  con¬ 
tractions  with  density  matrices  occurs  in  each  iter¬ 
ation.  Only  the  lower  triangles  of  the  symmetric 
and  antisymmetric  components  of  the  X'  interme¬ 
diates  in  the  primitive  basis  are  needed.  Consistent 
assignments  of  the  ethylene  excitation  spectrum 
have  been  made. 
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ABSTRACT _ - 

Avoided  crossing  diagram  parameters  for  the  radical  exchange  reaction  and  the  concerted 
exchange  of  two  and  three  bonds  are  computed  by  using  the  approximated  valence  bond 
method,  which  is  a  nonorthogonal  configuration  interaction  (Cl)  semiempirical  method 
among  the  valence  bond  configuration  functions.  Here,  each  valence  bond  configuration 
function  is  a  spin-adapted  combination  of  Slater  determinants  constructed  from  the 
Heitler-London  or  Coulson-Fischer  hybrid  orbitals.  Atomic  orbitals  integrals  are 
evaluated  using  semiempirical  philosophy,  and  these  provide  considerable  saving  of 
computer  time  compared  with  the  most  standard  ab  initio  multistructure  valence  bond 
methods.  The  results  indicate  that  the  approximate  valence  bond  method  is  capable  of 
yielding  reasonable  results  for  the  avoided  crossing  diagram  parameters.  These  results 
also  indicate  that  the  diagram  gap  (G)  is  the  decisive  factor  for  the  stability  of  symmetric 
clusters,  X„,  although  no  clear  correlation  between  the  gap  G  and  the  geometric 
distortion  is  found  for  different  values  of  n.  ©  1996  John  Wiley  &  Sons,  Inc. 


Introduction 

Valence  bond  (VB)  calculations  provide  us  with 
useful  information  about  the  structure  and 
physical  properties  of  organic  and  inorganic 
molecules.  Ab  initio  VB  theory  has  been  exten¬ 
sively  developed  in  the  last  years,  representative 
examples  are  the  generalized  VB  method  (GVB)  of 
Goddard  [1],  the  resonating  GVB  method  (RGVB) 


of  Voter  and  Goddard  [2],  the  spin-coupled  VB 
method  (SCVB)  of  Cooper,  Gerratt,  and  Raimondi 

[3] ,  the  spin-free  VB  method  (SFVB)  of  McWeeny 

[4] ,  the  biorthogonal  VB  method  (BVB)  of  Mc¬ 
Douall  [5],  and  the  new  SFVB  method  proposed  by 
Li  and  Wu  [6].  In  spite  of  considerable  advances, 
such  methods  are  currently  slow,  and  their  routine 
application  for  many  systems  of  chemical  interest 
is  still  not  practical.  The  use  of  a  nonorthogonal  set 
of  orbital  remains  the  major  difficulty  of  such 
kinds  of  VB  calculation.  A  few  semiempirical  VB 
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methods  have  also  been  proposed  [7].  In  all  of 
these  semiempirical  methods,  the  zero  differential 
overlap  (ZDO)  approximation  has  been  used  for 
the  integral  evaluation  and  for  the  simplification  of 
the  nonorthogonal  matrix  elements  of  the  Hamilto¬ 
nian.  The  use  of  the  ZDO  approximation  is  appar¬ 
ently  contradictory  with  the  VB  philosophy,  since 
is  well  known  that  the  use  of  an  orthogonal  set  of 
atomic  orbitals  has  been  incapable  of  predicting 
the  chemical  bonding  [8].  In  order  to  overcome  this 
contradiction,  many  of  the  semiempirical  ZDO-VB 
calculations  have  been  implemented  using  the  so- 
called  overlap-enhanced  atomic  orbitals  (OEAO). 
Although  the  use  of  OEAO  produce  a  nonorthogo¬ 
nal  set  of  hybrid  orbitals,  the  contradiction  due  to 
impose  the  ZDO  approximation  remains.  Here  we 
want  to  emphasize  that  an  approximate  valence 
bond  (AVB)  method,  without  the  ZDO  assump¬ 
tion,  is  more  consistent  with  the  spirit  of  the  VB 
theory.  It  is  the  purpose  of  this  communication  to 
outline  one  possible  AVB  formalism  without  the 
ZDO  approximation.  In  these  preliminary  calcula¬ 
tions,  this  method  will  be  used  to  compute  the 
reactivity  parameters  of  curve-crossing  VB  dia¬ 
grams  [9,  10]  for  the  linear  radical  exchange  reac¬ 
tion  [Fig.  (la)]  and  the  concerted  exchange  of  two 
and  three  bonds  shown  in  Figures  1(b)  and  (c).  As 
representative  test  cases,  we  have  selected  three 
different  atomic  systems  representing  one  orbital 
for  one  center,  X  =  H(ls),  Li(2s),  and  Li(2p^). 


Some  of  this  system  has  been  studied  by  Maitre 
and  co-workers  [11]  using  ab  initio  multistructure 
VB  method  to  understand  the  stability  of  small 
atomic  clusters,  a  problem  that  is  closely  related  to 
the  concept  of  aromaticity  [12]. 


Approximate  Valence  Bond  Method 

This  communication  deals  with  an  approximate 
method  for  calculating  VB  wave  function  for  va¬ 
lence  electrons  of  a  molecule.  The  molecular  wave 
function,  (F,  is  expanded  as  a  linear  combination  of 
VB  configuration  function, 

^  =  LCi<t>r  (1) 

i 

Each  VB  configuration  function  can  be  repre¬ 
sented  as  a  single  VB  resonance  structure  or  a 
linear  combination  of  these  and  can  be  written  in 
the  form 

^  •  (2) 

The  spatial  factor,  is  a  product  of 

nonorthogonal  hybrid  orbitals,  where  the  subindex 
K  indicates  a  particular  configuration.  For  a  partic¬ 
ular  configuration,  is^one  of  N-electron  spin 

eigenfunctions  of  and  with  eigenvalue  S(S 
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FIGURE  1.  (a)  Linear  radical  exchange  reaction;  (b)  D4;,-concerted  exchange  of  two-bond  exchange;  (c)  Dg^-concerted 
three-bond  exchange. 
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+  1)  and  M,  respectively.  In  order  to  obtain  a 
chemical  interpretation  of  the  final  wave  function, 
the  Rumer  spin  function  is  used  [13].  Using  the 
properties  of  the  antisymmetrizer,  the  term 
be  expressed  as  a  linear  combina¬ 
tion  of  2^  Slater  determinants,  where  p  is  the 
number  of  covalent  bonds  in  the  configuration  k. 
The  VB  wave  fimction  yields,  after  minimizing  the 
total  energy,  the  secular  matrix  equation  [H][C] 
=  [S][C][  £],  where  the  elements  of  the^  [H]  and 
[S]  matrix  are  defined  by  and 

^Ki  ^  respectively.  The  matrix  elements 

between  VB  structures  are  given  in  terms  of  matrix 
elements  between  nonorthogonal  Slater  determi¬ 
nants. 

In  the  computational  strategy  two  problems 
must  be  addressed:  an  efficient  scheme  for  evalua¬ 
tion  of  matrix  elements  between  nonorthogonal 
Slater  determinants  and  the  optimization  of  the 
orbitals  to  be  used  in  the  wave  function.  These  are 
the  crucial  problems  in  the  implementation  of  the 
VB  theory.  The  matrix  elements  between  VB  struc¬ 
tures  are  computed  by  means  of  the  Lowdin  cofac¬ 
tor  formula  [14],  using  the  factorized  cofactor- 
driven  algorithm  developed  by  Raimondi  and  co¬ 
workers  [15],  which  results  in  important  savings  in 
computer  time.  Let  us  now  consider  the  choice  of 
hybrid  orbitals.  We  distinguish  two  types  of  hy¬ 
brid  orbitals.  First,  the  atomic  Heitler-London  (HL) 
hybrids  orbitals,  which  in  the  case  of  H  and  Li 
atoms  correspond  to  the  atomic  basis.  Second,  the 
Coulson-Fischer  (CF)  hybrid  orbitals  [16].  In  the 
CF  orbitals  the  HL  orbitals  must  be  allowed  to 
deform  and  delocalized  onto  other  nuclear  centers. 
This  delocalization  is  implemented  here  by  using  a 
unique  variational  parameter  that  allows  one  to 
mix  HL  orbitals  in  a  nucleus  different  from  the 
reference  one,  with  a  weight  proportional  to  the 
overlap  integral.  This  scheme  reduces  the  opti¬ 
mization  problem  to  one  parameter.  As  is  shown 
in  the  next  section,  this  scheme  is  essential  for  a 
compact  representation  of  the  curve-crossing  dia¬ 
grams  for  the  reaction  of  Figure  1. 

At  this  point  a  discussion  of  the  semiempirical 
approximation  is  appropriated.  The  task  now  is  to 
discuss  the  manner  in  which  atomic  orbital  inte¬ 
grals  can  be  computed,  and  the  objective  is  that 
the  approximate  integrals  should  reproduce  the 
atomic  integrals  as  accurately  as  possible  in  each 
point  of  the  nuclear  configuration  space.  This 
method  uses  a  basis  set  of  Slater-type  orbitals 
(STO).  Once  the  configuration  fimctions  are  gener¬ 
ated,  the  superposition  integral  and  the  cofactor  of 


the  Lowdin  formula  are  evaluated  without  approx¬ 
imation.  In  a  typical  calculation,  the  evaluation  of 
cofactors  require  about  80%  of  CPU  time.  After 
some  manipulation,  the  expression  for  the  diago¬ 
nal  one-electron  atomic  integral,  becomes 

H,,  =  -  E  (3) 

B^A 

both  and  are  approximated  as  in  the 
CNDO /I  parametrization  of  Pople  and  co-worker 
[17,  18].  This  approximation  can  be  justified  with¬ 
out  any  introduction  of  the  ZDO  approximation. 
By  using  a  modification  of  the  standard  Mulliken 
approximation,  the  one-electron  nondiagonal 
terms,  are  approximated  as 

fiki  =  PkAi  +  exp(  - (4) 

where  and  are  empirical  parameters  de¬ 

rived  from  the  equilibrium  bond  length  and  disso¬ 
ciation  energy  of  diatomic  homonuclear  molecules, 
with  ==  and 

are  the  exponents  of  the  orbitals  k  and  /,  R^/  are 
the  bond  distances  between  the  center  of  orbitals  k 
and  /,  and  is  the  overlap  integral.  This  approxi¬ 
mation  requires  some  comments.  In  the  first  ver¬ 
sion  of  this  method,  we  tried  to  implement  the 
standard  Mulliken  approximation,  proportional  to 
the  overlap  integral.  However,  with  this  approxi¬ 
mation  it  was  impossible  to  obtain  good  prediction 
both  for  the  equilibrium  bond  length  and  dissocia¬ 
tion  energy  for  the  simple  case  of  H2.  The  H2 
molecule  is  very  instructive,  since  with  the  excep¬ 
tion  of  the  nondiagonal  elements,  the  rest  of 
the  integrals  are  evaluated  without  approximation. 
Then,  it  is  clear  in  this  case  that  the  error  in  the 
parametrization  is  due  to  H^,;.  The  exact  form  of 
for  the  H2  molecule  is 

a  -  -a^  -h  2a 

^ ^ - wii  expC-zi;^;), 

(5) 

where  a  is  the  orbital  exponent  of  hydrogen  and 
the  rest  of  the  terms  have  the  same  meaning  as  in 
Eq.  (4).  Equation  (4)  is  a  generalization  of  the 
result  for  the  H2  molecule;  the  use  of  this  equation 
is  perhaps  a  better  approximation  than  other 
semiempirical  formulas  for  All  three-  and 
four-center  integrals  are  equal  to  zero,  and  it  is 
assumed  that  the  two-  and  one-center  integrals 
depend  only  upon  the  nature  of  the  atom  and 
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not  upon  the  nature  of  the  orbitals  involved.  We 
only  have  computed  one  type  of  one-center-two- 
electron  integral  for  each  atom:  ( AAl  A  A),  and  four 
types  of  two-center-two-electron  integral  for  each 
diatomic  pair:  (AA\BB),  (AA\AB),  (AB\BB),  and 
(AB\AB).  Furthermore,  the  requirements  of  space 
invariance  are  fulfilled  by  treating  all  the  integrals 
as  if  they  only  involved  spherically  symmetric 
orbitals. 


Avoided  Crossing  Diagrams 

The  construction  of  avoided  crossing  diagrams 
(ACD)  has  been  discussed  in  great  detail  by  Shaik 
and  Pross  [9,  10],  therefore  only  a  brief  description 
is  presented  here.  ACD  for  the  single-step  chemi¬ 
cal  reaction  R  ^  P  is  shown  in  Figure  2.  This 
diagram  consists  of  two  diahatic  curves  as  a  func¬ 
tion  of  an  abstract  reaction  coordinate,  one  for  a 
reactant  VB  configuration,  £^,  and  other  for  the 
product  VB  configuration,  Ep,  The  avoided  cross¬ 
ing  of  these  two  curves  generate  two  adiabatic 
curves,  denoted  by  dashed  lines.  The  crossing  point 
corresponds  in  a  good  approximation  to  the  transi¬ 
tion  state,  i.e.,  the  point  where  the  formation  of  the 
new  bonds  compensates  for  the  breaking  of  the  old 
ones  [19]. 

Following  the  Shaik-Pross  theory,  the  activa¬ 
tion  energy  can  be  characterized  by  three  quanti- 


FIGURE  2.  Curve-crossing  diagram. 


ties.  The  first  quantity  is  the  vertical  energy  gap  G. 
This  parameter  is  the  energy  required  for  the  elec¬ 
tronic  reorganization  that  converts  the  reactant 
configuration  to  the  product  configuration  at  the 
geometry  of  the  reactant.  The  second  quantity  is 
the  height  of  the  crossing  point  ( A  £^.)  which  can  be 
expressed  as  a  fraction  (/)  of  the  gap  G;  that  is 

A£,  =/G.  (6) 

The  fraction  /  depends  on  the  curvature  of  the 
reactant  and  product  diabatic  curves.  The  third 
quantity  is  the  degree  of  avoided  crossing  B.  This 
is  the  quantum  mechanical  resonance  energy  at  the 
transition  state.  The  reaction  barrier  can  be  ex¬ 
pressed  in  terms  of  these  three  quantities  and 
reads: 

AE*=fG~B.  (7) 

The  intersection  point  of  diabatic  states  corre¬ 
sponds  to  the  symmetric  geometry  of  the  sys¬ 
tems  in  Figure  1.  The  energy  of  the  avoided  curve 
in  this  point,  relative  to  the  minimum  energy  point 
of  the  reactant  and  product  diabatic  curves,  is 
determined  by  two  opposing  factors.  One  is  the 
energy  to  reach  the  cross  point,  which  is  due  to  the 
geometrical  distortion  of  the  reactant  in  order  to 
achieve  the  crossing  point.  This  energy  depends 
both  on  the  electronic  gap  G  and  the  curvature 
factor  /.  The  second  factor  is  the  resonance  interac¬ 
tion.  Equation  (7)  expresses  this  balance  in  a  quan¬ 
titative  form.  Shaik  and  co-workers  [12]  have  pos¬ 
tulated  that  the  variation  in  the  energy  gap  G 
dominates  the  other  factors  of  Eq.  (7).  Thus,  for  a 
large  G,  the  symmetric  geometry  of  the  X„  clus¬ 
ters  is  expected  to  be  unstable  as  in  2  of  Figure  3. 
In  the  case  of  a  small  gap,  the  symmetric  geometry 
is  stable  as  in  3  of  Figure  3.  We  would  like  to  note 
that  explicit  calculations  of  reactivity  parameters. 


FIGURE  3.  Avoided  curve-crossing  diagrams.  The 
resonance  interaction  is  S. 
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Here  e  is  a  unique  variational  parameter  of  the  CF 
orbitals.  If  e  =  0  the  orbitals  of  the  reactant,  Eq. 
(10)  and  the  product,  Eq.  (11)  are  the  same,  and  the 
configuration  function  of  Eqs.  (8)  and  (9)  is  re¬ 
duced  to  the  HL  configuration  of  reactant  (4)  and 
product  (7),  respectively.  In  the  case  of  e  0,  the 
inclusion  of  orbitals  [Eq.  (10)]  in  the  reactant  con¬ 
figuration  function  only  increases  the  ionic  contri¬ 
bution  of  the  reactants,  (5)  and  (6),  in  the  form 

=  (1  -  +  e(a\b)(.5  +  6).  (12) 


FIGURE  4.  Classical  VB  resonance  structures  for  three 
atoms /three  orbitals. 

/,  G,  and  B  for  an  ample  spectrum  of  reactions,  are 
scarse  in  the  literature. 

We  now  discuss  how  to  construct  the  ACD  for 
reactions  of  Figure  1.  Consider  the  reaction  la.  If 
one  atomic  orbital  per  atom  is  used  {a,  b,  c),  then 
eight  classical  resonance  structures  are  obtained 
(Fig.  4).  Three  of  these  structures  correspond  to 
reactant  (4-6),  three  to  product  (7-9),  two 
participate  in  the  resonance  at  intermediate  points 
in  the  reaction  coordinate  (lO-ll).  These  eight 
structures  can  be  included  in  the  ACD  in  a  suc¬ 
cinct  form  using  CF  hybrid  orbitals  described  in 
the  previous  section.  Using  the  Heitler-London 
spin  coupling  of  the  reactant  (4)  and  the  product 
(7),  the  wave  function  for  the  diabatic  configuration 
may  be  constructed  as: 

=  |Xi^X«X«|-|Xj^X«X«|,  (8) 

for  reactant,  and 

cPp  =  I  x;xpx;i  - 1  x;x;x;i,  (9) 

for  the  product.  The  electron  of  the  reactant  VB 
configuration  function  occupy  the  CF  orbitals 

Xf  =  fl  +  e{a\b)b, 

=  b  +  €(a\b}a, 

=  c,  (10) 

and  similar  orbitals  for  the  product  VB  configura¬ 
tion  function; 

x;  = 

X,f  =  b  -h  €(b\c)c, 

X,^  =  c  + e{b\c)b.  (11) 


The  CF  orbitals  of  Eq.  (11)  have  a  similar  effect 
on  the  product  configuration  function.  As  can  be 
seen,  the  weight  of  ionic  structures  depend  on  the 
overlap  integrals  between  the  covalent  bond  of  the 
HL  structure  and  the  variational  parameter  e.  The 
intermediate  structures  (10)  and  (11)  of  Figure  4 
can  be  included,  if  the  central  orbital,  both  in  the 
reactant  and  the  product  configuration  function,  is 
replaced  by 

X^  =  X^  =  b  +  ei(a\b}a  +  <()|c>c).  (13) 

This  scheme  is  closely  related  to  the  VB  symbol¬ 
ism  of  Harcourt  [20],  based  on  the  concept  of 
increase-valence  structures.  A  generalization  of  this 
method  is  used  for  calculation  of  the  reactivity 
parameters  in  the  reactions  of  Figure  1. 


Results  and  Discussion 

A  summary  of  the  parametrization  is  presented 
in  Table  I.  The  results  of  dissociation  energy  and 
geometry  optimization  compare  well  with  the  ex¬ 
perimental  values  for  the  H2  (D^  =  109.5  kcal/mol 
and  r^  =  0.746  A)  and  the  Li2  ( =  24.16  kcal/ mol 
and  r^  =  2.67  A).  On  the  other  hand,  our  results 
are  in  moderate  agreement  with  the  full  Cl  calcula¬ 
tion  of  Shaik  and  Hiberty  [21]  for  these  systems: 
H2  (D,  =  128.1  kcal/mol,  =  384.4  kcal/mol 
and  r,  =  0.735  A),  Li2(s)  (D,  =  11.0  kcal/mol, 
AEs2.  =  25.2  kcal/mol  and  r,  =  2.872  A),  and 
Li2(7r)(D^  =  13.4  kcal/mol,  AEjj  =  47.7  kcal/mol 
and  =  2.193  A).  We  think  that  there  are  many 
refinements  to  make  in  the  model,  however,  our 
main  interest  at  this  moment  is  to  explore  whether 
or  not  this  crude  parametrization  works  well. 

The  reactivity  parameters  for  the  reactions  of 
Figure  1  are  shown  in  Table  II  for  the  HL  (e  =  0) 
and  the  CF  orbitals.  We  also  report  the  symmetric 
geometry  distance,  R„  and  two  reactivity  index: 
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TABLE  I _ 

Summary  of  parametrization.^ 


IP 

re 

De 

X2 

HO 

(eV) 

OE 

p' 

(A) 

(kcal  /  mol) 

(kcal  /  mol) 

H2 

CF 

13.6 

1.20 

-1.68 

0.32 

0.75 

109-4 

338.22 

□2(5) 

HL 

13.6 

1.20 

-1.68 

0.32 

0.74 

105.5 

329.23 

CF 

5.4 

0.65 

-0.60 

1.16 

2.88 

35.5 

77.85 

HL 

5.4 

0.65 

-0.60 

1.16 

2.66 

32.9 

76.11 

□  2(71) 

CF 

3.5 

0.65 

-0.60 

1.16 

1.88 

44.3 

100.82 

HL 

3.5 

0.65 

-0.60 

1.16 

1.80 

39.7 

96.15 

®HO,  type  of  hybrid  orbital;  IP,  ionization  potential;  OE,  orbital  exponent.  For  and  see  text,  equilibrium  bond  distance; 
Dg,  dissociation  energy,  AE57-;  singlet -triplet  excitation  energy. 


(i)  a  distortion  index,  [  -  ry rj,  which  is  the 

relative  lengthening  of  the  diatomic  bond  to 
achieve  the  symmetric  structure,  and  (ii)  a  reso¬ 
nance  index,  [  B/AEJ,  which  is  the  relative  impor¬ 
tance  of  the  quantum  mechanical  resonance  energy 
in  the  stability  of  the  symmetric  X„  clusters. 

The  CF  results  for  the  collinear  reaction  la  are 
in  reasonable  agreement  with  the  VB  calculation  of 
Maitre  et  al.  [11]  for  the  case  of  H  (AE^  =  15.3 
kcal/mol,  G  =  156.6  kcal /mol,  /  =  0.37,  and  B  = 
42.4  kcal/mol)  and  Li  (AE"^  --  -3.8  kcal/mol,  G 
=  22.4  kcal/mol,  /  =  0.13,  and  B  =  6.6  kcal/mol). 
To  our  knowledge,  no  quantitative  calculation  of 
the  reactivity  parameters  (G,  /,  and  B)  have  yet 
been  performed  for  the  rest  of  the  reactions  in 
Figure  1.  The  estimated  activation  energy  and  opti¬ 
mized  geometry  for  the  three-bond  exchange  reac¬ 
tion  are  in  general  higher  than  the  ab  initio  value 
of  Shaik  and  Hiberty  ^[21]:  (AE^  =  109.3 

kcal/mol,  =  0.9886  A),  Li^Cs)  (AE^  =  -h0.04 
kcal/mol,  E,,  ==  3.0717  A)  and  LI/tt)  (AE^  - 
+  27.4  kcal/mol,  R^,  =  2.7718  A).  Comparison  with 
some  ab  initio  calculations  for  the  two-bond  ex¬ 
change  reaction  also  shows  higher  values  for  the 
activation  energy  and  equilibrium  geometry  [22]. 
Considering  the  smallness  of  the  atomic  orbital 
(AO)  basis  set  used  and  the  neglect  of  the  three- 
and  four-center  two-electron  integrals,  it  is  not 
surprising  that  these  estimates  differ  considerably 
from  the  ab  initio  results. 

Two  clear  correlations  emerge  from  Table  II. 
The  first  is  the  strong  relationship  between  the 
energy  gap  G,  the  energy  of  the  crossing  point 
AE^,,  and  the  activation  energy  AE^.  This  result 
shows  that  the  gap  G  is  the  factor  that  governs  the 
stability  of  the  symmetric  structures;  this  relation¬ 
ship  has  already  been  derived  within  VB  approach 


by  Maitre  et  al.  [11]  and  Shaik  et  al.  [12].  The 
second  is  the  relationship  between  the  curvature 
factor  /  and  the  geometric  index  Ar/r.  Finally,  no 
clear  correlation  with  the  resonance  index  is  found, 
although  it  is  apparent  that  the  quantum  mechanical 
resonance  energy  is  underestimated  in  this  calcula¬ 
tion. 

The  results  given  in  Table  II  show  that  G  is 
proportional  to  the  number  of  bonds  broken.  Then 
for  a  given  atom,  X,  the  value  of  the  gap  increase 
in  the  order,  >  X3.  This  is  the  same  order 

for  A  E^  and  A  E* ,  Also,  the  energy  gap  increases 
in  the  order:  H  >  LKtt)  >  Li(s).  As  is  shown  in 
Table  I,  this  is  the  same  order  for  the  dissociation 
energy  and  the  singlet- triplet  excitation  energy  of 
the  X2  bond.  This  result  agrees  with  the  analysis  of 
Maitre  et  al.  [11];  they  have  found  that  G  follows 
the  relation 

G  =  (|)HA£sr,  (14) 

where  n  is  the  number  of  bonds  broken  in  the 
process.  The  previous  equation  works  relatively 
well  both  for  HL  and  CF  orbitals.  In  general,  the 
deviation  of  this  equation  is  greater  for  H  than  for 
Li  due  to  the  neglect  of  the  AO  overlap  in  the 
derivation  of  Eq.  (14). 

Table  II  shows  a  strong  correlation  between  the 
curvature  factor  /  and  the  geometric  index.  These 
factors  increase  in  the  order  X4  >  X^  >  X3.  The 
value  of  /  reveals  a  parabolic  curvature  of  the 
diabatic  curves  for  the  case  of  X3  (/  <  0.5),  semi- 
linear  for  X^  (/  0.5),  and  exponential  for  X4 

(/  >  0.5).  The  tendency  of  the  symmetric  X4  clus¬ 
ter  for  higher  geometric  distortion  is  well  docu¬ 
mented  in  many  four-member  rings.  Therefore,  no 
trivial  correlation  exists  between  the  geometric 
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ABSTRACT  _ _ _ _ 

In  carrying  out  our  plan  for  doing  multic6nt6r  molecular  integrals  over  Slater- type 
orbitals,  it  is  necessary  to  evaluate  the  Lowdin  a-f unction  over  a  grid  from  the  origin  of 
the  coordinate  system  to  the  displacement  distance  of  the  center  of  the  orbital.  A 
previous  article  obtained  excellent  results  by  expanding  the  exponentials  in  the  a-function, 
for  both  interior  and  exterior  regions.  However,  if  the  displacement  distance  multiplied 
by  the  screening  constant,  i.e.,  the  parameter,  is  larger  than  16,  we  suggest  that  it  may 
be  more  efficient  in  time  and  storage  if  we  use  the  closed  formula  for  the  a-function  for 
values  of  the  radial  distance  r  greater  than  8.  This  remarkable  rule  of  thumb  was  tested 
for  a  variety  of  orbitals  up  to  =  64  and  one  to  =  128.  Also,  in  the  exterior  region, 
the  formula  may  always  be  used  if  >  16.  This  strategy  necessitates  using  the  formula 
in  quadruple  precision  arithmetic.  ©  1996  John  Wiley  &  Sons,  Inc. 


Introduction 

We  continue  to  fill  in  the  details  that  will 
make  the  Lowdin  a-function  method  capa¬ 
ble  of  dealing  with  all  the  kinds  of  Slater-type 
orbitals  (STOs)  needed  for  problems  in  quantum 
chemistry.  In  a  companion  article  [1],  the  case  of 
small  values  for  i^a),  namely,  <  16,  where  ^  is 
the  screening  constant  of  an  orbital  and  a  is  its 
displacement  from  the  origin  of  the  working  coor¬ 
dinate  system,  was  examined.  The  exponentials  in 
the  a-f  unctions  were  expanded  and  the  number  of 
terms  needed  for  acceptable  accuracy  were  noted 
for  various  orbital  parameters.  In  this  article,  to 


evaluate  an  a-function  over  a  grid  when  r  <  a,  we 
show  that  it  is  expedient  and  efficient  when  ^a  > 
16  to  deal  with  the  closed  formula  for  the  a-func¬ 
tion.  Of  course,  in  the  region  of  the  orbital  when 
r  <  a,  we  are  compelled  to  again  use  expanded 
exponentials.  In  Table  I,  we  tabulate  this  crossover 
point  for  a  variety  of  orbitals  and  their  harmonics. 
We  note  that  the  evaluation  of  our  closed  formulas 
always  results  in  a  loss  of  significant  figures  due  to 
cancellation  errors.  Our  aim  was  to  work  with 
numbers  accurate  to  the  16  decimal  digits  (double 
precision  on  our  Silicon  Graphics  Power  Chal¬ 
lenge).  To  achieve  this,  we  started  with  32  digit 
numbers  (quadruple  precision),  and  to  be  conser¬ 
vative,  we  determined  the  value  of  r  when  12 
digits  are  sacrificed.  Below  this  value,  the  a-func- 
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TABLE  I _ _ 

Switch  over  value  of  r  when  the  closed  formula  Is  valid;  r  <  a;  (^a)  =  16. 


Orbitals  /  r 


100  0  1 

4  1 

8  2 

12  4 

200  0  1 

4  1 

8  2 

12  5 

210  0  1 

4  1 

8  2 

12  4 

211  1  1 

4  1 

8  2 

12  4 

300  0  1 

4  1 

8  2 

12  5 

310  0  1 

4  1 

8  2 

12  5 

311  1  1 

4  1 

8  2 

12  5 

320  0  1 

4  1 

8  2 

12  5 

321  1  1 

4  1 

8  2 

12  5 

322  1  1 

4  1 

8  2 

12  5 

400  0  1 

4  1 

8  2 

12  6 

410  0  1 

4  1 

8  2 

12  6 


Orbitals  /  r 


411  1  1 

4  1 

8  2 

12  5 

420  0  1 

4  1 

8  2 

12  5 

421  1  1 

4  1 

8  2 

12  6 

422  2  1 

4  1 

8  2 

12  5 

430  0  1 

4  1 

8  2 

12  6 

431  1  1 

4  1 

8  2 

12  6 

432  2  1 

4  1 

8  2 

12  5 

433  3  1 

4  1 

8  2 

12  6 

500  0  1 

4  1 

8  2 

12  7 

510  0  1 

4  1 

8  2 

12  6 

511  1  1 

4  1 

8  2 

12  6 

520  0  1 

4  1 

8  2 

12  7 


Orbitals 


521 


522 


530 


531 


532 


533 


540 


541 


542 


543 


544 


1 

4 

8 

12 

2 

4 

8 

12 

0 

4 

8 

12 

1 

4 

8 

12 

2 

4 

8 

12 

3 

4 
8 

12 

0 

4 

8 

12 

1 

4 

8 

12 

2 

4 

8 

12 

3 

4 
8 

12 

4 

8 

12 


1 

1 

2 

6 

1 

1 

2 

6 

1 

1 

2 

5 

1 

1 

2 

7 

1 

1 

2 

6 

1 

1 

2 

5 

1 

1 

2 

6 

1 

1 

2 

7 

1 

1 

2 

6 

1 

1 

2 

6 

1 

2 

6 
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tion  is  evaluated  by  expansion;  above  this  value, 
the  a-function  is  evaluated  by  a  closed  formula. 


The  a-Function 


A  complex  STO  a"  =  (f>)  in  its 

local  coordinate  system  has  its  origin 

displaced  to  (0,,0,  fl)  in  the  working  coordinate 
system  (r,  0,  (f)).  Its  expansion  in  spherical  har¬ 
monics  is  [2,  3] 


A 


(2L  +  IKL  +  M)! 
47r(L  —  M)! 

4'7r(/  -I-  M)! 

(21  +  DU -MV. 


1/2 


X  ^N-1 

cc 

X  E 

;=M  L 


(-1) 


M 


1/2 


(1) 


Interior  Evaluation  of  the  a-Function 


The  a-function  for  r  <  fl  is  given  by 


af'^^(^fl,^r) 


(21  +  l)(i  -  M)! 

2(1  +  M)! 

istop  jstop 

X  E  E 

i=0  i  =  0 

■(^ay-^-‘-\^ry-’-\  (2) 


with 


and 


and 


H,y  = 


i  stop  =  N  +  L  +  /  —  M 


jstop  =  N  +  /. 

For  efficiency  in  programming,  we  reduce  the 
double  sum  to  a  single  sum  by  the  introduction  of 
one-dimensional  matrices  T;(y)  [4]: 


with 


T,(/)  = 


(2/  +  l)(f-M)! 
2(L-I-M)! 


i  stop 

X  E  c,^^'^(/,/)-(^fl)' 

1  =  0 


(4) 


Although  the  C-matrix  elements  increase  explo¬ 
sively  with  the  harmonic  I  [5],  they  can  be  brought 
under  control  with  sufficiently  large  values  of  r 
and  (  ^fl).  The  price  is  the  loss  of  significant  digits. 
In  our  aim  to  achieve  16-digit  accuracy,  we  start 
with  quadruple  precision  of  32  digits  and,  to  be 
conservative,  find  the  lowest  value  of  r  that  will 
result  in  a  loss  of  only  12  digits  of  accuracy.  These 
switch-over  values  are  tabulated  in  Table  I.  Hence, 
for  r  less  than  this  value,  we  evaluate  the  a-fimc- 
tion  by  the  expansion  method  using  an  E-matrix. 
For  r  greater  than  this  value,  we  may  use  the 
closed  formula  in  quadruple  precision. 

We  note  that  the  expansion  method  is  very 
reliable,  having  been  tested  for  Is  orbitals  (1  =  0) 
to  ^fl  =  128  with  =  238.  However,  for  effi¬ 
ciency  and  to  minimize  storage  requirements  for 
the  E  and  F  matrices,  the  closed  formula  approach 
should  be  considered.  To  avoid  a  profusion  of  "IF" 
statements  in  our  programming  implied  by  Table 
I,  it  was  decided  to  just  have  one  condition:  If 
r  <  8,  use  the  expansion  method,  and  if  r  >  8,  use 
the  formula.  This  is  surprisingly  valid  at  least  up 
to  ^fl  =  64.  In  all  cases  of  <  16,  use  the  expan¬ 
sion  method. 


Exterior  Evaluation  of  the  ot-Function 

The  situation  is  much  clearer  for  the  r  >  a  case 
in  the  evaluation  of  the  a-function.  We  start  with 
the  exterior  definition  of 

Hij  = 

Again,  the  double  sum  is  changed  to  a  single 
sum  by  the  introduction  of  a  one-dimensional  ma¬ 
trix  X,(;): 


aP^^(^a,^r) 

N+l 

=  LT,(j)[(-)ye^^  (3) 

/=o 


X,(;)  = 


(2/  +  l)(/-M)!  .  ,  , 

2(1  +  M)l 


i  stop 

E  cy^^a,]) 


■[(-lYe^^  (5) 
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Then, 

N+! 

^r)  ^  X^i j)  ■  (6) 

7  =  0 

We  recall  the  definition  of  the  Z^{j)  matrix 
obtained  by  the  expansion  of  exp(^"^7)  and 
exp(™^^?): 

/  max 

/ = i Sturt 

Then, 

N+l 

0L,i^n,  L,r)  =  e~^’'  XI  Z,( /)  •  (8) 

/  =  0 

As  an  update,  we  found  by  numerical  experi¬ 
mentation,  that  i  start  =  \l  -  L\  and  that  only 
nonzero  values  occur  in  the  summation  at  every 
other  position  starting  with  i  start,  i.e.,  the  i  index 
starts  at  i  start  and  proceeds  in  steps  of  two. 

The  form  of  the  a-function  is  the  same  whether 
X,(/)  or  Z,(/)  is  used.  We  have  found  that  the 
closed  formula  at  quadruple  precision  is  adequate 
for  all  the  orbitals  in  Table  I  with  ( ^«)  >  16  and  up 
to  at  least  ^a  =  64.  Multicenter  integrals  in  the 
exterior  region  may  be  done  analytically,  since  all 
the  exponentials  have  negative  arguments. 


Conclusion 

We  have  made  it  plausible  that  the  a-function 
can  be  dealt  with  in  its  closed  formula  form  and  in 
its  expanded  form;  the  two  expressions  of  the 
a-function  complement  each  other.  Hence,  we  state 
that  the  a-function  method  can  be  applied  to  all 
orbitals  of  chemical  interest. 
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ABSTRACT  _ _ _ _ 

We  have  already  developed  the  many-electron  wavepackets  (MEWP)  method  in  order  to 
study  the  dynamics  and  electronic  structure  of  molecular  systems.  We  extended  the 
MEWP  method  to  study  the  nonadiabatic  effects  and  formulated  a  nonadiabatic  molecular 
theory,  where  both  electron  and  nucleus  are  treated  equivalently.  Then  we  applied  our 
method  to  the  isotope  series  of  hydrogen  molecule  i.e.,  H2,  HD,  and  D2/  and  calculated 
the  total  energy  and  the  average  distance  between  nucleus -nucleus,  electron- electron, 
and  nucleus— electron  in  order  to  analyze  numerically  the  nonadiabatic  effect  in  the 
molecule.  Finally  we  calculated  the  real-time  evolution  of  the  polarization  by  means  of 
Chebyshev  scheme;  and  by  Fourier  transforming  this,  we  found  out  the  excitation 
spectrum  of  the  system,  which  corresponds  to  the  electronic  excitation  and  the  nuclear 
vibrational  frequency.  ©  1996  John  Wiley  &  Sons,  Inc. 
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Introduction 

The  time-dependent  formulation  is  an  ex¬ 
tremely  important  tool  in  the  field  of  molecu¬ 
lar  dynamics  [1,  2].  We  have  already  developed 
the  many-electron  wavepacket  (MEWP)  method 
and  succeeded  in  describing  the  static  and  dy¬ 
namic  electronic  structure  of  molecules.  In  the 
MEWP  method,  we  calculate  the  imaginary-time 
evolution  of  the  many-electron  wavepacket,  where 
this  time  evolution  is  accurately  evaluated  by  the 
Chebyshev  polynomial  method  [3],  We  appropri¬ 
ately  divide  the  real  space  of  electrons  into  the 
small  grid,  and  so  we  made  up  the  discretized 
Hamiltonian  of  the  molecule.  We  construct  the 
Slater  determinant  as  an  initial  state  of  system  in 
the  imaginary-time  development.  This  relaxation 
method  [4]  directly  generates  the  ground  state, 
and  excited  states  of  the  system  could  be  also 
obtained  by  using  the  projection  operator  method 
after  one  gets  the  ground  state.  We  could  perfectly 
treat  the  electron  correlation  by  this  method,  how¬ 
ever  this  method  is  based  upon  the  Born-Op- 
penheimer  (BO)  approximation  as  well  as  in  the 
ordinary  ab  initio  calculation. 

Recent  progress  of  experiments  using  the  laser 
have  revealed  the  detailed  properties  of  molecular 
state  and  dynamic  process  in  molecules,  so  new 
molecular  theories  beyond  the  BO  approximation 
have  been  tried  from  the  different  point  of  view  [5, 
6],  Thus  we  will  extend  our  MEWP  method  to 
include  nonadiabatic  BO  (NBO)  effects  completely. 
We  regard  a  molecule  as  a  quantum  mechanical 
collective  of  electrons  and  nuclei,  and  we  treat 
electrons  and  nuclei  equivalently  by  extending  the 
MEWP  method  to  include  both  electrons  and  nu¬ 
clei.  Thus  the  initial  molecular  state  for  electrons 
and  nuclei  is  described  as  a  product  of  Slater 
determinant  corresponding  to  electrons  and  a 
wavepacket  of  nuclei,  where  the  explicit  form  of 
the  nuclear  wavepacket  depends  on  the  quantum 
statistics  of  the  nucleus  under  consideration.  We 
would  like  to  call  our  extended  wavepackets 
method  the  molecular  wavepackets  (MWP) 
method. 

In  the  following  section  we  give  the  outline  of 
the  new  molecular  theory.  Here  we  show  how  to 
represent  the  discretized  Hamiltonian  by  expand¬ 
ing  the  field  operator  in  terms  of  basis  set,  which 
discretize  the  real  space,  and  how  to  construct  the 


initial  wave  function.  In  the  third  section,  we  ap¬ 
ply  our  method  to  the  isotope  series  of  one-dimen¬ 
sional  hydrogen  molecules  (H2,  HD,  and  D2).  Here, 
we  calculate  the  energy  and  the  mean  square  root 
of  particle  distance  in  order  to  analyze  the  isotope 
effect  due  to  the  nonadiabaticity,  and  then  evalu¬ 
ate  the  real-time  development  of  polarization  to 
calculate  the  excitation  energy  of  molecular  sys¬ 
tem.  In  the  last  section  we  conclude  with  remarks 
and  future  work,  and  we  address  the  vahdity  of 
our  method. 


Molecular  Wavepacket  Method 

RELAXATION  METHOD 

We,  first,  summarize  briefly  the  relaxation 
method,  which  plays  an  important  role  in  the 
MWP  method.  The  time-dependent  Schrodinger 
equation  and  its  formal  solution  are  given  by 

d 

=  (1) 

at 

|qf(f)>  =  e-''^'|'T(0)>.  (2) 

The  basic  idea  of  the  relaxation  method  [4]  is  as 
follows:  First,  we  introduce  the  imaginary  time 
/3  =  it.  The  component  of  the  excited  state  in  the 
time-developed  state  decreases  faster  than 

the  ground  state  due  to  the  exponential  factor 
fpj.  g  sufficiently  large  /3,  we  get 
only  the  information  of  the  ground  state  as  follow; 

l'P(j8)>  =  e-'3«|T^(0)> 

n  =  0 

e-^£o|ip^><-vir^|qf(0)>,  as  )3  ^  00,  (3) 

where  I'P,,)  means  the  nth  eigenstate  of  Hamilto¬ 
nian  and  1%)  is  the  ground  state.  In  order  to 
evaluate  the  imaginary-time  evolution  of  the  sys¬ 
tem  explicitly,  we  divide  the  imaginary  time  /3 
into  L  infinitesimal  time  slices,  i.e.,  ^  =  Lr; 

l'k(/3)>  =  (c-^«)"|T^(0)>.  (4) 

Therefore  we  consider  the  following  recurrence 
formula  in  the  infinitesimal  evolution: 

|T^(r,)>  =c-^H|'k(T,_i)>,  T,  =  /t.  (5) 
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This  infinitesimal  time  evolution  operator  could  be 
estimated  by  various  methods.  For  time-indepen¬ 
dent  Hamiltonian,  we  adopt  the  Chebyshev 
scheme,  where  this  evolution  operator  is  approxi¬ 
mated  by  the  Chebyshev  polynomial  as 

^max 

e-rH  =  £  (6) 

1  =  0 

The  number  of  term  is  large  enough  to  con¬ 
verge  the  sum;  then  this  method  practically  gives 
most  accurate  results.  Substituting  Eq.  (6)  into  Eq. 
(5)  and  using  the  resolution  of  unity,  we  obtain 

^  (7) 

m 

where 

^max 

C„„,  =  L  <«|A,(H)|m>.  (8) 

1  =  0 

After  time-developing  the  initial  wave  packet  ac¬ 
cording  to  the  above  procedure,  we  could  obtain 
the  ground  state  of  the  system  under  considera¬ 
tion.  On  the  other  hand,  the  excited  state  could  be 
generated  by  the  projection  method.  Namely,  we 
first  remove  the  ground-state  component  from  the 
initial  wavepacket;  then  we  apply  the  relaxation 
method  to  this  wavepacket  and  get  the  excited 
state. 

Finally,  we  note  that  we  calculate  the  time  de¬ 
velopment  of  the  polarization  by  means  of  the 
real-time  evolution,  which  is  treated  in  the  same 
way  as  the  imaginary-time  evolution  method  ex¬ 
cept  for  the  complex  Chebyshev  polynomial.  The 
Fourier  transformation  of  this  polarization  results 
in  the  excitation  spectrum  of  the  system. 

MODEL  HAMILTONIAN 

We  here  construct  the  discretized  Hamiltonian 
according  to  following  procedure.  For  a  system 
composed  of  electrons  and  nuclei,  the  Hamiltonian 
is  given  in  terms  of  the  field  operator  i/'o-fr)  and 
as  follows: 

H  =  +  +  (9) 


where 

H,- 

^  (T(t' 

X  ^  (10) 

|r  -  r  I 

+  lEE/'i’R'i"R'  <,(R)'(':tCR') 

^  aa'  ir 

x|^<...,(R')*.,(R),  ai) 

a  a  I 

(12) 

|r  -  R| 

where  H„,  and  H,„  represent  the  electron, 
nuclear,  and  interaction  Hamiltonian  between  elec¬ 
trons  and  nuclei,  respectively;  m^,  M„,  and  Z„  are 
an  electron  mass,  a  nuclear  mass,  and  an  atomic 
number  of  ath  nucleus,  respectively.  The  electron 
field  operator  with  spin  a  a  space  coordinate  r, 
i/r„(r),  and  the  field  operator  for  a  nucleus  a  with 
nuclear  spin  I  at  space  coordinate  R,  </>aj(R)/  sat¬ 
isfy  the  following  commutation  relations: 

[V^»,^„,(r')],=  [./';(r),e(r')],  =  0,  (13) 
[<^„,(R),  cA:.,.(R')]  ,  =  5„„.S(R  -  R')5„„ 

[  <^„,(R),  <^a'/'(R')]  +  =  [  <^:,(R).  (R')] ,  =  0, 

(14) 

['/'.(r),  <^>I/(R)]  ±  =  ['/'.(r),  <^c,/(R)]±  ^ 
[</'J(r),<^'„/R)]±=  [«Aj(r),<^I,(R)],  =  0,  (15) 

where  the  -I-  sign  refers  to  the  anticommutation 
relation  for  Fermion  and  the  —  sign  means  the 
ordinary  commutation  relation  for  Boson. 

Then  we  expand  the  field  operator  by  the  basis 
set  y^(r)  ( ygfR)),  which  vanishes  everywhere  ex¬ 
cept  for  one  small  grid  generated  by  discretizing 
the  real  space  and  is  therefore  specified  by  the 
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corresponding  lattice  point  q(Q).  For  example,  wavepacket  as 


=  Lxq(RKq,,  (16) 

Q 

where  is  the  armihilation  operators  of 

electron  (nucleus)  at  the  lattice  point  q{Q)  with 
spin  cr(a),  and  satisfy  the  anticommutation  or 
commutation  relation: 

-p  “  ^qq'^o-a' 

[  ^aQ//  ^a'Q7']  +  ^  ^aa'^QQ'^W 

others  =  0.  (17) 

Then  the  discretize(d  Hamiltonian  is  given  by 


(xp\(l)) 


1 


X  exp 


(x  - 
"  20-2 


+  ik^ix  -  Xq) 


.  (20) 


where  p,  Xq,  /Cq,  and  a  is  the  spin  coordinate,  the 
mean  position,  the  mean  momentum,  and  the  dis¬ 
persion  of  the  initial  Gaussian  wavepacket,  respec¬ 
tively.  Then  we  construct  the  Slater  determinat  of 
the  initial  many-electron  wavepacket  as 


1 

(  Xi  P|| 01 ) 

(^A/Pjvl0i) 

\/nT 

(  Xi  Pil0j^  ) 

(  Pn  1  ) 

(21) 


-H  YL'^^qq'^qa^q'a' 

cr  qq' 

T  2  X/  ^iL'^qq'^qa-^q'a'^q’a-'^qu- 

CTd '  qq ' 

+  I]  IZ  H  ^aQQ'KQl^aQ'I 
a  I  QQ' 

^  ^  ^  r^a'Q' r^aQl 

^  aa'  ir  QQ' 

S  ^  ^qocQ^qa-^aQl^aQl^qa-  ^  (18) 

crq  a  IQ 

where  the  differential  operator  of  the  kinetic  part 
is  estimated  by  second-order  differencing.  Then 
introducing  a  lattice  constant  6,  the  matrix  ele¬ 
ment  of  kinetic  part  is  explicitly  given  by 

(19) 

The  atomic  unit  h  =  =  e  =  I  is  used  through¬ 

out  this  article. 

IMTIAL  MOLECULAR  M^VE  FUACTIOrV 

We  construct  an  initial  many-particle  wave 
function  in  terms  of  single-particle  electron  and 
nuclear  wavepackets.  The  single-particle  state  |</>> 
of  the  initial  state  is  represented  by  the  Gaussian 


where  N  is  the  number  of  electrons.  We  could 
construct  the  wavepacket  for  identical  Fermion 
nuclei  in  the  same  way  as  for  electrons.  For  identi¬ 
cal  Boson  nuclei,  we  use  the  permanent  instead  of 
the  determinant.  Thus  total  initial  wave  function  is 
written  as  the  product  form  of  the  electron  and 
nuclear  part: 

(  p^ , . . . ,  Xjy  p^ ,  1 0;, )  (  Xj  f  I , . . . ,  ,  1 ) . 

(22) 

If  a  molecule  consists  of  nonidentical  nuclei,  then 
the  nuclear  wave  function  is  factorized  into  parts 
of  the  corresponding  nuclear  species. 


Numerical  Examples 

We  have  applied  the  MWP  method  to  the  iso¬ 
tope  series  of  one-dimensional  hydrogen  molecule 
for  simplicity.  We  discretize  one-dimensional 
molecular  space  by  dividing  its  range  into  N 
segments,  and  the  hydrogen  molecule  is  a  four- 
particle  system  of  two  electrons  and  two  nuclei. 
Thus  this  one-dimensional  model  Hamiltonian  is 
represented  by  a  symmetric  matrix  of  iV^  dimen¬ 
sion.  The  wave  function  for  the  hydrogen  molecule 
is  explicitly  given  as 

<XiPi,  X2P2;  XjfjI'P).  (23) 

In  the  one-dimensional  systems,  Javanainen, 
Eberly,  and  Su  [7]  showed  that  the  Coulomb  po¬ 
tential  is  approximated  by  the  softened  Coulombic 
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form; 


where  Javanainen  et  al.  set  a  =  1.  This  approxi¬ 
mate  potential  retains  the  asymptotic  form  of  the 
Coulomb  potential;  moreover  it  reduces  the  singu¬ 
larity  at  the  origin.  Thus  we  also  adopt  this  soft¬ 
ened  Coulombic  potential  in  the  practical  calcula¬ 
tion. 

ISOTOPE  EFFECTS  OF  H2,  HD,  AND  D^ 

When  we  construct  the  initial  molecular 
wavepacket  of  these  isotope  molecules,  we  have  to 
pay  attention  on  the  statistics  of  nucleus.  Namely, 
the  nuclear  part  of  H2  is  given  by  the  Slater 
determinant,  and  we  use  the  permanent  for  Dj  as 
the  nucleus  D  is  Boson.  On  the  other  hand,  we 
have  to  treat  both  H  and  D  independently  for  the 
HD  molecule.  Moreover,  for  Hj  molecule,  we 
could  take  into  account  the  total  nuclear  spin  an¬ 
gular  momentum,  i.e.,  we  treat  separately  the  or¬ 
tho-  and  para-hydrogen  molecule.  Therefore,  we 
could  analyze  the  effect  of  the  quantum  statistics 
on  the  physical  quantity  in  detail. 

Table  I  summarizes  various  parameters  of  the 
initial  nuclear-electron  wavepackets  adapted  in 
this  calculation.  We  assume  that  the  range  of 
molecular  space  is  L,,  =  21  a.u.,  and  we  divide 
into  N  =  21  grids.  Therefore,  the  grid  interval  dx 
is  1.0  a.u..  For  the  initial  single-particle  Gaussian 
wavepackets  of  nucleus,  we  set  x■^  =  9,  =  0.0, 


o-j  =  1.5  for  up  spin  nuclei,  Xj  =  13,  ^2  = 

<T2  =  1.5  for  downspin.  While  for  those  of  electron 
wavepackets,  we  choose  Xj  =  9,  fcj  =  0.0,  ctj  =  2.1 
for  up  spin,  and  X2  —  13,  ^2  ~  (72  =  2.1  for 

down  spin.  Namely,  the  initial  Gaussian 
wavepacket  of  the  electrons  and  nuclei  with  up 
(down)  spin  are  put  in  the  same  position,  but  with 
the  different  width.  The  width  for  the  electron  is 
taken  to  be  larger  than  the  one  for  the  nuclei.  The 
result  does  not  depend  on  the  initial  value,  which 
has  an  effect  just  on  the  computational  time.  The 
initial  probability  density  is  shown  in  Figure  1.  We 
also  set  the  potential  parameter  a  =  1.0  [7]. 

Total  energy  by  MWP  method  is  shown  in  Table 
II.  We  cannot  definitely  find  out  the  energy  differ¬ 
ence  in  the  singlet  ground  state  of  ortho-  and 
para-H2,  so  that  the  nuclear  spin  yields  almost  no 
contribution  to  the  total  energy.  Table  III  shows 
that  average  nuclear  distance  between  protons  in 
para-Hj  is  rather  larger  than  one  in  ortho-H2,  and 
the  difference  of  average  distance  between  elec¬ 
trons  in  para-  and  ortho-H2  is  a  little  bit  amplified 
due  to  the  mobility  of  electron. 

The  order  of  the  total  energy  increases  like 
D2  <  HD  <  H2,  which  is  good  agreement  with  the 
Kolos  et  al.  [8].  This  indicates  that  the  heavy  nu¬ 
cleus  moves  slowly  compared  to  the  light  nucleus, 
in  other  words,  the  amplitude  of  the  light  nucleus 
is  larger  than  that  of  the  heavy  nucleus. 

The  average  distance  between  nuclei  in  this 
isotope  series  increases  like  H2  <  HD  <  D2,  and 
the  same  tendency  is  also  shown  in  the  average 
distance  between  electrons.  The  order  of  average 


TABLE  I  _ _ 

Parameters  of  initial  single-particle  wavepacket. 


Le 

dx 

Electron  wave  function 

Nucleus  wave  function 

(1)  BO-H2  (MEWP  method) 

21 

1.0 

X 

0 

11 

CD 

X 

0 

ro 

11 

CO 

0 

li 

0 

X 

Xo2  =  12 

0 

1! 

0 

b 

x* 

0 

ro 

II 

0 

b 

=  2.1 

0-2  =  2.1 

(spin) 

Up 

Down 

(2)  NonBO  singlet  ortho-Ha  (MWP  method) 

21 

1.0 

X01  =  9 

CO 

11 

CM 

0 

X 

X01  =  10 

CM 

II 

CM 

0 

X 

0 

d 

II 

0 

kQ2  =  0-0 

0 

b 

II 

0 

^02  ~ 

0-1  =  2.1 

(T2  =  2.1 

0-1  =  2.1 

CTg  =  2.1 

(spin) 

Up 

Down 

Up 

Down 

(mass) 

1 

1836 
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Inidal  Density  (o-H^)  Final  Density  (o-H^ 


FIGURE  1.  Probability  density  of  electrons  and  nuclei  of  ortho-  and  para-Hg  for  the  initial  wavepacket  and  the 
calculated  ground  state:  (a)  singlet  state  and  (b)  triplet  state. 


TABLE  II _ 

Total  energy  of  Hj,  HD,  and  Dj  (a.u.). 


Nucleus: 

Ortho 

Para 

Electron: 

Singlet 

Triplet 

Singlet 

Triplet 

(1)  Hj 

-1.4588412068 

-1.3644142801 

-1.4588412043 

-1.3644129998 

Electron: 

Singlet 

Triplet 

(2)  HD 

-1.4591075654 

- 1 .36449593229 

(3)  Dj 

-1.4593753100 

-1.3645815086 
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Initial  Triplet  Density  (o-Hp 


Initial  Triplet  Density  (p-H2) 


Final  Triplet  Density  (0'H2) 


Final  Triplet  Density  (p-H^) 


FIGURE  1. 

distance  is  reversed  compared  to  the  order  of  the 
total  energy. 

REAL-TIME  EVOLUTION  OF  THE 
POLARIZATION 

Finally  we  calculate  the  real-time  evolution  of 
the  polarization: 

Pit)  =  <^(OI(”X  +  X)l^(0>.  (25) 

The  results  for  BO-H2  and  NBO-ortho-H2  are  il¬ 
lustrated  in  Figure  2.  By  Fourier  transforming  the 
polarization,  we  obtain  the  excitation  spectra  as 
shown  in  Figure  3.  For  the  BO-H2  molecule,  we 
have  one  sharp  peak  at  16496  cm“^  which  corre¬ 
sponds  to  the  electronic  excitation  to  the  first  ex¬ 
cited  state.  On  the  other  hand,  for  the  NBO-ortho- 
H2  molecule,  we  have  two  sharp  peaks  at  4713 
and  16496  cm”k  In  the  nonadiabatic  treatment,  in 


(Continued) 

addition  to  the  electronic  excitation,  we  could  ob¬ 
tained  the  nuclear  vibrational  frequency  of  4713 
cm“h 


Concluding  Remarks  and  Future  Work 

In  this  study,  we  have  formulated  the  nonadia¬ 
batic  molecular  theory  and  analyzed  the  nonadia¬ 
batic  effects  and  the  nuclear  quantum  statistics 
effect  in  the  isotope  series  of  hydrogen  molecule 
and  the  nuclear  spin  multiplicity  effect  of  the  para- 
and  ortho-H2  molecule. 

In  the  imaginary-time  development,  we  ob¬ 
tained  the  total  energy  and  the  average  distance 
between  electrons,  nuclei,  and  electron-nucleus  of 
ortho-H2,  para-H2,  HD,  and  D2.  We  also  calcu¬ 
lated  the  real-time  evolution  of  the  polarization  in 
BO-H2  and  NBO-ortho-H2,  and  by  the  Fourier 
transform  we  got  the  excitation  spectrum  of  the 
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Time  (a.u.) 


Time  (a.u.) 


(a) 


Times  (a.u  ) 


Time  (a.u.) 


Time  (a.u.) 


(b) 


FIGURE  2.  Time  evolution  of  polarization  from  0  to  4000  a.u,:  (a)  BO-H2  and  (b)  NBO-H2. 


molecular  system.  BO-H2  has  only  the  electronic 
excitation;  on  the  other  hand  NBO-ortho-H2  has 
two  peaks  corresponding  to  the  electron  excitation 
and  the  nuclear  vibrational  frequency.  From  these 
results  we  are  sure  that  our  method  is  truely  a 
nonadiabatic  theory,  which  directly  includes  the 
nuclear  motion. 

Because  the  MWP  method  is  an  extension  of  the 
MEWP  method,  the  MWP  method  has  the  same 
merits  as  the  MEWP  method;  that  is,  the  MWP 
method  could  include  the  dynamical  correlation  of 
the  electron-electron,  nucleus-nucleus,  and  nu¬ 
cleus-electron.  The  relaxation  technique  due  to  the 
imaginary-time  development  yields  the  ground 
and  excited  states.  In  order  to  reduce  the  computa¬ 


tional  resources,  we  could  introduce  some  approx¬ 
imations  into  our  method.  The  mean-field  approxi¬ 
mation,  for  example,  will  drastically  reduce  a  CPU 
time  and  core  memories,  which  are  indispensable 
for  molecular  calculations.  The  method  could  be 
applied  to  a  larger  system  by  means  of  the  parallel 
processing  computer. 

There  are  characteristic  features  in  the  MWP 
method,  which  originate  from  classical  and  quan- 
tal  properties  of  nuclei.  Our  method  fully  takes 
into  account  the  quantum  effect  of  the  nuclear 
motion  and  is  beyond  the  BO  approximation.  The 
method,  of  course,  optimizes  automatically  the 
molecular  geometry  and  is  applicable  to  the  vibra¬ 
tional  and  rotational  motion  of  molecule.  In  these 
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TABLE  III 


Mean  square  root  of  particle  distances  (a.u.)- 


Electron  state: 

Singlet 

Triplet 

(1)  Ortho-H2 

e-e: 

2.1897892672 

5.8619611057 

e-H: 

1 .6689229382 

4.1324366305 

H-H: 

1.9996299013 

5.6360858657 

(2)  para-H2 

e-e: 

2.1897893589 

5.9620001840 

e-H: 

1.6689230174 

4.1324636646 

H-H: 

1 .9996300363 

5.6361244806 

(3)  HD 

e-e 

2.1898848722 

5.8457056682 

e-H: 

1.6690108471 

4.1208203462 

e-D: 

1 .6690070541 

4.1207963036 

H-D: 

1 .9997793058 

5.6188735236 

(4)  D2 

e-e: 

2.1899598503 

5.8291993119 

e-D 

1 .6690763368 

4.1089969332 

D-D: 

1.9998963145 

5.6013881927 

BOH2 


(a) 


NBO  0-H2 


FIGURE  3.  Excitation  spectra  of  (a)  BO-H2  and  (b) 
NBO-ortho-H2  obtained  by  Fourier  transform  of  the 
polarization  in  Figure  2. 


cases,  the  isotope  effect  due  to  a  nuclear  mass  on 
the  electronic  structure  is  naturally  described. 
Moreover,  in  our  method,  electorns  and  nuclei  are 
treated  at  the  same  level.  Therefore,  our  method 
could  visualize  chemical  reactions  and  could  iden¬ 
tify  the  reaction  path  and  the  activated  complex. 
As  our  method  can  easily  deal  with  the  excited 
state,  it  is  applicable  to  the  relaxation  process  in 
the  excited  molecule,  the  radiationless  transition, 
the  photo-  and  thermal  dissociation,  the  inter- 
molecular  excitation,  and  so  on.  Therefore,  our 
method  could  simulate  in  principle  all  the  chemi¬ 
cal  processes;  the  simulation  chemistry  may  be 
established  in  this  framework. 

Because  our  method  fully  takes  the  nuclear  spin 
into  consideration,  we  could  discuss  the  isotope 
effect  due  to  the  nuclear  spin  and  could  calculate 
physical  constants  appearing  in  the  magnetic  reso¬ 
nance,  such  as  the  hyperfine  constant,  the  chemical 
shift,  the  spin-spin  coupling  constant,  and  so  on, 
from  the  first  principle. 
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ABSTRACT _ _ 

A  practical  procedure  (FUERZA)  to  obtain  internal  force  constants  from  Cartesian  second 
derivatives  (Hessians)  is  presented  and  discussed.  It  allows  a  systematic  analysis  of  pair 
atomic  interactions  in  a  molecular  system,  and  it  is  fully  invariant  to  the  choice  of 
internal  coordinates  of  the  molecule.  Force  constants  for  bonds  or  for  any  pair  of  atoms  in 
general  are  defined  by  means  of  the  eigenanalysis  of  their  pair  interaction  matrix.  Force 
constants  for  the  angles  are  obtained  from  their  corresponding  two-pair  interaction 
matrices  of  the  two  bonds  or  distances  forming  the  angle,  and  the  dihedral  force 
constants  are  similarly  obtained  using  their  corresponding  three-pair  interaction  matrices. 
©  1996  John  Wiley  &  Sons,  Inc. 


Introduction 

Classical  molecular  dynamics  methods  that 
analyze  the  time  evolution  of  macroscopic 
systems  for  several  thousands  of  time  steps  or 
conformations  have  become  very  effective  tools  for 
gaining  insight  into  a  variety  of  macroscopic  sys¬ 
tems  and  processes  [1,  2].  Most  of  this  work  has 
involved  the  use  of  empirical  force  fields  obtained 
from  experimental  measurements  of  geometries, 
heats  of  formation,  vibrational  frequencies,  and 
barrier  heights.  However,  with  the  increasing  ac¬ 
curacy  and  availability  of  first-principles  methods 
(standard  ab  initio  and  density  functional  theory), 
there  is  a  growing  tendency  to  obtain  the  force 
field  data  from  high-level  computations  on  the 


basic  molecular  units  [3-7].  This  makes  it  possible 
to  treat  macroscopic  systems  that  have  not  yet 
been  investigated  experimentally  or  that  may  not 
even  exist  at  present. 

In  early  molecular  dynamics  (MD)  simulations, 
there  was  only  limited  use  of  intramolecular  force 
fields,  primarily  because  the  time  step  could  be 
increased  more  than  twofold  when  molecules  were 
constrained  to  having  fixed  bond  lengths  and 
sometimes  angles  (the  SHAKE  method  [8]).  How¬ 
ever,  for  studying  the  effects  of  external  factors, 
such  as  temperature  and  pressure,  upon  molecular 
properties  of  a  fluid  [9-11],  e.g.,  shifts  in  vibra¬ 
tional  spectra  [12],  the  force  field  clearly  must 
include  an  intramolecular  component. 

If  the  intramolecular  portion  of  the  force  field  is 
expressed  in  terms  of  force  constants  correspond¬ 
ing  to  molecular  internal  coordinates  (bond  lengths. 
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bond  angles,  and  dihedral  angles),  then  a  signifi¬ 
cant  problem  that  arises  is  that  of  assuring  invari¬ 
ance  of  the  individual  internal  force  constants  with 
respect  to  the  chosen  set  of  internals.  A  given 
molecule  can  be  described  by  various  sets  of  inter¬ 
nal  coordinates,  and  these  may  give  different  val¬ 
ues  for  a  particular  internal  coordinate  force 
constant.  Thus  the  internal  force  constants,  and 
therefore  their  intramolecular  force  field,  are  not 
invariant  to  the  choice  of  internal  coordinates.  This 
is  due  to  fact  that  physical  force  constants  are 
tensors  of  rank  2,  but  we  want  to  use  them  in 
practical  MD  simulations  as  scalars  (tensors  of 
rank  0).  An  example  of  this  problem  can  be  visual¬ 
ize  with  the  cyclic  chon  molecule: 


H 


C 

N— O 

A  quick  HF/STO-3G  calculation  yields  a  force 
constant  for  the  CO  bond  of  0.54  a.u.  if  the  chosen 
internals  are  the  CO  bond,  CN  bond,  and  NCO 
angle;  0.57  a.u.  if  we  choose  the  CO  bond,  NO 
bond,  and  CON  angle;  and  0.26  a.u.  if  we  choose 
redundant  internal  coordinates  [13]  CO,  NO,  CN, 


CNO,  NOC,  and  OCN  (for  the  sake  of  clarity,  the 
coordinates  containing  the  hydrogen  atom  are  not 
mentioned  in  this  example).  The  choice  of  a  value 
of  the  force  constant  to  be  used  in  MD  calculations 
is  therefore  ambiguous.  In  this  study,  we  present  a 
procedure  to  compute,  in  an  invariant  fashion,  the 
force  constants  corresponding  to  internal  coordi¬ 
nates. 


Background 

The  3N  component  reaction  force  8  F  due  to  a 
displacement  5x  of  the  N  atoms  in  a  molecular 
system  can  be  expressed  exactly  to  second  order 
on  a  Taylor  series  expansion  as 


81  =  -[k]8x, 


(1) 


where  [/c],  the  Hessian,  is  a  tensor  of  rank  2  and 
dimension  3N  X  3N  defined  by 


[k]  =  = 


dXi  dXj 

Explicitly,  Eq.  (1)  can  be  written  as 


(2) 


8F, 

8F, 

8F, 

d^E 

d^E 

d^E 

dxl 

d^E 

dX2 

d^E 

CO 

^  I 

dXi  dx^f^ 

d^E 

8X2  dX-^ 

d'^E 

dx\ 

d^E 

8X2  8X2, 

d^E 

8X2  dx^fj 

d^E 

dx^  dx^ 

8x^  8X2 

dxj 

8X3  8x-^j^ 

d^E 

d^E 

d^E 

d^E 

8 X3N  8 X-^ 

8X3^  8X2 

<5x3^  dx^ 

8X3N 

8xj 

8x2 

8x0 


8x 


3N 


(3) 


Tensor  [k]  can  be  obtained  by  procedures  avail¬ 
able  in  density  functional  theory  and  ab  initio 
programs  such  as  deMon  [14,  15]  and  Gaussian  94 
[16]. 

The  tensor  [k]  represents  the  intramolecular 
force  field  to  second  order  for  small  displacements 
5x.  The  eigenvalues  Ay  of  [k]  are  the  3N  force 


constants  corresponding  to  the  3  translational,  3 
rotational  and  3N-6  vibrational  modes  of  the  sys¬ 
tem.  The  eigenvectors  of  the  Hessian  [k]  indi¬ 
cate  the  directions  of  the  displacements  of  the 
normal  modes  corresponding  to  each  eigenvalue. 
If  a  displacement  of  magnitude  8r  occurs  in  the 
direction  of  one  of  these  eigenvectors,  the  reaction 
force  acts  exclusively  parallel  or  antiparallel  to  that 
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direction: 

F^- =  —[k]i>j8r  =  —X^j>i8r.  (4) 

Displacements  in  any  other  direction  result,  in 
most  cases,  in  a  reaction  force  that  is  not  in  the 
direction  of  the  displacement  vector. 

The  use  of  the  symmetric  tensor  [fc]  in  a  molec¬ 
ular  simulation  could  be  cumbersome  because  of 
the  large  number  of  independent  elements,  3N(3N 
+  l)/2.  Many  of  these  may  be  relatively  small  in 
magnitude,  as  is  shown  in  Figure  1  for  the  case  of 
nitromethane.  A  more  practical  approach  is  to  re- 
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FIGURE1.  Hessian  matrix  (kcal/A^  mol) 


late  the  force  field  to  internal  coordinates,  which 
can  generally  reduce  the  number  of  terms  to  as 
few  as  3N-6.  The  potential  energy  function  is  then 
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for  the  staggered  conformation  of  CH3NO2. 
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The  pair  interaction  matrix  that  gives  the  reaction  force  on  the  atom  N  due  to  a  displacement  of  atom  C  is 
given  by 
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in  which  b,  0,  (f),  and  co  correspond  to  the  bond 
lengths,  bond  angles,  dihedral  angles,  and  im¬ 
proper  dihedrals  (out-of-plane);  6^,  cf)^,  and 
are  their  equilibrium  values;  and  and 

are  the  respective  force  constants  (the  indexes  for 
bonds,  angles,  dihedrals,  and  impropers  are  not 
shown).  We  will  now  show  how  to  obtain  an 
invariant  force  constant  corresponding  to  any  in¬ 
ternal  coordinate.  The  same  procedure  can  be  ap¬ 
plied  for  force  fields  that  contain  cross  terms  such 
as 


-  bjie  -  ej  (6) 


bonds  &  angles 


Method 


BOND  FORCE  CONSTANTS 


The  essential  concept  is  to  analyze  the  interac¬ 
tions  between  all  pairs  of  atoms  in  the  molecule 
and  to  determine  which  ones  are  pairwise  stable, 
the  criterion  for  which  shall  be  explained.  From 
Eq.  (3),  we  can  get  the  reaction  force  = 
SF.J  on  atom  A  due  to  a  displacement 
Stg  =  (Sxg,  Si/q,  Szg)  of  atom  B: 


d^E 

d^E 

dX^  SXg 

dXA  dz^ 

d^E 

d-E 

^Va  ^Vb 

dVA  '’Zb 

SF- 

-.1 

d^E 

d^E 

d^E 

Sza  dx^ 

d-^A  SVb 

dz^  BZb 

3xb 


X 


^l/s 


(7) 


8zb 

or  in  compact  notation, 

^  (8) 


The  interatomic  force  constant  matrix  [/c^g],  de¬ 
fined  including  the  minus  sign,  permits  an  analy¬ 
sis  of  the  nature  of  the  interaction  between  atoms 


A  and  B.  It  has  three  eigenvalues  and  three 
eigenvectors  (i  =  1,2,3),  which  we  assume  to 
have  unitary  norm.  An  example  of  one  interatomic 
force  constant  matrix  of  nitromethane  molecule  is 
also  shown  in  Figure  1. 

The  physical  interpretation  of  Eqs.  (7)  and  (8)  is 
as  follows:  If  a  particular  eigenvalue  Af®  of  [^abJ 
is  positive,  this  means  that  the  reaction  force  on  A 
due  to  a  displacement  of  B  in  the  direction  of 
is  in  the  same  direction.  If  all  three  eigenvalues  are 
positive,  then  for  any  displacement  of  B  there  is 
always  a  restoring  reaction  force  on  A  that  seeks  to 
maintain  the  original  interatomic  separation.  The 
atoms  A  and  B  are  then  described  as  pairwise 
stable.  All  pairs  of  formally  bonded  atoms  belong 
to  this  category,  as  well  as  some  between  which 
there  is  no  formal  bond  but  such  a  strong  interac¬ 
tion  that  the  atoms  can  be  viewed  as  bonded.  If 
one  or  more  eigenvalues  are  negative,  then  A  will 
not  feel  a  restoring  reaction  force  trying  to  main¬ 
tain  unchanged  the  AB  separation.  This  condition 
is  described  as  a  pairwise  unstable  interaction,  and 
we  conclude  that  A  and  B  are  not  bonded.  The 
same  conclusion  is  reached  when  two  of  the  three 
eigenvalues  or  eigenvectors  are  complex. 

If  one  of  the  eigenvectors,  is  in  the  direc¬ 
tion  from  A  to  B,  which  shall  be  described  by  the 
unit  vector  and  forms  and  orthonormal  sys¬ 
tem  with  the  other  two  eigenvectors,  then  the  force 
constant  for  the  interaction  between  A  and  B, 
whether  they  are  bonded  or  not,  is, 

U  =  Af .  (9) 

When  the  direction  from  A  to  B  does  not  coincide 
with  that  of  any  of  the  eigenvectors,  or  when  the 
eigenvectors  are  not  orthogonal  to  each  other,  then 
the  force  constant  have  contributions 

from  more  than  one  eigenvalue  of  [  ]  in  propor¬ 

tion  to  the  projections  of  the  corresponding  eigen¬ 
vectors  upon  the  unit  vector 

^AB  =  L  (10) 

/  =  ! 

Equation  (9)  is  clearly  a  special  case  of  Eq.  (10). 
The  magnitude  of  /Cab/  as  given  by  these  equa¬ 
tions,  is  obtained  directly  from  the  eigenvalues  of 
the  matrix  [/CabI  in  Cartesian  coordinates,  and 
hence  is  invariant  to  the  choice  of  internal  coordi¬ 
nates.  This  invariance  is  a  fundamental  require¬ 
ment  for  any  physical  quantity. 
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BOND  ANGLE  FORCE  CONSTANTS 


Consider  the  bonds  AB  and  CB  which  have  unit 
vectors  and  Their  interatomic  matrices 
[/c^b]  and  [^cb]  have  eigenvalues  Af®  and  Af®, 
and  eigenvectors  i’,'^®  and  v,^®  (i  =  1,2,3).  Let 
be  a  unit  vector  perpendicular  to  the  plane  ABC: 


u 


N  ~ 


M^®  X  w'^® 

X  M^®1  ■ 


(11) 


Then  the  imit  vectors  perpendicular  to  the  bonds 
AB  and  CB  on  the  plane  ABC  are 


and 


'.PA  _ 


u^Xu 


AB 


-PC  ^  gCB  X 


(12) 

(13) 


They  are  the  direction  of  small  displacements  of 
atoms  A  and  C  that  result  from  opening  or  closing 
the  angle  ABC.  If  and  Rcb  are  the  length  of 
the  bonds,  the  bond  angle  force  constant  is  then 
given  by 

1  1  1 

=  - 3 -  +  3  . 

^  RIb  E  Af®l(!®^  •  RIb  L  •  ^^"1 
(=1  !=1 

(14) 


DIHEDRAL  ANGLE  FORCE  CONSTANTS 


Consider  the  dihedral  angle  defined  by  the 
atoms  A,  B,  C,  and  D  which  are  linked  by  bonds 
AB,  BC,  and  CD.  The  approach  is  similar  to  that 
used  for  bond  angles.  Unit  vectors  perpendicular 
to  the  planes  ABC  and  BCD  are  given  by 


il^®  X  H^® 

“Nabc  =  I^CB  X  /}AB| 

X  M®C 

«N,co  =  1/jDC  X 

The  dihedral  angle  force  constant  is  then 
1 _ 1 _ 

F  "" 


AB| 


(15) 

(16) 


1=1 


+  ■ 


(17) 


-^fi^bc  ^  ^cdI  x) 
1=1 


DC 


Uk]  '  V 

IVbcd  ' 


DC  I 


IMPROPER  DIHEDRAL  (OLT-OF-PLANE) 

ANGLE  FORCE  CONSTANTS 

Let  atoms  B,  C,  and  D  be  cormected  to  the 
central  atom  A.  Then  the  out-of-plane  angle  w^bcd 
is  defined  as  the  angle  between  planes  ABC  and 
BCD.  The  movement  of  A  perpendicular  to  the 
plane  BCD  feels  a  restoring  force  from  the  bonds 
AB,  AC,  and  AD.  The  effective  force  constant  for 
the  movement  of  A  perpendicular  to  BCD  is  given 
by  the  sum  of  the  three  bond  contributions: 

^AN  =  i  Af®i«^  •  f>,^®i  +  i  •  ^,^^1 

i=\  !=1 

+  Z ■  vt^\,  (18) 

i  =  l 

where  is  a  unit  vector  perpendicular  to  the 
plane  BCD.  Then  the  out-of-plane  force  constant 
for  the  angle  Wabcd  's 

“  ^<uabcd  ^  ^ABCD^AN' 

where  /Iabcd  is  the  length  of  projection  into  the 
plane  BCD  of  the  altitude  of  triangle  ABC  (with 
base  BC). 


Applications 

We  have  coded  the  preceding  equations  in  the 
program  FUERZA.  It  is  presently  designed  to  read 
the  Cartesian  Hessian  tensor  from  the  output  of  a 
Gaussian  [16]  program  after  a  frequency  (FREQ)  or 
second  derivative  of  the  energy  has  been  re¬ 
quested.  The  output  format  of  FUERZA  is  compat¬ 
ible  with  the  input  format  of  the  molecular  model¬ 
ing  program  CHARMm  [17].  These  features  can,  of 
course,  be  modified  to  be  consistent  with  other 
programs. 

In  the  applications  that  follow,  we  used  a  proce¬ 
dure  based  on  density  functional  theory  (DFT) 
available  in  the  program  Gaussian  94  [16]  to  opti¬ 
mize  geometries  and  compute  the  second  deriva¬ 
tives  of  the  energy.  The  Becke  88  [18]  and  the 
Lee-Yang-Parr  [19]  functionals  were  used  to  repre¬ 
sent  the  exchange  and  correlation  potentials, 
respectively.  However,  our  method  can  be  imple¬ 
mented  with  any  other  DFT,  ab  initio  or  semiem- 
pirical  technique.  In  the  cases  to  be  discussed, 
FUERZA  used  the  force  constants  in  the  output  of 
a  frequency  DFT  calculation  to  create  the  in¬ 
tramolecular  force  field,  which  was  subsequently 
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used  in  conjunction  with  CHARMm  for  a  classical 
calculation  of  vibrational  frequencies.  The  extent  to 
which  these  reproduce  the  original  DFT  frequen¬ 
cies  is  a  measure  of  the  quality  of  the  constructed 
force  field.  (The  DFT  procedure  being  used  is 
known  to  be  effective  for  vibrational  frequencies 
[20,  21].)  In  the  trivial  case  of  diatomic  molecules, 
exact  agreement  is  obtained,  as  it  must  be,  since 
there  is  only  one  independent  internal  coordinate 
and  hence  one  possible  force  constant. 

In  the  following,  all  bond  lengths  are  in  A, 
angles  in  degrees,  bond  force  constants  in  kcal/A^, 
angular  force  constants  in  kcal/rad^,  frequencies 
in  cm“’,  and  doubly  or  higher  degenerate  vibra¬ 
tional  modes  are  listed  only  once. 

H2O 

The  DFT  values  for  the  bond  length  and  bond 
angle  are  0.976  and  102.7;  the  corresponding  force 
constants  are  1067  and  80.4,  as  compared  to  900 
and  110  in  the  CHARMm  force  field.  All  three 
eigenvalues  of  the  interaction  matrix  [Khh]  ^re 
positive,  indicating  a  stable  HH  interaction  which 
should  be  included  in  the  force  field.  The  equilib¬ 
rium  HH  distance  is  1.249  and  the  force  constant  is 
31.1.  The  vibrational  frequencies  obtained  with  this 
force  field  are  1555,  3674,  and  3679;  the  DFT  fre¬ 
quencies  are  1636,  3643,  and  3754  respectively. 

NOa 

The  DFT  bond  lengths  and  bond  angle  are  1.224 
and  133.1,  and  the  corresponding  force  constants 
are  1334  and  242.  No  stable  OO  interaction  is 
found.  The  created  force  field  frequencies  are  703, 
1201,  and  1630,  compared  to  the  DFT  713,  1301, 
and  1586. 

CH3 

For  this  radical  in  D3;,  symmetry,  the  DFT  bond 
length  is  1.089  and  the  bond  force  constant  is  768. 
The  bond  angle  and  out-of-plane  force  constants 
are  62  and  12.2.  The  resulting  force  field  frequen¬ 
cies  are  449,  1426,  2997,  and  3186,  while  the  DFT 
are  459,  1386,  3059,  and  3236. 

CH3\'02 

It  should  be  noted  that  parameters  for  nitro 
derivatives  are  not  available  in  any  of  the  standard 
force  fields.  The  frequencies  calculated  with  the 


FUERZA  program,  with  the  DFT  values  given  in 
parentheses,  are:  577(572),  606(619),  945(866), 
1000(1067),  1081(1091),  1176(1323),  1278(1370), 
1294(1436),  1418(1450),  1694(1558),  2912(3014), 
3033(3107),  and  3064(3136).  The  first  two  frequen¬ 
cies  are  not  given,  since  they  correspond  to  the 
lowest  torsional  modes,  in  which  anharmonic  ef¬ 
fects  are  particularly  significant;  accordingly,  nei¬ 
ther  harmonic  approximation,  empirical  nor  DFT, 
is  meaningful. 

CIS-RH(PH3)2(CO)F 

The  results  of  an  earlier  DFT  study  [22]  were 
used  to  make  a  more  demanding  test  of  the  pre¬ 
sent  approach.  The  force  field  frequencies  ob¬ 
tained,  with  the  DFT  values  in  parentheses,  are: 
91(102),  91(107),  91(114),  92(115),  254(271),  260(274), 
292(297),  393(403),  408(448),  419(465),  422(470), 
434(477),  452(522),  782(530),  801(968),  966(993), 
971(1095),  971(1095),  991(1098),  1328(1100), 
1925(1935),  2315(2362),  2315(2364),  2339(2386), 
2339(2388),  2348(2390),  and  2361(2390).  The  fre¬ 
quencies  for  the  lowest  three  torsional  modes  are 
not  given,  again  because  anharmonic  effects  ren¬ 
der  them  useless. 


Conclusion 

Overall,  the  vibrational  frequencies  obtained 
with  the  FUERZA  force  fields  are  in  very  good 
agreement  with  those  resulting  from  the  DFT  orig¬ 
inal  calculations,  over  a  wide  range  of  values.  This 
supports  the  physical  validity  of  the  approach  that 
we  have  presented  for  obtaining  an  intramolecular 
force  field  in  terms  of  force  constants  that  are 
invariant  to  the  choice  of  internal  coordinates.  The 
procedure  can  be  applied  to  any  molecular  system 
for  which  Cartesian  second  derivatives  of  the  en¬ 
ergy  are  available. 

ACKNOWLEDGMENT 

We  greatly  appreciate  the  financial  support  of 
the  Office  of  Naval  Research,  through  contract  No. 
N00014-95-1-0028. 


References 

1.  B.  R.  Gelin,  Molecular  Modeling  of  Polymer  Structures  and 
Properties  (Hanser  Publishers,  Munich,  1994). 


1276 


QUANTUM  CHEMISTRY  SYMPOSIUM  NO.  30 


INTRAMOLECULAR  FORCE  FIELD 


2.  P.  L.  Hyskens,  W.  A.  P.  Luck,  and  T.  Zeegers-Hyskens, 
Eds.,  Intermolecular  Forces  (Springer-Verlag,  Berlin,  1991). 

3.  M.  Dagher,  M.  Kobersi,  and  H.  Kobeissi,  J.  Comp.  Chem. 
16,  723  (1995). 

4.  C.  J.  M.  Huige  and  C.  Altona,  J.  Comp.  Chem.  16,  56  (1995). 

5.  T.  Furuki,  A.  Umeda,  M.  Sakurai,  Y.  Inoue,  R.  Chujo,  and  K. 
Harata,  J.  Comp.  Chem.  15,  90  (1994). 

6.  R.  V.  Stanton,  D.  S.  Hartsough,  and  K.  M.  Merz,  Jr.,  J. 
Comp.  Chem.  16,  113  (1995). 

7.  D.  Cvetko,  A.  Lausi,  A.  Morgante,  F.  Tommasini,  P. 
Cortona,  and  M.  G.  Dondi,  J.  Comp.  Phys.  100,  2052  (1994). 

8.  W.  V.  Gunsteren,  J.  Comp.  Phys.  23,  327  (1977). 

9.  J.  M.  Seminario,  M.  C.  Concha,  J.  S.  Murray,  and  P.  Politzer, 
Chem.  Phys.  Lett.  225,  25  (1994). 

10.  J.  M.  Seminario,  M.  C.  Concha,  and  P.  Politzer,  J.  Chem. 
Phys.  102,  8281  (1995). 

11.  J.  M.  Seminario,  M.  C.  Concha,  and  P.  Politzer,  Int.  ]. 
Quantum  Chem.  29,  621  (1995). 

12.  M.  Diem,  Introduction  to  Modern  Vibrational  Spectroscopy 
(Wiley,  New  York,  1993). 

13.  P.  Pulay,  G.  Fogarasi,  F.  Pang,  and  ].  E.  Boggs,  J.  Am.  Chem. 
Soc.  101,  2550  (1979). 

14.  A.  St-Amant  and  D.  R.  Salahub,  Chem.  Phys.  Lett.  169,  387 
(1990). 


15.  A.  St-Amant,  Ph.D.  Thesis  (Universite  de  Montreal,  1992). 

16.  M.  J.  Frisch,  G.  W.  Trucks,  H.  B.  Schlegel,  P.  M.  W,  Gill, 
B.  G.  Johnson,  M.  A.  Robb,  J.  R.  Cheeseman,  T.  Keith,  G.  A. 
Petersson,  J.  A.  Montgomery,  K.  Raghavachari,  M.  A.  Al- 
Laham,  V.  G.  Zakrzewski,  J.  V.  Ortiz,  J.  B.  Foresman,  C.  Y. 
Peng,  P.  Y.  Ayala,  W.  Chen,  M.  W.  Wong,  J.  L.  Andres,  E.  S. 
Replogle,  R.  Gomperts,  R.  L.  Martin,  D.  J.  Fox,  J.  S.  Binkley, 
D.  J.  Defrees,  J.  Baker,  J.  P.  Stewart,  M.  Head-Gordon,  C. 
Gonzalez  and  J.  A.  Pople,  Gaussian  94,  Revision  B.3,  Gauss¬ 
ian,  Inc.,  Pittsburgh  PA.,  1995. 

17.  B.  R.  Brooks,  R.  E.  Bruccoleri,  B.  D.  Olafson,  D.  J.  States,  S. 
Swaminathan,  and  M.  Karplus,  J.  Comput.  Chem.  4,  187 
(1983). 

18.  A.  D.  Becke,  Phys.  Rev.  A  33,  3098  (1988). 

19.  C.  Lee,  W.  Yang,  and  R.  G.  Parr,  Phys.  Rev.  B  37,  785  (1988). 

20.  B.  G.  Johnson,  P.  M.  Gill,  and  J.  A.  Pople,  J.  Chem.  Phys.  98, 
5612  (1993). 

21.  B.  G.  Johnson,  in  Modern  Density  Functional  Theory:  A  Tool 
for  Chemistry,  J.  Seminario  and  P.  Politzer,  Eds.  (Elsevier 
Science,  Amsterdam,  1995),  Chap.  6. 

22.  E.  A.  Salter,  A.  Wierzbicki,  J.  M.  Seminario,  N.  W.  Hoffman, 
M.  L.  Easterling,  and  J.  D.  Madura,  J.  Phys.  Chem.  98, 
129845  (1994). 


INTERNATIONAL  JOURNAL  OF  QUANTUM  CHEMISTRY 


1277 


Intermolecular  Perturbation  Theory: 
Renormalized  Interaction  Energies 


WILLIAM  H.  ADAMS 

Wright  and  Rieman  Chemistry  Laboratories,  Rutgers  University,  New  Brunswick,  New  Jersey  08903; 
e-mail:  adams@rutchem.rutgers.edu 

Received  February  26,  1996;  accepted  June  5,  1996 


ABSTRACT _ _ _ _ _ 

For  intermolecular  perturbation  theories  in  which  it  is  assumed  that  the  unperturbed 
wave  fimction  of  the  composite  system  is  a  product  of  the  unperturbed  wave  functions  of 
its  components,  and  which  satisfy  one  general  constraint,  we  derive  two  renormalized 
interaction  energy  expressions  which  are  more  accurate  than  the  perturbation  expansions, 
when  all  are  evaluated  to  comparable  order.  This  is  accomplished  by  focusing  on  the 
parameter  A  in  terms  of  which  the  perturbation  expansions  are  derived  rather  than  on  the 
potential  of  interaction  between  components.  In  the  derivation  of  each  renormalized 
energy  formula,  we  discard  zeroth-  through  infinite-order  terms  which  do  not  contribute 
to  the  interaction  energy  when  the  interaction  is  turned  on  fully,  i.e.,  when  A  =  1.  The 
first  renormalized  interaction  energy  when  A  =  1  is  identical  in  form  to  the  interaction 
energy  in  the  symmetrized  Rayleigh-Schrodinger  (SRS)  theory,  but  not  in  interpretation. 
The  wave  function  appearing  in  the  renormalized  energy  cannot  generally  be  that 
assumed  in  the  SRS  theory,  and  the  renormalized  energy  to  zeroth  order  in  A  is  not  zero. 
The  latter  is  not  surprising  because  we  discarded  a  zeroth-order  term  in  the  derivation. 
The  second  renormalized  interaction  energy  formula  is  derived  from  the  first  by  using 
the  same  set  of  assumptions  and  arguments  that  were  used  in  deriving  the  first.  We 
expect  it  to  be  more  accurate  than  the  first,  which  is  expected  to  be  more  accurate  than 
the  sum  of  the  perturbation  energies,  all  evaluated  to  comparable  order.  These 
expectations  are  supported  by  the  results  of  calculations  on  LiH  using  two  perturbation 
theories,  the  polarization  approximation  and  the  Amos-Musher  theory.  The  first-order 
wave  functions  for  both  were  calculated  in  the  configuration  interaction  (Cl) 
approximation;  then  the  interaction  energies  were  calculated  by  summing  the  perturbation 
energies  through  third  order  and  by  evaluating  the  renormalized  energy  expressions.  The 
perturbation  results  are  compared  to  interaction  energies  calculated  by  full  Cl  with  the 
same  basis  set.  As  important  as  the  formulas  is  the  light  our  analysis  throws  on  the 
meaning  of  order  in  intermolecular  perturbation  theory.  ©  1996  John  Wiley  &  Sons,  Inc. 
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Iiitrodiictioii 

We  show  that  for  a  certain  class  of  inter- 
molecular  perturbation  theories,  interac¬ 
tion  energies  can  be  calculated  more  accurately 
from  two  renormalized  energy  formulas  than  from 
the  energy  expansions  defined  by  the  perturbation 
theories,  using  in  each  case  just  the  zeroth  through 
;jth-order  wave  functions.  In  the  theories  we  con¬ 
sider,  it  is  assumed  that  the  unperturbed  wave 
function  for  two  interacting  atoms/molecules  is 
the  product  of  the  ground-state  atomic/molecular 
wave  functions  for  the  noninteracting  systems. 
There  are  additional  conditions  that  the  theories 
must  satisfy.  The  most  important  is  that  they  shoul 
give,  in  principle,  exact  energies  and  wave  func¬ 
tions  when  carried  to  infinite  order.  It  is  this  condi¬ 
tion  that  the  simplest  intermolecular  perturbation 
theories  fail  to  satisfy,  due  to  the  existence,  for 
nearly  all  systems,  of  an  infinite  number  of  un¬ 
physical  solutions  to  the  Schrbdinger  equation,  ly¬ 
ing  lower  in  energy  than  the  physical  ground  state. 

Unphysical  solutions  of  the  Schrodinger  equa¬ 
tion  are  those  which  violate  the  Pauli  exclusion 
principle.  Claverie  [1]  explained  how  they  become 
accessible  in  the  simplest  intermolecular  perturba¬ 
tion  theory,  the  polarization  approximation  (PA) 
[2],  and  that  the  lowest  energy  unphysical  solu¬ 
tions  have  lower  energies  than  the  physical  ground 
state.  Morgan  and  Simon  [3]  later  observed  that 
there  were  an  infinite  number  of  unphysical  solu¬ 
tions  of  lower  energy  than  the  physical  ground 
state  in  any  system  containing  at  least  one  atom  of 
atomic  number  greater  than  2,  and  that  the  physi¬ 
cal  ground-state  energy  lies  in  a  continuum  of 
unphysical  state  energies.  Consequently,  the  PA 
theory  not  only  gives  a  divergent  energy  expan¬ 
sion  [4],  but  cannot  be  used  in  principle  to  calcu¬ 
late  the  ground-state  energy  if  carried  to  infinite 
order  [5].  The  simplest  improvement  on  the  PA 
theory,  the  symmetrized  Rayleigh-Schrodinger 
(SRS)  theory  [6-9],  fails  similarly  [10].  In  contrast 
to  these  negative  results,  we  have  shown  that  one 
can  circumvent  the  unphysical  states  problem  by 
changing  the  Hamiltonian  for  the  interacting  sys¬ 
tems  in  ways  that  make  its  lowest  energy  eigen¬ 
function  the  physical  ground-state  function  [5,  11, 
12],  Perhaps  the  simplest  change  is  that  made  in 
the  Amos-Musher  (AM)  theory  [13],  It  was  rela¬ 
tively  inaccurate  when  applied  to  H  j  [14],  but  this 


defect  is  corrected  significantly  by  the  renormal¬ 
ized  energy  expressions  derived  in  this  study. 

An  essential  element  in  our  derivation  of  the 
renormalized  interaction  energy  formulas,  and  in 
our  understanding  of  these  formulas,  is  the  explicit 
use  of  a  parameter  A  in  carrying  out  the  perturba¬ 
tion  expansions.  It  might  seem  unnecessary  to  in¬ 
clude  A  explicitly  since  it  multiplies  the  perturbing 
potential  V  in  the  Hamiltonian  for  an  interacting 
pair  of  molecules,  but  it  is  the  key  to  deriving  and 
understanding  the  renormalized  energy  expres¬ 
sions.  In  particular,  the  focus  on  A,  as  we  will 
explain,  gives  us  a  mathematical  handle  for  under¬ 
standing  perturbation  theory.  We  feel  that  an  oft- 
cited  defect  of  intermolecular  perturbation  theories 
[2,  15],  that  orders  are  not  uniquely  defined,  is 
more  imaginary  than  real.  It  is  based  on  the  false 
assumptions  that  U  is  “weak"  and  that  expansion 
of  the  energy  and  wave  function  in  powers  of  A  is 
equivalent  to  expansion  in  powers  of  V.  If  V  were 
weak,  an  infinite  number  of  unphysical  states 
would  not  be  lower  in  energy  than  the  physical 
ground  state.  The  renormalized  energy  formulas 
show  that  expansion  in  A  is  not  equivalent  to 
expansion  in  V.  For  example,  our  first  formula 
gives  to  zero  order  in  A  the  Heitler-London  inter¬ 
action  energy.  There  is  no  mystery  to  this  because 
we  have  identified  terms  of  all  orders  which  can¬ 
not  contribute  to  the  interaction  energy  in  the  limit 
A  =  1  and  then  discarded  those  terms.  The  resul¬ 
tant  energy  expression  is  identical  to  that  of  the 
SRS  perturbation  theory,  except  for  its  A  depen¬ 
dence  and  except  that  the  wave  function  appearing 
in  it  cannot  be  the  PA  function  if  the  system  has 
more  than  two  electrons.  Our  second  interaction 
energy  formula  is  derived  from  the  first  by  apply¬ 
ing  to  it  the  same  reasoning  that  led  to  the  first. 

In  the  next  section  we  summarize  our  under¬ 
standing  of  the  perturbation  problem  defined  by 
the  assumption  that  the  unperturbed  wave  func¬ 
tion  for  the  system  is  the  product  of  the  unper¬ 
turbed,  ground-state  wave  functions  of  two 
atoms/molecules.  The  renormalized  interaction 
energy  formulas  are  derived  in  the  third  section 
for  perturbation  theories  based  on  Hamiltonians 
that  satisfy  certain  general  conditions.  We  infer 
that  the  second  formula  is  more  accurate  than  the 
first,  and  that  both  are  more  accurate  than  the  sum 
of  the  perturbation  energies  when  calculated  to  the 
same  order  in  the  wave  function.  This  section  is 
followed  by  one  in  which  the  Hamiltonians  of 
several  published  perturbation  theories  are  recast 
in  the  form  assumed  in  the  derivation  of  the  renor- 
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malized  energy  formulas.  In  the  next  to  last  sec¬ 
tion,  we  apply  the  energy  formulas  to  the  LiH  2 
ground  state  using  the  PA  and  the  AM  perturba¬ 
tion  theories.  The  results  support  our  assertion  that 
the  renormalized  energies  are  more  accurate  than 
the  sum  of  the  perturbation  theory  energies.  In 
addition,  the  results  suggest  how  one  might  calcu¬ 
late  still  more  acurate  interaction  energies  than  the 
renormalized  energy  formulas  give.  This  conjec¬ 
ture  is  tested,  too. 


Nature  of  the  Problem 

Most  of  the  work  on  intermolecular  perturba¬ 
tion  theory  published  until  the  1990s  did  not  take 
into  consideration  the  unphysical  states  problem 
delineated  by  Claverie  [1]  and  Morgan  and  Simon 
[3].  In  this  section  we  define  our  notation  as  we 
summarize  what  the  existence  of  these  states  im¬ 
plies  for  perturbation  theories  based  on  a  zeroth- 
order  wave  function  which  is  a  product  of  unper¬ 
turbed  atomic  or  molecular  functions,  specific  elec¬ 
trons  being  assigned  to  each  subsystem.  The  prob¬ 
lem  is  defined  by  the  PA  theory. 

The  PA  theory  is  just  the  Rayleigh-Schrddinger 
perturbation  theory  with  a  specific  choice  for  the 
unperturbed  Hamiltonian  [2].  Let  A  and  B  repre¬ 
sent  two  interacting  systems,  which  have,  respec¬ 
tively,  and  Ng  electrons.  We  arbitrarily  assign 
the  first  electronic  position-spin  coordinates  to 
A,  the  coordinates  +  1  to  N  =  +  IVg  to  B. 

Let  H  be  the  nonrelativistic  Hamiltonian  for  the 
A~B  system  in  the  Born-Oppenhieimere ^approxi¬ 
mation.  The  unperturbed  Hamiltonian  is  the 
Hamiltonian  of  A  when  B  is  absent,  plus  the 
Hamiltonian  of  B  when  A  is^  absent.  The  two  sys¬ 
tems  interact  via  V  =  H  -  H^.  The  lowest  energy 
eigenfunction  of  HMs  T®  and  its  eigenvalue, 

The  function  in  our  calculations  is  the  spin- 
coupled  product  of  the  A  and  B  ground-state  wave 
functions,  and  is  the  sum  of  the  A  and  B 
ground-state  energies.  In  the  PA  theory  one  solves 

(H^  +  XV)F(X)  =  #^(A)F(A)  (1) 

by  the  Rayleigh- Schrodinger  method  setting 

f(A)=F°+  i;  and 

71=1 

00 

r(A)=r°+  £  A''r<"\  (2) 

n  =  l 


Note  that  A  has  only  two  physically  significant 
values,  zero  and  one.  No  attempt  is  made  in  the 
PA  theory  to  exploit  the  electronic  exchange  sym¬ 
metry  that  is  so  important  for  chemical  bonding. 
The  result  is  a  theory  with  major  defects:  the 
expansions  (2)  must  diverge  for  systems  contain¬ 
ing  more  than  two  electrons  [1,  4]  and,  for  systems 
containing  one  atom  of  atomic  number  greater 
than  2,  one  cannot  give  a  closed-end  prescription 
for  calculating,  in  principle,  the  exact  ground-state 
energy  [5].  This  raises  a  question  as  to  the  signifi¬ 
cance  that  should  be  attached  to  low-order  PA 
calculations  and  to  refinements  of  the  PA  theory. 

One  can  gain  a  good  intuitive  understanding  of 
the  eigenvalues  of  (1)  at  A  =  0  and  1  by  consider¬ 
ing  the  limit  of  large  separations  R  between  A  and 
B.  We  require  F(A)  to  be  antisymmetric  in  the  A 
electronic  coordinates  and,  separately,  in  the  B 
electronic  coordinates.  Thus,  the  set  of  ^(0)'s  equal 
all  possible  sums  of  one  unperturbed  A  and  one 
unperturbed  B  eigenenergy.  For  R  infinite,  some 
members  of  the  set  of  ^(l)'s  equal  members  of  the 
set  of  ^(0)'s,  but  there  are  additional  eigenvalues 
corresponding  to  states  which  violate  the  Pauli 
exclusion  principle.  We  have  explained  in  some 
detail  the  origin  of  these  states  for  LiH  [16].  We 
have  used  the  configuration  interaction  (Cl) 
method  to  solve  Eq.  (1)  nonperturbatively  for  2 
LiH  for  0  <  A  <  1.  Our  results  show  that  the  ^(X) 
at  finite  R  have  the  dependence  on  A  inferred  by 
Claverie  [1],  namely,  that  the  lowest  eigenvalue 
^/A)  equals  at  A  =  0,  that  it  shows  behavior 
characteristic  of  an  avoided  crossing  with  ^2^X) 
well  before  A  =  1, . . . ,  and  that  it  equals  E ",  the 
lowest  unphysical  state  energy,  at  A  =  1  [5, 10-12]. 
We  find  for  LiH,  from  the  equilibrium  bond  length 
to  infinity,  that  -  E"  =  1  hartree,  which  is 
much  larger  than  the  binding  energy  of  about  0.08 
hartree.  The  magnitude  of  -  E"  is  consistent 
with  the  insight  of  Morgan  and  Simon  [3]  in  that  it 
suggests  that  the  unphysical  continuum  at  large  R 
begins  about  0.5  hartree  below 

To  understand  the  mathematical  properties  ot 
the  energy  expansion  in  Eq.  (2),  one  lets  A  be 
complex  and  interprets  ^(X)  as  a  multivalued 
function  of  A.  The  general  properties  of  ^(A)  are 
well  understood  in  the  Cl  approximation  [17-19]. 
In  particular,  it  is  known  that  ^(A)  can  have  only 
branch  point  singularities.  We  denote  the  sheets  of 
the  ^(A)  Riemann  surface  by  i^^(A)  and  order 
them  so  that  <  3ir^+/A).  The  eigenfunc¬ 

tion  corresponding  to  ^j^(X)  is  denoted  by  Fj^iX).  In 
an  exploratory  set  of  calculations  we  have  foimd 
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that  the  avoided  crossing  between  and  ^2^^ 
for  LiH  arises  from  a  pair  of  branch  points  nar¬ 
rowly  straddling  the  real  A  axis  at  A  =  0.78,  Thus, 
the  power  series  expansion  [Eq.  (2)]  of  i^i(A)  has  a 
radius  of  convergence  =  0.78.  Only  within  that 
radius  of  convergence  does  the  power  series 
uniquely  define  ^(A).  For  |A|  greater  than  the 
radius  of  convergence,  one  can  in  principle  analyti¬ 
cally  continue  the  series  either  along  the  lowest 
Riemann  sheet  or  to  the  second  lowest.  The  physi¬ 
cal  ground-state  energy,  however,  generally  lies  on 
neither  of  these  sheets,  but  on  a  sheet  that  is  an 
indeterminate  number  of  sheets  above  ^^(A).  In  an 
exact  calculation,  that  indeterminate  number  is 
expected  to  be  infinity.  One  can  assume  that  it  is 
possible  by  successive  analytical  continuations  of 
the  power  series  to  move  from  the  lowest  sheet  to 
the  sheet  on  which  the  physical  ground-state  en¬ 
ergy  lies,  but  there  is  zero  probability  that,  after 
any  number  of  continuations,  the  desired  sheet 
will  have  been  reached  [5]. 

One  naturally  conjectures  that  the  simplest  way 
to  correct  the  deficiencies  of  the  PA  theory  is  to 
project  out  the  unphysical  contributions  to  F^(\) 
and  calculate  the  energy  from  it.  This  is  exactly 
what  is  done  in  the  SRS  theory  [6-9].  Unfortu¬ 
nately,  the  SRS  theory  to  infinite  order  is  no  more 
accurate  for  LiH  than  is  the  PA  theory  [10]  when 
one  uses  F^(X).  One  can  again  imagine  analytically 
continuing  the  PA  expansion  from  the  lowest  Rie¬ 
mann  sheet  to  the  sheet  on  which  the  physical 
ground-state  energy  lies,  but  one  still  has  to  carry 
out  an  indeterminate,  if  not  infinite,  number  of 
successive  continuations.  This  will  not  be  the  case 
for  a  perturbation  theory  derived  from  a  Hamilto¬ 
nian  which  has  no  unphysical  states  below  the 
physical  ground  state.  We  show  in  the  next  section 
that  if  F/A)  is  determined  by  such  a  perturbation 
theory,  generally  not  the  PA  theory,  then  the  SRS 
energy  expression  gives  a  renormalized  interaction 
energy  which  sums  an  infinite  number  of  contribu¬ 
tions  to  the  usual  perturbation  approximation  to 
the  interaction  energy. 

We  would  use  the  difficulties  that  arise  in  the 
PA  theory  from  the  existence  of  unphysical  states 
as  an  argument  for  abandoning  intermolecular 
perturbation  theories  based  on  the  choice  for 
the  unperturbed  Hamiltonian,  except  that  we  have 
shown  that  there  are  theories  which  do  not  have 
the  defects  of  the  PA  theory.  These  are  theories  in 
which  ^i(O)  =  and  ^,(1)  =  E,,  the  physical 
ground-state  energy.  It  is  to  these  theories  that  the 


energy  formulas  we  derive  in  the  next  section 
should  be  applied.  The  theories  are  reviewed  in  a 
later  section. 


Interaction  Energy  Formulas 

We  derive  two  interaction  energy  formulas 
which  effect  partial  infinite-order  summations  of 
energy  expansions  for  a  number  of  intermolecular 
perturbation  theories.  In  the  derivations  we  as¬ 
sume  that  the  perturbed  wave  function  F(A)  is  an 
eigenfunction  of  a  Hamiltonian  +  KV  and  that 
the  perturbed  energy  ^(A)  is  the  corresponding 
eigenvalue.  It  follows  that  the  interaction  energy 

Ar(A)  =  r(A)  -  =  A<F^|#|f(A)>  (3) 

when  F(A)  is  normalized  so  that  <f  ®  |  F(A)>  =  1. 
Equation  (3)  gives  the  interaction  energy  directly 
rather  than  as  the  difference  between  two  total 
energies.  The  potential  ^  is  different  for  each  of 
the  perturbation  theories  considered  in  the  next 
section,  but  each  satisfies  si^7/'=s^'V.  It  follows 
that 

+  AU)F(A)  =  g^(A)j^F(A).  (4) 

We  derive  the  first  interaction  energy  formula  from 
Eq.  (4).  It  is  effectively  the  SRS  energy  formula,  but 
its  derivation  provides  a  useful  understanding  of 
what  has  been  accomplished  in  deriving  that  for¬ 
mula.  The  derivation  suggests  that  one  can  expect 
the  formula  to  give  more  accurate  approximations 
to  interaction  energies  when  the  f (A)  expansion  is 
truncated  at  nth  order  than  can  be  obtained  from 
Eq.  (3)  with  the  same  approximation  to  F(A).  From 
the  imderstanding  gained  in  deriving  the  first  for¬ 
mula,  we  are  lead  to  apply  to  that  formula  the 
same  series  of  manipulations.  TTie  derivation  uses 
another  consequence  of  =  sPv,  namely,  that 

_  v)F\  (5) 

where  F^^^  is  the  first-order  wave  function  of  the 
theory  which  defines  F(A),  The  result  is  a  second 
renormalized  interaction  energy  formula.  We  ex¬ 
pect  it  to  be  more  accurate  than  the  first. 

A  key  element  in  the  derivations  is  the  commu¬ 
tator  identity 

=  [U>].  (6) 

It  is  this  identity  which  Hirschfelder  called  para¬ 
doxical  [2]  because  its  left  side  is  zeroth  order  in 
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V ,  the  right,  first  order.  Equation  (6)  would  be 
paradoxical  if  the  perturbation  ppansions  [Eq.  (2)] 
were  developed  in  powers  of  Y  rather  than  A,  but 
they  are  not.  No  A  appears  in  Eq.  (6);  both  sides  of 
Eq.  (6)  are  zeroth  order  in  A.  At  the  time 
Hirschfelder  called  it  paradoxical,  it  was  believed 
that  V  was  a  weak  perturbation.  Given  the  under¬ 
standing  that  we  now  have  of  the  relationship 
between  the  physical  and  unphysical  state  eigen¬ 
values  of  +  V,  aird  the  relationship  of  both  to 
the  eigenvalues  of  H°,  it  is  vmreasonable  to  as¬ 
sume  that  V  is  a  weak  perturbation,  suitable  for 
defining  orders  in  an  expansion.  Furthermore,  ex¬ 
pansion  in  y  is  much  more  complicated  than  ex¬ 
pansion  in  A.  This  is  quite  clear  in  the  Cl  approxi¬ 
mation  in  which  expansion  in  V  means  that  each 
element  of  the  V  matrix  should  be  treated  as  an 
independent  variable.  A  power  series  expansion  in 
terms  of  A  is  simpler,  and  it  gives  some  mathemat¬ 
ical  imderstanding  of  the  properties  of  i’(A) 
[17-19]. 

To  derive  the  first  formula,  we  use  Eq.  (6)  to 
transform  Eq.  (4)  into 


H°J  +  (A  - 


F(A)  =#’(A)jyF(A). 


(7) 


Next,  multiply  Eq.  (7)  from  the  left  by  the  complex 
conjugate  of  F°  and  integrate  over  all  space.  Since 
F°  is  an  eigenfunction  of  H**  and  i’®  is  its  eigen¬ 
value,  it  follows  that 

Ar(A)  =r(A)  -r® 

<F“iW|F(A)>  +  (A  -  lXF°|jyVlF(A)> 

^  <F°|.a?|F(A)>  ■ 

(8) 

In  the  discussion  which  follows,  the  function  F(A) 
should  be  regarded  generally  as  the  exact,  unex¬ 
panded  eigenfunction  of  H°  +  A^.  Only  when 
discussing  order  should  it  be  assumed  that  F(A)  is 
a  power  series  in  A  as  in  Eq.  (2). 

The  term  in  the  numerator  of  Eq.  (8)  that  is 
multiplied  by  A  -  1  is  interesting.  It  does  not  con¬ 
tribute  to  the  energy  when  A  =  1,  but  it  con¬ 
tributes  a  term  to  each  order  of  the  A  expansion 
of  A#’(A).  What  is  more  interesting  is  that  part 
of  what  it  contributes  in  nth  order  is  canceled, 
when  A  =  1,  by  part  of  what  it  contributes  in 
n  +  1th  order.  For  example,  it  contributes  in  the 


numerator  of  Eq.  (8)  to  zeroth  order  -  <  F  °  WV  |  F  “  > 
and  to  first  order  (F^lj/ylF®)  —  <F'’Jjyy |F^^^). 
The  second-order  contribution  is  — 

< F^lj/y  |F®>,  and  so  on.  In  short,  when  one  tnm- 
cates  the  A  expansion  of  Ag’(A)  at  nth  order,  one 
includes  terms  which  do  not  contribute  in  infinite 
order  when  A  =  1.  For  this  reason,  if  one  is  going 
to  approximate  the  interaction  energy  by  a  trun¬ 
cated  expansion  in  A,  one  should  derive  it  from 


AE,(A)  = 


(F°|W/|F(A)) 

<F“|jy|F(A)> 


(9) 


This  is  the  SRS  energy  expression  except  for  two 
differences.  First,  in  the  SRS  theory  it  is  assumed 
implicitly  that  V  is  multiplied  by  A;  that  is  clearly 
not  the  case  in  Eq.  (9).  Second,  the  F(A)  in  Eq.  (9) 
cannot  generally  be  the  PA  wave  function  [10]. 

The  problem  with  using  the  PA  theory  F(A)  in 
Eq.  (9)  arises  even  when  one  restricts  the  theory  to 
the  determination  of  the  potential  energy  surface 
of  the  physical  ground  state.  In  this  case,  in  Eq. 
(8)  should  be  the  lowest  eigenvalue  of  H°.  Thus, 
we  are  interested  in  S’/ A),  the  lowest  sheet  of  the 
g’(A)  Riemarm  surface.  For  nearly  all  systems,  the 
PA  theory  g’/l)  equals  the  lowest  unphysical  state 
energy  and  j/F/l)  =  0,  i.e.,  Eq.  (8)  gives  A^(A)  = 
0/0.  In  contrast,  the  Hamiltonians  considered  in 
the  next  section  all  have  g’/l)  =  E^,  the  physical 
ground-state  energy,  and  for  each,  jYF(1)  is  pro¬ 
portional  to  the  physical  ground-state  wave  func¬ 
tion.  In  the  PA  theory,  g’/l)  =  only  for  one  and 
two  electron  systems. 

The  denominator  of  Eq.  (8)  is  also  interesting.  If 
one  substitutes  in  Eq.  (8)  the  F(A)  expansion  trun¬ 
cated  to  nth  order,  then  expands  this  approxima¬ 
tion  to  Ag’(A),  the  expansion  includes  contribu¬ 
tions  of  infinite  order  in  A,  i.e.,  the  denominator  of 
Eq.  (8)  effects  an  infinite-order  summation  of  se¬ 
lected  contributions  to  the  Af’(A)  expansion.  In 
fact,  we  first  derived  an  approximation  to  the 
energy  formula  given  below  by  summing  selected 
terms  in  the  Ag’(A)  expansion  of  the  AM  theory. 
We  conclude  that  Eq.  (9)  includes  an  infinite-order 
summation  of  contribution  to  the  A^’fA)  expan¬ 
sion.  Equation  (9)  is  the  first  of  the  renormalized 
interaction  energies. 

The  importance  of  the  denominator  in  Eq.  (9) 
increases  as  the  number  of  electrons  increases.  To 
see  this,  one  has  only  to  assume  that  at  large  R, 
the  dominant  contribution  to  F(A)  is  F°.  Then 
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<F°|.c/|f(l)>  =  For  binary  systems,  as 

R  ^  all  permutation  contributions  to  (F^Iqs^'\F^) 
approach  zero  except  that  from  the  identity  opera¬ 
tion,  i.e.,  (F^l^/lF^)  ^  N^lNi^l/Nl.  This  equals  ^  in 
our  LiH  calculations.  It  will  be  much  smaller  for 
systems  containing  more  electrons. 

The  expansion  of  A  £/ A)  differs  from  the  expan¬ 
sion  of  A^(A)  even  to  the  zeroth  order  in  A.  To 
zeroth  ^order,  A^(X)  is  zero,  but  AE,(0)  = 
(F^^\Vj2p\F^)  /(F^l^/lF^^)  =  A£p^j_,  commonly 
called  the  Heitler-London  energy.  It  is  also  the 
first-order  SRS  energy.  It  should  not  be  puzzling 
that  A  £5(0)  is  nonzero.  When  we  discarded  the 
term  in  A^(A)  that  is  multiplied  by  A  -  1  to  get 
AEsiXl  we  discarded  -(F^WV\F^),  which  is  the 
term  which  cancels  the  zeroth-order  contribution 
from  {£^|14c/|f (A))  to  the  numerator  of  A^(A). 
That  A  £5(0)  should  equal  A  is  not  absurd.  It  is 
as  if  one  knew  an  unperturbed  Hamiltonian  hav¬ 
ing  .s/£°  for  an  eigenfunction,  with  -h  AE^^^ 
its  eigenvalue.  A  perturbation  theory  based  on  that 
Hamiltonian  would  give  the  same  zeroth-order 
contribution  to  the  interaction  energy.  No  such 
theory  has  been  proposed. 

Since  in  zeroth  order  AE^CA)  is  equal  to  what  is 
generally  regarded  as  a  first-order  energy,  one 
should  expect  that  to  first  order  in  A  it  will  include 
second-order  energies  and  so  on.  To  look  at 
these  contributions,  we  have  to  consider  a  specific 
perturbation  theory,  which  we  do  in  the  next  sec¬ 
tion.  We  show  there  that  the  numerator  of  A£5(A) 
expanded  to  first  order  in  A  does  include 
There  is  no  reduction  in  computational  effort  asso¬ 
ciated  with  the  shifting  of  terms  from  higher  to 
lower  order,  but  one  does  get  higher  accuracy. 

In  the  Rayleigh-Schrodinger  perturbation  the¬ 
ory,  one  can  calculate  the  energy  to  third  order 
if  one  knows  the  zeroth-  and  first-order  wave 
functions.  This  is  done  by  recognizing  that  the 
equation  which  determined  £^“^  can  be  used  to 
to  eliminate  from  the  third-order  energy,  i.e., 

( <^{^)|F(i)).  We  use  Eq.  (5)  to  achieve  a 
similar  end.  We  solve  Eq.  (5)  for  .o/VF^,  then  sub¬ 
stitute  its  Hermitian  adjoint  into  Eq.  (8)  to  obtain 
ArCAKF^MFCA))  =  <F0WF(A))r^i>  -  <F^'>|(H" 
— ^^XcZ'IFCA))  -f  (a  -  l){F^|.!:/y|F(A)>.  Finally,  we 
use  Eq.  (6)  followed  by  Eq.  (4)  to  arrive  at 


We  assume  here  that  F^^^  is  calculated  with  the 
perturbation  theory  that  is  derived  from  the 
Hamiltonian  which  has  F(A)  as  its  eigenfunction, 
although  this  is  not  ^necessary.  In  addition,  we 
assume  again  that  .c/F(l)  is  proportional  to  the 
physical  ground-state  wave  function.  Equation  (10) 
should  be  compared  to  Eq.  (8).  Note  that  there  is 
an  additional  term  multiplied  by  A  “  1  and  that 
there  is  an  additional  term  in  the  denominator.  We 
have  achieved  a  further  infinite-order  summation 
of  selected  contributions  to  the  A<i^(A)  expansion. 
Again  we  discard  the  terms  multiplied  by  A  -  1 
because  they  do  not  contribute  to  the  interaction 
energy  when  A  =  1,  the  one  value  of  interest.  The 
second  renormalized  interaction  energy  formula  is 

^  <F°MF(A)>#^(^^  +<F(^^iW|F(A)> 

AEdA)  =  - ^ - . 

<F^  +F^'^MF(A)> 

(11) 

To  zeroth  order  in  A,  Eq.  (11)  does  not  vanish.  In 
zeroth  order,  its  numerator  includes  a  contribution 
that  would  ordinarily  be  identified  as  second  or¬ 
der.  We  show  below  that  A  £^(1)  to  zeroth  order  in 
A,  is  identical  to  AE^d)  when  F°  -f  F^^^  is  substi¬ 
tuted  for  F(A).  In  Eq.  (11)  terms  have  been  shifted 
to  lower  order  not  by  approximation  but  by  dis¬ 
carding  those  contributions  to  the  exact  Ai^(A)  of 
Eq.  (10),  which  vanishes  when  A  =  1.  We  show  in 
the  next  section  that  if  we  substitute  F°  -h  AF^^^  for 
F(A)  in  Eq.  (11),  the  resultant  approximation  to 
A£j(A)  includes  terms  that  one  would  customarily 
classify  as  third  order. 

Before  we  go  further  it  simplifies  the  later  dis¬ 
cussion  if  we  introduce  a  notational  convention  to 
indicate  approximations  made  in  evaluating  Eqs. 
(9)  and  (11),  We  consider  only  one  class  of  approxi¬ 
mations,  substituting  F^  +  AF^^^  +  "•  +A"F^”^  for 
F(A).  When  this  is  done,  we  add  a  superscript  [n] 
to  the  £  in  Eqs.  (9)  and  (11).  When  we  also  set 
A  =  1  in  those  formulas,  we  write  simply  AE^”^ 
and  A£{”^  for  the  resultant,  approximate  interac¬ 
tion  energies.  Translated  into  the  new  notation,  the 
observations  of  the  preceding  paragraph  are  that 
AE{^^  =  AE^^^  and  that  Ef;h(A)  includes  terms  cus¬ 
tomarily  identified  as  third  order.  On  the  basis  of 


Ag^(A) 


<F«MF(A)>r^i>  +<F('>|W|F(A)>  +  (A  -  1XF°  +  F^^Wv\F(X)) 

(f^~Vf^J\fO0) 


(10) 
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these  observations,  we  conjecture  that  is 

generally  more  accurate  than  AE^"l  Our  numeri¬ 
cal  results  for  AE^^^  and  AE^  are  consistent  with 
this  conjecture.  Note  that  in  our  calculations,  we 
have  expanded  neither  AE^”KA)  nor  AEf”KA)  in  a 
power  series  in  A. 

It  is  straightforward  to  show  that  AE{^^  =  AE^^l 
Note  that  A£}^^  =  A-T/O),  the  zeroth-order  approx¬ 
imation  to  A^(A).  We  have 

^  <f«  +  f^'VIf«> 

and 

<F^|W/|F°  +  F('^> 

A  pb]  =  - 

If  we  assume  that  F°  and  F^^^  are  real,  which  is 
the  case  in  our  calculations,  then  the  denominators 
of  AEf^  and  A£^^^  are  identical.  We  use  Eq.  (6), 
then  Eq.  (5)  in  the  numerator  of  AE^^^  to  see  that 

<F^|W/|F°  +  F^^^> 

=  <F°1W|F°>  +  (f°WV+Jh°  -Hy|F<^>) 

=  <F°W|F°>  +<F°|VV|F<’^> 

+<F°|V(r(i>  -  y)|F“>, 

which  reduces  to  the  numerator  of  A  EfK 

It  is  natural  to  wonder  if  a  formula  more  accu¬ 
rate  than  Eq.  (11)  might  be  derived  by  applying  a 
sequence  of  operations  to  it  similar  to  the  sequence 
that  led  from  Eq.  (9)  to  (11).  The  answer  is  yes. 
One  can  transform  (F^''|yV|F(A))  in  Eq.  (10)  to 
bring  in  (F^^'lvVlFU))  and  other,  lower  order 
terms.  Thus,  an  additional  summation  of  the  per¬ 
turbation  expansion  can  be  achieved.  We  do  not 
present  the  analysis  here  because  we  have  not  yet 
done  calculations  with  the  resulting  formula. 

Suitable  Perturbation  Theories 

In  the  second  section,  we  explained  why  it  is 
important  that  intermolecular  perturbation  theo¬ 
ries  have  i’j(A)'s  such  that  i’i(O)  =  and  ^i(l)  = 
£j,  the  physical  ground-state  energy.  The  energy 
formulas  of  the  preceding  section  are  strictly  appli¬ 
cable  only  to  theories  satisfying  both  conditions. 
We  have  recently  reviewed  which  of  the  existing 


perturbation  theories  can  satisfy  these  conditions 
[10],  but  we  did  not  state  the  eigenvalue  equations 
for  each  in  the  form  assumed  in  the  preceding 
section.  We  do  that  here.  In  addition,  we  show  that 
the  second-order  AM  energy  contributes  to  both 
AEf)  =  AE^^'  when  the  AM  first-order  function  is 
substituted  in  these  formulas.  Similarly,  the  third- 
order  AM  energy  is  shown  to  contribute  to  AEfl 
The  ri(A)'s  of  the  AM  theory  [13],  Hirshfelder's 
HAV  (van  der  Avoird-Hirschfelder)  theory  [20] 
and  Polymeropoulos's  and  the  author's  theory  (AP 
since  PA  is  taken)  [21],  satisfy  the  conditions  stated 
above.  The  three  theories  are  clearly  related.  Their 
wave  functions  and  energies  are  defined  by  equa¬ 
tions  of  the  form  [FI®  +  Xs/iV  —  D(A))]F(A)  = 
M\)FiX)  [10].  The  three  differ  in  their  definitions 
of  D(A)  and  in  the  meaning  assigned  to  the  eigen¬ 
value  In  the  AM  theory  D(A)  is  identically 

zero,  but  the  Coulomb  potential  energy  between 
the  nuclei  must  be  included  in  V.  We  write  "must" 
because  when  it  is  not,  the  MX)  are  rapidly  vary¬ 
ing  functions  of  A  [5].  The  AM  theory's  M^X)  has 
the  properties  assumed  for  g’(A)  in  the  preceding 
section,  namely,  S’/O)  =  and^^/1)  =  Ej.  Thus, 

^  in  the  AM  theory  is  5^^ 

In  the  HAV  theory,  it  is  required  that  [H°  -I- 
aV(V  -  Dha(A))]Fha(A)  =  i’°FHA(A),  which  im¬ 
plicitly  defines  Djj^(A).  We  substitute  = 
#’ha(A)  -  ADha(A)  in  the  HAV  equation,  i.e.,  we 
set  ADha(A)  =  AfHA(A),  then  rearrange  terms  so 
that  the  Fha(A)  is  an  eigenfunction  of  H®  -I-  A^ 
and  ^ha^A)  is  its  eigenvalue.  We  find  #ha  =■3^1^ 
+  Dha(A)(1  -V).  Since  ADha(A)  is  identical  to 
A#’ha(A),  one  uses  (^Ha(A)  —  ^hA  +  +  ■■■ 

to  derive  the  equations  which  determine  the  F^l- 
In  the  AP  theory  MX)  is  defined  by  requiring 
that  Fap(A)  be  least  distorted  from  F®  in  an  energy 
sense  [22],  under  the  constraint  that  VFap(A)  not 
vanish.  This  gives  [  H®  -I-  X^(V  —  Dap(A))]Fap(A)  = 
^p(A)Fa^(A)  with  Dap(A)  =  <F,^p(A)1Vv|Fap(A)>/ 
<Fap(A)|V|Fap(A)>.  We  have  shown  that  the  AP 
equation  implies  that  jafF^pd)  is  an  eigenfxmction 
of  H®  +  V  with  the  eigenvalue  ^p(l)  +  D^pd). 
To  put  the  AP  equation  into  the  form  assumed  in 
the  preceding  section,  we  substitute  ‘^ap(A)  7 
g’^^pfA)  -  ADap(A)  and  rearrange  terms  to  get  (H° 
-l-  A2^p)Fap(A)  =  ^ap(A)Fap(A)  where  5^p 
+  Dap(A)(1  -V).  This  is  identical  in  form  to  ^ha^ 
but  the  two  D(A)'s  are  different.  In  particular, 
Dap(1)  must  be  less  than  A^Apd)  because  the 
lowest  ^p(l)  >  r®  [22].  In  the  HAV  theory 
Dha(I)  =  ^'^ha(I)-  Through  first  order  in  the  wave 
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function  and  second  order  in  the  energy,  however, 
the  HAV  and  AP  theories  are  identical  [21], 

In  the  last  theory  we  consider,  a  Hermitian 
operator  ^  is  introduced  which  satisfies  =  0 
and  =  0,  and  has  the  property,  when  added  to 
+  y,  of  shifting  all  unphysical  energies  above 


this,  one  has  first  to  derive  the  identity  = 

To  do  this,  we  used  the 
equations  which  determine  the  first-  and  second- 
order  AM  functions  and  Eq.  (6).  It  its  then  straight¬ 
forward  to  show  that 


+  2f;»)  +<Faio)  +  OF»ij/iF°> 
<F"  +  +  fffl) 


the  physical  ground-state  energy  [12],  Two  such 
operators  have  been  constructed  explicitly  [11,  12], 
but  it  should^  be  possible  to  construct  additional 
ones.  With  V  +^,  the  lowest  eigenvalue  in 
this  level  shifted  (LS)  theory  can  satisfy  the  condi¬ 
tions  ^  srid  'i’Ls(l)  =  El-  The  resultant 

perturbation  theory  is  identical  through  first  order 
in  the  wave  function  and  second  order  in  the 
energy,  to  the  PA  theory.  We  have  used  the  LS 
theory,  without  an  explicit  definition  of  ^  to  ar¬ 
gue  that  results  obtained  with  the  PA  and  SRS 
theories  are  fundamentally  valid  if  only  F°  and 
FpA  were  used  [5,  11].  In  this  context,  can  be 
regarded  as  a  valid  higher  order  extension  of  the 
SRS  energy  whereas  AE^^^  cannot. 

It  is  not  possible  to  relate  the  contributions  to 
AEgfA)  or  AE,(A)  to  orders  in  the  PA  theory  be¬ 
cause  the  numerators  of  both  generally  depend  on 
matrix  elements  of  s/V,  whereas  the  PA  perturba¬ 
tion  energies  are  determined  by  matrix  elements 
of  V  alone.  The  AM  perturbation  theory,  on  the 
other  hand,  is  well  suited  to  this  task  because  its 
perturbation  energies  are  defined  by  = 

{F°lcPv  iFl'll)  when  the  normalization  {F°| 
Fam(A)>  =  <F°|F°>  =  1  is  imposed.  Thus,  for  the 
AM  theory,  the  numerator  of  Eq.  (12),  assuming 
real  functions,  equals  <F"WF‘’ +^am-  This 
shows  that  A  includes  contributions  through 
second  order  in  the  perturbation  energies  although 
AE["'  is  the  zeroth-order  term  in  the  expansion  of 
AE/A)  in  powers  of  A.  This  is  not  surprising 
because  in  the  derivation  of  A£^(A),  we  discarded 
contributions  to  A^(A)  of  all  orders  in  A. 

In  the  calculations  described  in  the  next  section, 
we  have  calculated  the  PA  and  AM  wave  func¬ 
tions  only  through  first  order  in  A  since  it  can  be 
argued  that  it  is  not  practical  to  go  to  higher  order. 
This  means  that  we  can  calculate  only  AE^^  = 
AE{^^  and  AEjh.  The  numerator  of  AEj^^  when  the 
AM  functions  are  used,  includes  contributions 
from  the  first  through  third  order  To  show 


We  include  this  equation  here  to  show  that  the 
numerator  is  not  just  a  sum  of  the  AM  perturba¬ 
tion  energies  as  well  as  to  show  that  it  explicitly 
includes  the  third-order  energy. 

Finally,  we  must  note  that  the  numerator  of 
can  be  evaluated  using  just  F®  and  f  but 
that  the  evaluation  of  its  denominator  requires 
also  We  could,  of  course,  trimcate  the  denom¬ 
inator  of  AE^^^  to  a  different  order  than  the  numer¬ 
ator,  but  we  have  not  done  so  because  we  feel  that 
it  would  be  inconsistent  with  the  formal  analysis 
in  which  changes  in  the  denominator  are  linked  to 
changes  in  the  numerator. 


Numerical  Test 

A  numerical  test  is  required  to  verify  the  infer¬ 
ences  drawn  in  the  preceding  sections,  namely  that 
is  more  accurate  than  A£^^^^  for  each  pertur¬ 
bation  theory,  and  that  both  are  more  accurate 
than  the  perturbation  energies  summed  through 
the  highest  order  calculable  with  through  F^^\ 
We  have  calculated  only  f  for  the  PA  and  AM 
theories,  which  might  be  thought  of  as  the  practi¬ 
cal  limit  for  intermolecular  perturbation  calcula¬ 
tions.  Using  the  PA  theory,  is  the  interaction 
energy  given  by  the  SRS  theory,  also  known  as  the 
symmetrized  PA  theory  [8].  Accordingly,  we  refer 
to  AE^^  calculated  with  the  AM  functions  as  the 
symmetrized  AM  (SAM)  energy.  In  the  same  man¬ 
ner,  according  to  the  perturbation  functions  used 
in  evaluating  A  EP^,  we  refer  to  it  as  the  IRS  or  the 
I  AM  energy.  The  properties  of  AEP  and  AEP  as 
functions  of  the  nuclear  separation  R,  as  exhibited 
in  our  calculations,  suggest  a  method  by  which 
more  accurate  approximations  to  the  interaction 
energy  might  be  obtained  than  from  either  AE^^ 
or  AEP^  alone.  We  test  it  as  well. 

We  have  chosen  to  study  LiH  because  it  is  the 
simplest  system  which  has  an  energy  spectrum 
like  nearly  all  other  systems,  i.e.,  with  an  infinite 
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number  of  unphysical  states  below  the  physical 
ground-state  energy.  Our  calculations  were  done 
with  a  bideterminantal  Cl  basis  in  the  manner,  and 
with  a  basis  set,  we  have  described  previously  [5, 
10].  The  orbital  basis  set  consisted  of  Li  Is,  2s, 
2  per,  and  Ipn  Hartree-Fock  functions  and  the 
hydrogen  atom  Is,  2s,  2 per,  and  Ipir  eigenfunc¬ 
tions.  Full  Cl  with  this  basis  gave  2840  bidetermi¬ 
nants.  A  finite  Cl  basis  set  cannot,  of  course,  repre¬ 
sent  an  infinite  number  of  unphysical  solutions 
below  the  physical  ground-state  energy,  but  this 
one  does  represent  five,  including  the  three  lowest. 
Thus,  our  calculations  allow  for  effects  that  a  finite 
number  of  unphysical  states  below  the  physical 
ground-state  energy  might  introduce.  How  this 
may  differ  from  the  effect  of  the  infinite  number 
that  would  be  present  in  an  exact  calculation  is 
unknown.  Our  calculations  show  that  the  basis  set 
is  sufficient  to  model  both  short-  and  long-range 
interactions  between  Li  and  H.  It  is  not  large 
enough  to  give  an  accurate  LiH  potential  surface, 
but  that  is  not  our  goal.  Our  goal  is  to  show  that 
is  more  accurate  than  AE^^,  and  that  both 
are  more  accurate  than  the  corresponding  pertur¬ 
bation  theories  summed  through  third  order  in  the 
energy.  For  that  purpose,  the  basis  set  is  adequate. 

The  results  obtained  for  the  PA,  SRS,  and  IRS 
energies  are  summarized  in  Figure  1.  Each  line  in 
the  figure  represents  one  of  the  interaction  ener¬ 
gies  divided  by  the  full  Cl  interaction  energy.  A 


R  in  bohr 

FIGURE  1.  For  the  ""S  LiH  ground  state,  the  ratios  of 
the  PA,  SRS,  and  IRS  interaction  energies  to  the  full  Cl 
interaction  energy  are  plotted  against  the  nuclear 
separation  R.  The  lower  dashed  curve  represents  the 
sum  of  the  first-  through  second-order  PA  energies;  the 
upper  dashed  curve,  the  sum  through  third  order.  The 
lower  solid  curve  represents  the  SRS  interaction  energy, 
the  upper  solid  curve,  the  IRS.  If  one  of  the  perturbation 
theories  gave  the  exact  interaction  energy,  its  curve 
would  overlay  the  light,  horizontal  line  in  the  graph. 


value  of  one  for  the  ratio  indicates  that  the  pertur¬ 
bation  theory  gives  the  energy  exactly.  Note  that 
the  range  of  R  values  extends  from  short,  less 
than  the  equilibrium  bond  length  of  3.0  bohrs,  to 
long  range  where  the  PA  theory  if  known  to  give 
exactly  the  leading  terms  in  the  asymptotic  1/R 
expansion  of  the  energy  [23].  That  all  four  curves 
are  approximately  equal  to  one  at  20  bohrs,  shows 
that  the  SRS  and  IRS  energies  are  also  asymptoti¬ 
cally  correct.  It  is  clear  that  AE^^g  is  much  more 
accurate  than  the  PA  energy  expansion  summed 
through  third  order  and  that  AEj^g  is  modestly 
more  accurate  than  AE^^g.  These  results  are  con¬ 
sistent  with  the  discussion  of  the  preceding  sec¬ 
tions.  Note  that  the  IRS  energy  is  less  than  90%  of 
the  exact  interaction  energy  for  R  <  12  bohrs, 
dropping  to  only  75%  at  the  equilibrium  bond 
length. 

The  results  obtained  for  the  AM,  SAM,  and 
lAM  energies  are  presented  in  Figure  2  in  the  same 
manner  that  results  were  presented  in  Figure  1.  It 
is  noteworthy  that  the  not  one  of  the  curves  in 
Figure  2  approaches  1  as  R  approaches  20  bohrs. 
This  was  expected;  the  AM  theory  is  known  not  to 
give  correctly  the  coefficients  of  the  leading  terms 
in  the  asymptotic,  1  /R  expansion  of  the  energy  for 
H2  [14].  It  is  clear  from  Figure  1  that  the  SAM 
energy  is  more  accurate  than  the  AM  energy  ex¬ 
pansion  summed  through  third  order  and  that  the 
I  AM  energy  is  more  accurate  than  the  SAM  en- 


R  in  bohr 

FIGURE  2.  For  the  LiH  ground  state,  the  ratios  of 
the  AM,  SAM,  and  lAM  interaction  energies  to  the  full  Cl 
interaction  energy  are  plotted  against  the  nuclear 
separation  R.  The  lowest  dashed  curve  represents  the 
sum  of  the  first-  through  second-order  AM  energies;  the 
dashed  curve  immediately  above  it,  the  sum  through 
third  order.  The  lower  solid  curve  represents  the  SAM 
interaction  energy;  the  upper  solid  curve,  the  lAM.  The 
short-dash  curve  represents  the  corrected  lAM  interaction 
energy. 


INTERNATIONAL  JOURNAL  OF  QUANTUM  CHEMISTRY 


1287 


ADAMS 


ergy.  This  is  consistent  with  the  inferences  drawn 
from  the  formal  analyses  of  the  preceding  sec¬ 
tions.  The  lAM  energy  is  not  so  accurate  as  one 
would  wish/  equaling,  at  best,  less  than  80%  of 
the  exact  interaction  energy  at  any  R.  The  IRS  en¬ 
ergy  is  more  accurate  than  the  lAM  for  R  > 
4  bohrs,  but  the  two  are  of  comparable  accuracy  at 
smaller  R. 

The  lAM  energy  as  a  function  of  R  exhibits  an 
interesting  property:  It  varies  rather  slowly  with 
R,  giving  between  67  and  78%  of  the  exact  interac¬ 
tion  energy  from  3  bohrs,  the  equilibrium  bond 
length,  to  20  bohrs.  This  is  the  basis  for  a  specula¬ 
tion.  Suppose  that  in  calculations  more  accurate 
than  the  ones  presented  here,  the  lAM  theory 
could  be  expected  to  give  the  same  fraction  /  of 
the  exact  interaction  energy  at  all  R.  Since  the  PA, 
SRS,  and  IRS  theories  are  asymptotically  correct  at 
large  R,  one  could  calculate  /  at  the  largest  R, 
then  multiply  the  lAM  energies  at  each  R  by  1// 
to  get  a  corrected  lAM  interaction  energy.  The 
short-dash  line  in  Figure  2  shows  how  this  works 
when  we  set  /  =  A  El^M/Efys,  using  the  R  =  20 
values  of  those  energies.  The  corrected  lAM  en¬ 
ergy  overestimates  the  interaction  energy. 

To  compare  the  IRS  and  corrected  lAM  energies 
we  have  plotted  them  with  the  full  Cl  energy  in 
Figure  3.  We  have  not  plotted  the  lAM  energy 
because  it  is  almost  indistinguishable  from  the  IRS 
energy  on  the  scale  of  the  graph.  Note  that  the 
same  range  of  R  values  is  used  in  Figure  3  as  in 


R  in  bohr 

FIGURE  3.  For  the  LiH  ground  state,  the  IRS  and 
the  corrected  lAM  interaction  energies,  and  the  full  Cl 
interaction  energy  are  shown  as  functions  of  R.  The  solid 
curve  represents  the  cull  Cl  results;  the  long-dash  curve, 
the  IRS  results;  and  the  short-dash  curve,  the  corrected 
lAM  results.  The  lAM  results  were  not  plotted  because 
they  are  indistinguishable  from  the  IRS  results  on  this 
scale. 


the  first  two  figures,  even  though  the  energies,  on 
the  scale  of  the  graph,  are  all  zero  for  R  >  10 
bohrs.  Where  the  interaction  energy  is  significant, 
the  corrected  lAM  energy  is  clearly  more  accurate 
than  the  IRS  energy.  We  are  undertaking  an  exten¬ 
sive  series  of  calculations  to  see  if  the  method  we 
have  used  to  correct  the  I  AM  energies  works  with 
larger  basis  sets  for  LiH  and  for  other  systems. 


Coiichisions 

There  is  no  ambiguity  as  to  order  if  one  focuses 
on  the  expansion  parameter  A  in  developing  per¬ 
turbation  theory  for  interacting  atoms/molecules 
from  the  assumption  that  the  unperturbed  wave 
function  for  the  system  is  a  product  of  unper¬ 
turbed  atomic/molecular  wave  functions.  The 
renormalized  energy  A  E^(\)  is  more  accurate  than 
the  sum  of  the  perturbation  energies  when  both 
are  calculated  using  through  In  turn, 

AEj(X)  is  more  accurate  than  AE^fA)  under  the 
same  conditions. 
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ABSTRACT _ _ _ _ _ _ _ 

We  present  a  computational  scheme  to  study  the  dynamics  of  many-electrons  in  molecular 
systems  by  wavepackets  method.  Several  approaches  to  calculation  of  nonlinear  optical 
properties  for  molecules  under  time-independent  or  time-dependent  external  electric 
fields  are  presented.  Some  simple  examples  of  one-dimensional  two-  or  three-electron 
systems  are  demonstrated  concretely.  Implications  of  these  results  are  discussed  in 
relation  to  the  validity  of  the  many  electron  wavepackets  (MEWP)  method.  ©  1996  John 
Wiley  &  Sons,  Inc. 


Introduction 

Heller  et  al.  [1-5]  has  developed  a  time- 
dependent  formulation  in  classical  and 
semiclassical  mechanics  by  using  Gaussian 
wavepackets.  The  manifold  of  Gaussian  wave 
packets  is  discussed  by  using  the  time-dependent 
variational  principle  [6].  Numerically  exact  propa¬ 
gation  methods  developed  by  Kosloff  et  al.  [7] 
indicated  that  the  Laplacian  of  wave  function  can 
be  evaluated  by  using  fast  Fourier  transform  (FFT) 
algorithms.  Moreover,  time  evolution  of  the  wave 
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function  may  be  obtained  in  terms  of  a  second- 
order  difference  scheme  developed  by  Askar  and 
Cakmak  [8].  Feit  et  al.  [9]  demonstrated  the  useful¬ 
ness  of  a  split-operator  time  propagation  scheme. 
Tal-Ezer  and  Kosloff  [10]  showed  that  the  time 
evolution  may  be  achieved  to  machine  accuracy  by 
expanding  an  evolution  operator  by  means  of  com¬ 
plex  Chebyschev  pol3momials.  Kosloff  et  al.  [11] 
pointed  out  that  eigenfunctions  and  eigenvalues  of 
Schrodinger  equation  are  determined  by  solving 
the  Schrodinger  equation  in  imaginary-time  evolu¬ 
tion  and  that  excited  states  are  obtained  by  filter¬ 
ing  out  the  lower  states.  The  basic  framework  of 
time-dependent  quantum  mechanical  (TDQM) 
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methods  for  nuclear  dynamics  calculations  are  ex¬ 
plained  by  Kosloff  [12].  Leforestier  et  al.  [13]  com¬ 
pared  time  propagation  algorithms  extensively 
used  so  far  for  the  time-dependent  Schrodinger 
equation.  Zhang  [14]  showed  that  propagation  of 
quantum  wavepacket  in  the  interaction  picture 
improves  the  efficiency  of  the  TDQM  method  that 
has  emerged  as  a  powerful  tool  in  nuclear  dynam¬ 
ics  over  the  last  decade. 

The  TDQM  has  been  applied  successfully  to 
study  eigenvalue  problems,  photodissociation,  Ra¬ 
man  scattering,  mode  selectivity  in  chemical  reac¬ 
tions,  gas-surface  scattering,  etc.  Deumens  et  al. 
[15]  have  presented  an  overview  of  methods  used 
for  time-dependent  treatments  of  molecules  as  sys¬ 
tems  of  elecfrons  and  nuclei. 

Tire  study  of  molecular  systems  has  tradition¬ 
ally  proceeded  by  solving  the  electronic  problem 
first  to  obtain  a  molecular  potential-energy  surface 
(PES)  with  an  associated  electronic  eigenstate.  The 
dynamics  of  the  molecule  is  then  reduced  to  the 
dynamics  of  the  nuclei  on  that  PES,  at  the  classical, 
semiclassical,  or  quantum  level.  It  was  found  that 
small  errors  in  curvature  of  the  fit  can  produce 
significantly  different  dynamics  [16,  17]. 

Manthe  et  al.  [18]  have  shown  the  wavepacket 
dynamics  on  three-dimensional  PES  by  the  multi- 
configurational  time-dependent  Hartree  (MCTDH) 
approach.  The  ability  of  the  MCTDH  approach  to 
an  accurate  description  of  a  splitting  of  wavepacket 
on  a  saddle-shaped  surfaces  have  been  discussed 
[19,20].  Many  applications  of  the  TDQM  were  per¬ 
formed  in  various  fields. 

Zhang  et  al.  [21]  presented  a  computational 
scheme  to  propagate  wavepackets  describing  sin¬ 
glet  states  of  two-electron  systems,  and  the  low- 
energy  spectrum  of  helium  was  computed.  Dehnen 
and  Engle  [22]  calculated  the  dynamics  under  the 
influence  of  a  stronger  laser  pulse  and  considered 
how  above-threshold-ionization  peaks  can  be  in¬ 
terpreted  in  terms  of  momentum-space  wave 
functions.  The  results  were  compared  with  those 
obtained  from  calculation  using  a  soft-core  poten¬ 
tial. 

Our  interests  are  in  the  study  of  electron 
wavepackets  near  the  Wannier  ridge,  strong-field 
ionization,  static  and  dynamic  hyperpolarizabili¬ 
ties  of  molecular  systems,  and  so  on.  Our  concerns 
lie  in  the  real  spatial  dynamics  of  electrons  for 
such  situations  in  molecular  systems.  The  nonlin¬ 
ear  response  to  the  external  intense  electric  field  is 


described  by  the  hyperpolarizabilities.  These  have 
been  calculated  using  several  approaches,  namely 
perturbative  or  nonperturbative  theories  [23-25] 
and  so  on  [26,  27],  We  have  newly  proposed  some 
approaches  [26,  27]  for  the  nonlinear  optical  phe¬ 
nomena  on  the  basis  of  the  electron  wavepackets 
dynamics.  In  order  to  study  the  dynamics  of  elec¬ 
trons  in  real  space,  a  wavepackets  method  is  pre¬ 
sented  and  is  applied  to  molecular  systems.  Eirst, 
we  explain  our  scheme  to  study  the  dynamics  of 
many-electron  systems  with  our  model  Hamilto¬ 
nian.  Then  we  present  three  procedures  for  com¬ 
puting  electronic  hyperpolarizabilities.  Some  sim¬ 
ple  examples  for  our  approach,  which  is  called  the 
many-electron  wavepackets  (MEWP)  method,  is 
then  demonstrated.  Einally,  these  results  are  dis¬ 
cussed  in  connection  with  the  validity  of  MEWP 
method. 


Theory  and  Our  Scheme 

OUR  SCHEME 

The  TDQM  method  consists  of  numerically 
solving  the  time-dependent  Schrodinger  equation 

d 

(1) 

where  H  is  a  time-dependent  Hamiltonian; 
is  a  wave  function.  The  atomic  unit  ^  =  m,,  =  e  =  1 
is  used  throughout  this  article.  Equation  (1)  is 
solved  by  means  of  some  algorithms  numerically. 

In  case  of  a  time-independent  Hamiltonian,  the 
solution  of  Eq.  (1)  is  expressed  by  the  time  evolu¬ 
tion  operator.  This  exponential  function  of  the  time 
evolution  operator  is  estimated  by  various 
schemes,  which  are  the  Euler  scheme,  the  simplest 
Crank-Nickolson  scheme,  the  product  formula,  the 
second-order  difference  scheme,  the  Chebyshev 
scheme  (CH),  and  so  on. 

The  CH,  which  is  classified  as  a  global  propaga¬ 
tor  method,  is  the  most  accurate  and  stable  method. 
The  CH  approximates  the  exponential  function  by 
a  Chebyshev  polynomial  expansion.  The  error  of 
the  CH  can  be  reduced  down  to  the  machine  limit. 
These  schemes  are  discussed  in  detail  by  litaka 
[28]. 

In  the  other  case  of  a  time-dependent  Hamilto¬ 
nian,  Eq.  (1)  is  solved  by  some  schemes,  which  are 
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the  Runge-Kutta  scheme,  the  Lanczos  scheme,  and 
so  on.  The  accuracy  for  the  long  time  evolution  of 
these  scheme  is  not  so  good.  But,  they  are  useful  to 
calculate  the  dynamics  for  systems  with  arbitrary 
Hamiltonian. 

Our  scheme  is  as  follows: 

1.  Our  model  Hamiltonian  is  derived. 

2.  The  primitive  initial  wave  function  is  con¬ 
structed.  By  propagating  the  time-indepen- 
dent  Schrodinger  equation  in  imaginary  time, 
which  is  called  the  relaxation  method,  the 
wave  function  of  the  ground  state  is  ob¬ 
tained. 

3.  Using  the  initial  wave  function  obtained  from 
step  2,  the  Schrodinger  equation  is  solved 
and  the  properties  are  calculated  with  the 
generated  wave  function  at  each  time  step. 


MODEL  HAMILTONIAN 


X 


A_y 
2 
Ay 


(  Ay 

0[y-y;  +  -y- 


X 


Az 

6\  z  -  z,.  -  — 


Az 


-0  z  -  z,,  -l- 


(3) 


where  d(x)  indicates  the  step  function.  Ax,  Ay, 
and  A  z  are  the  grid  intervals  when  a  finite  space  L 
is  divided  into  of  intervals  for  each  direction. 
The  iijk)  means  the  Ujk)th.  interval.  N^orm 
=  y/Ax  Ay  Az  is  the  normalization  factor.  Then, 
the  field  operators  are  expanded  in  terms  of  the 
basis  set  { as 

L  Xijk(r)arit ,  (4) 

ijk=l 


Now,  we  derive  our  model  Hamiltonian.  We 
consider  the  ab  initio  Hamiltonian  for  the  molecu¬ 
lar  systems.  In  the  Born-Oppenheimer  approxima¬ 
tion,  the  Hamiltonian  is  written  as 


where  is  the  creation  (armihilation)  oper¬ 

ator  for  the  (yfc)th  interval.  Operators  also  satisfy 
the  anticommutation  relation. 

Then,  the  Hamiltonian  in  Eq.  (2)  is  rewritten  as 


H  =  L/  dr'¥^(r) 


__V2 

2 


1  /-  1 
+  drdr''¥j:(r}'if^  (r') 


\r  -  r'\ 


X 


-  EE/ drX(r) 

a  n 

z„ 


X 


\r  -  R, 


(2) 


where  the  first  term  means  the  kinetic  part,  the 
second  is  the  electronic  interaction  part,  and  the 
third  is  the  nuclear-electron  interaction  part.  %^(r) 
[■T;,(r)]  means  the  field  operator  which  is  the  cre¬ 
ation  (annihilation)  operator  for  an  electron  with 
spin  a  at  real  spatial  three-dimensional  coordinate 
r,  and  satisfies  the  anticommutation  relation. 

Here,  we  introduce  an  orthogonal  basis  set 
{Xijki^))  as 


Xijk^r)  = 


1 


K. 


e  \  X  -  Xi 


Ax 


Ax 
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+  ar^ijkafjk] 
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where 

W  ^  ^  ^ 

^  i  jk  ^  Imn  ^  Inin  ^  i  jk 
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w, 


.  <T  O' 
'n  njk  f  ijk 
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z^+Az/2  Z, 
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X  f  ^  '  dz 

4, -A  2/2  |r 
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(7) 


For  the  first  term  in  the  right-hand  side  of  Eq.  (5), 
we  approximate  each  e  to  the  grid  interval:  2Ax£: 
->  2  A  2  A  ye  ^  2  A  and  ILze  IbiZ^ .  Then, 
it  becomes  the  kinetic  Hamiltonian  for  the  finite- 
difference  method.  In  Eqs.  (6)  and  (7),  both  inte¬ 
grals  for  the  coulomb  interaction,  which  are  called 
the  two-electron  integral,  are  carried  out  analyti¬ 
cally  and  numerically.  Therefore,  it  is  found  that 
the  matrix  elements  for  the  coulomb  interaction 
are  easily  obtained.  In  our  model,  the  Hamiltonian 
is  constructed  by  the  kinetic  term  for  the  finite-dif¬ 
ference  method  and  the  interaction  terms  are  given 
by  expansion  in  terms  of  basis  set. 

Javanainen,  Eberly,  and  Su  [29]  approximated 
coulomb  potential  to  the  softened  coulombic  form: 

1^1 


This  approximation  implies  that  the  potential  re¬ 
tains  an  asymptotic  form  of  the  coulomb  potential 
while  eliminating  the  singularity  at  the  origin.  We 
also  use  this  softened  coulombic  potential  instead 
of  our  coulomb  matrix  element  shown  as  Eqs.  (6) 
and  (7). 


PRIMITWE  INITIAL  WAVE  FENCTION 

To  obtain  the  initial  wave  function  used  in  our 
last  scheme  of  step  3,  we  explain  how  to  construct 
the  primitive  initial  wave  fimction.  Our  desired 
initial  wave  function  is  generated  from  this  primi¬ 
tive  initial  wave  function  by  the  relaxation  method. 

The  wave  function  of  particles  should  be  anti¬ 
symmetric: 

{rjo-i,  rzcr^,...,  r„cr^|)/(> 

=  -(rzcr^,  r,o-i,...,r^o-^|^>,  (9) 

where  r,  indicates  the  z'th  coordinate,  and  <7,  is  an 
up  or  a  down  spin  state.  For  an  antisymmetric 
state,  the  wave  function  may  be  written  by 


The  wave  function  of  the  form  of  Eq.  (10)  is  com¬ 
monly  called  a  Slater  determinant.  Though  each 
initial  single-particle  wave  function  is  arbitrary, 
we  consider  a  Gaussian  wavepacket  |  </>,■>  of  the 
initial  state  for  fth  electron: 


Xexp 


-  rpif 
20-7 


+  iko^irj  -  Tqi)  , 
(11) 


where  |^,>  is  the  state  vector  of  /th  electron.  (r^\ 
means  the  jth  coordinate,  is  the  spin  for  r^;  rg, 
and  kg,  are  the  grid  of  the  center  and  the  momen¬ 
tum  of  the  zth  initial  wavepacket,  respectively. 
And  (Tj  is  a  dispersion  of  ith  initial  wavepacket. 
We  call  the  wave  function  shown  in  Eq.  (10)  the 
primitive  initial  wave  function,  which  is  con¬ 
structed  by  the  single-particle  wave  function  de¬ 
fined  in  Eq.  (11).  This  wave  function  is  superposed 
by  plural  eigenstates.  Since  our  desired  initial  wave 
function  is  the  ground  state  or  a  particular  excited 
state  that  is  generated  from  the  above  primitive 
initial  wave  function  by  the  relaxation  method, 
this  primitive  initial  wave  function  is  not  so  im¬ 
portant.  But  we  note  that  it  is  significant  that  the 
initial  wave  function  is  antisymmetric. 


BOENDARY  CONDITION 

When  we  carry  out  the  relaxation  method  or  the 
calculation  of  the  time-dependent  wave  function 
and  properties,  the  boundary  condition  is  consid¬ 
ered  as  the  following: 

(  Tj <Tj , . . . ,  r,-^[ ^ g  cr,-, . . . ,  (T^  I  ik ) 

=  .  .  .  ,  <:(;■)]  =  Nj,+  l  O';  /  ■  ■  ■  , 

=  0,  (12) 


(rjo-j, 720-2,. 


<riO-]|</)i> 

<rjO-i|()>2) 


<72  <7-21^1  > 
<720-2|(/>2) 


<  72  0-2 1  (f)j^  ) 


<  O'n  I  4^2  ) 


(10) 
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where 

^ig[k<.i)]  ""  j  ^ 

(i  =  1, . . . ,  N ,  j  =  0, . . . ,  Ng  +  I,  k  =  1,2,3), 

(13) 

where  (fc  =  1, 2, 3)  means  =  x,  or  X;2  =  J/, 
or  x,3  =  z„  which  is  the  ith  coordinate;  gWj)] 
indicates  the  )th  grid  number  for  the  Xj^  direction. 
On  the  edge  grid  of  a  finite  space,  the  amplitude  of 
the  wave  function  vanishes. 


quantum  chemical  calculation  methods  can  be 
used.  In  the  calculation  of  static  hyperpolarizabili¬ 
ties,!  we  usually  use  this  finite-field  (FF)  method, 
i.e.,  finite-field  coupled-Hartree-Fock  (FF-CHF) 
method. 

For  the  time-dependent  external  field,  we  can 
calculate  the  dynamical  hyperpolarizability  by  the 
time-dependent  Hartree-Fock  (TDHF)  method  [30, 
31],  the  time-dependent  perturbation  theory 
(TDPT)  [32],  the  Floquet-Liouville  supermatrix 
(FLSM)  method  [33],  the  numerical  Liouville  ap¬ 
proach  (NLA)  [26],  and  so  on. 


MEWP-BASED  APPROACHES 


INIIMERICAL  CALCIILATIOIN 

We  can  perform  the  d3mamics  of  electrons  in 
molecular  systems  by  the  following  procedure.  Us¬ 
ing  the  model  Hamiltonian  of  Eq.  (5)  and  the 
primitive  initial  wave  function  of  Eq.  (10),  we  can 
obtain  the  ground  state  and  the  excited  states  for 
each  spin  state  by  the  relaxation  method.  We  use 
the  CH,  since  the  Hamiltonian  is  time-indepen- 
dent.  Using  the  generated  initial  wave  function, 
several  properties  and  the  wave  functions  are  cal- 


Approach  via  the  Finite-Field  Method 

This  approach  is  a  scheme  for  the  time-indepen¬ 
dent  external  electric  field.  By  our  scheme  shown 
in  the  previous  section,  the  Schrodinger  equation 
with  the  external  electric  field  is  solved  in  terms  of 
the  relaxation  method  to  obtain  the  ground  state 
and  the  total  energy.  Here,  we  also  use  the  CH. 
The  static  polarizability  and  hyperpolarizabilities 
are  written  as 


culated. 


a,./0)  = 

Approach  to  the  Nonlinear 

Optical  Properties 

A^(0)  = 

The  MEWP  method  is  considered  to  be  useful 
particularly  for  the  phenomenon  with  contribu¬ 

ri;Vc;(0)  = 

tions  from  electrons  in  spatially  extended  regions. 

dEjdEj,  I 

}  ^  /  E-»0 

dEjdEj^dEi 


E^O 


Xicicj-Wiv::/  Lxit  - —  -  j 

be  a  good  example  of  the  MEWP  approach.  In  this 
section,  after  briefly  reviewing  the  conventional 
approaches  to  calculation  of  the  nonlinear  optical 
properties,  three  new  approaches  based  on  the 
MEWP  method  applicable  for  some  different  situa¬ 
tions  are  proposed. 

CALCULATION  OF  HYPERPOLARIZABILITY 


where 


(14) 

(15) 

(16) 


PiE)  =  ixo  +  aE  +  +  yE^  +  - ,  (17) 

^otal(®)  “  ^otalO  “  Ao®  ~  ~  3/^® 

(18) 

(19) 


In  the  calculation  of  static  hyperpolarizability, 
the  finite-field  method  is  useful.  This  procedure 
consists  of  the  calculation  of  the  total  energy  or  the 
polarization  by  a  Hamiltonian,  including  the  exter¬ 
nal  electric  field,  and  the  differentiation  of  the  total 
energy  (or  the  polarization)  with  respect  to  the 
external  field.  It  is  important  to  evaluate  the  error 
caused  by  the  numerical  differentiation.  An  advan¬ 
tage  of  such  a  method  is  that  several  standard 


P(E)  is  the  polarization,  and  Utotai^E)  is  the  molecu¬ 
lar  total  energy  in  the  presence  of  electric  field.  We 
can  obtain  hyperpolarizabilities  by  numerical  dif¬ 
ferentiation  as  shown  in  Eqs.  (14),  (15),  and  (16). 

Approach  via  the  Level  Theory  (NLA) 

We  explain  another  procedure  for  the  dynami¬ 
cal  external  field.  By  our  scheme  mentioned  in  the 
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previous  section,  we  have  the  model  Hamiltonian 
and  the  initial  wave  function  in  the  absence  of 
external  electric  field  by  the  MEWP.  Then,  the 
wave  function  and  the  total  energy  of  the  ground 
state  are  obtained.  Following  Kosloff  [11],  we  can 
obtain  the  excited  states  in  order  of  increasing 
energy.  Therefore,  we  can  get  each  energy  level 
and  each  wave  function  of  the  system. 

In  the  NLA  method  [26],  which  can  provide 
nonperturbative  hyperpolarizabilities  including  the 
population  and  coherent  relaxation  effects,  we  first 
perform  the  static  analysis.  The  transition  proper¬ 
ties  which  are  the  transition  energies,  the  transi¬ 
tion  moments,  and  the  dipole  moments  are  calcu¬ 
lated  in  terms  of  the  above  results  obtained  by  the 
MEWP.  Next,  we  carry  out  the  dynamic  analysis. 
The  polarization  P(0  of  the  system  in  the  external 
electric  field,  which  has  the  frequency  w,  and  the 
field  s(t)  are  generated  by  numerically  solving  the 
Liouville  equation  including  relaxation  effects  in 
the  Runge-Kutta  scheme.  The  obtained  polariza¬ 
tion  is  converted  to  Pico}  by  the  discrete  Fourier 
transformation.  Finally,  the  intensity-dependent 
nonlinear  optical  properties  x^''\nco)  (NHG)  can 
be  obtained  by  the  definition: 


X^g'Krico) 


Pinco) 

n'‘s’'ico) 


Pinco) 

s"'ico)' 


(20) 


where  Pinco)  is  the  «th-order  harmonic  response. 
sico)  indicates  an  amplitude,  s'ico)  =  nsico)  is  the 
amplitude  of  incident  field.  Definitions  of  hyper¬ 
polarizabilities  for  other  nonlinear  optical  phenom¬ 
ena,  e.g.,  EFISH,  DFWM,  and  EFIOR,  are  given  in 
our  previous  studies  [32]. 


Direct  Approach  by  the  iVIEVVP 

Following  our  scheme  of  steps  1  and  2,  we  get 
the  initial  ground-state  wave  function  in  the  pres¬ 
ence  of  external  field.  Using  it,  the  time-dependent 
Schrodinger  equation  with  the  time-dependent  ex¬ 
ternal  electric  field  is  solved  by  the  MEWP  method 
in  the  Runge-Kutta  or  the  Lanczos  scheme.  The 
polarization  Pit)  is  calculated  directly  by  using 
the  wave  function  |i/f(f)>  for  each  time  step.  The 
polarization  Pit)  is  converted  to  Pico)  by  the 
Fourier  transformation.  And  then,  we  can  obtain 
the  nth-order  nonlinear  properties  based  on  the 
definition  of  Eq.  (20). 

An  advantage  of  this  scheme  is  that  we  need 
not  calculate  the  energy  level  of  the  system.  By 
using  the  wavepacket  method  all  schemes  have 


this  advantage.  On  the  other  hand,  we  can  calcu¬ 
late  the  energy  levels  by  our  scheme  of  step  2  or 
the  relaxation  method.  Therefore,  this  scheme 
proves  to  be  a  powerful  tool  in  studying  problems 
on  the  ionization,  dynamics  under  the  influence  of 
the  strong  external  field,  etc. 

A  defect  of  this  scheme  is  that  all  the  contribu¬ 
tions  from  the  relaxations  cannot  be  included. 
Namely,  it  is  not  sufficient  to  simulate  phe¬ 
nomenon  including  resonant  processes  by  only  this 
scheme.  However,  this  approach  will  be  a  useful 
tool  to  obtain  various  informations  for  the  systems 
of  our  interests. 


Examples 

In  this  section,  some  concrete  and  simple  nu¬ 
merical  examples  are  discussed.  For  simplicity,  we 
consider  the  one-dimensional  hydrogen  molecule, 
which  is  two-electron  and  two-nuclei  system.  First, 
we  will  demonstrate  a  propagation  of  the  time- 
independent  Schrodinger  equation  in  the  imagi¬ 
nary  time  for  this  system.  Second,  we  will  calcu¬ 
late  the  nonlinear  optical  properties  of  this  system 
by  our  MEWP  approach  via  the  finite-field  method 
(FF-MEWP  method)  mentioned  in  the  previous 
section.  Later,  another  example  with  respect  to  the 
chemical  reaction  of  H3  will  be  presented. 

IMAGIN  ARY  TIME  EVOLLTION  IN  THE 

ABSENCE  OF  E.XTERNAL  ELECTRIC  FIELD 

At  the  second  step  in  our  scheme,  we  perform 
the  relaxation  method  in  the  absence  of  external 
electric  fields  to  obtain  the  initial  wave  function  in 
the  hydrogen  molecule  system.  Table  I  summa¬ 
rizes  parameters  constructing  the  primitive  initial 
wave  function.  We  divide  a  finite  space  of  21  a.u. 
for  each  electron  into  21  grid  points.  Then,  the  grid 
intervals  A,r,  and  Axj  are  1.0  a.u.  The  first  elec¬ 
tron  wavepacket  is  put  on  the  9th  grid  point,  and 
the  second  is  set  on  the  13th  grid  point.  The  spin 


TABLE  I _ 

Parameters  used  in  MEWP  method. 


L  Ax  Wavepacket  1  Wavepacket 2  Nucleus 

21  1.0  X01  =9 

Xq2  —IS  Xq.|  —  10,  Xqp  —  12 

0 

II 

0 

i 

0 

II 

CM 

0 

=2.1 

0-2  =  2.1 

Spin-up 

Spin-down 
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state  is  the  singlet.  Each  hydrogen  nucleus  is  fixed 
on  the  10th  and  12th  grid  point.  Therefore,  the 
distance  between  atoms  is  2.0  a.u. 

By  the  relaxation  method  in  the  CH,  we  can 
obtain  the  wave  function  of  the  ground  state  and 
the  excited  states.  From  the  primitive  initial  wave 
function  and  the  generated  final  wave  function, 
the  total  energy  £  and  the  square  root  of  the 
square  average  with  respect  to  the  distance  be¬ 


tween  electrons  electron  and 

the  nucleus 

D^_„  and  nuclei 

=  ■y/((x,  -  X2))  , 

(21) 

D,_„  =  ]/ax-Xf}  , 

(22) 

(23) 

ar  e  shown  in  Table  II.  It  is  found  that  the  total 
energy  (-1.45992  a.u.)  of  the  singlet  ground  state 
is  lower  than  that  of  the  primitive  initial  state 
(-1.25275  a.u.).  For  the  distance  between  elec¬ 
trons,  the  former  is  shorter  than  the  latter.  The 
distance  between  electron  and  nucleus  also  be¬ 
comes  short  as  a  result  of  the  imaginary  time 
evolution. 

We  depict  the  results  using  other  parameters 
shown  in  Table  III.  Figure  1  displays  the  square 
amplitude  of  the  ground  states  for  the  singlet  and 
the  triplet  spin  states  and  the  singlet  first  excited 
state.  All  calculations  are  carried  out  at  the  inter- 
nuclear  distance  of  1.6  a.u.  The  square  amplitude 
of  a  wave  function  is  plotted  on  a  X2cr2 

plane.  The  x^a2  -  X2(T2  plane  is  composed  of  four 
subspaces.  There  are  the  a-a  subspace,  and  a-/3 
subspace,  the  /3-a  subspace,  and  the  (3-13  subspace 
with  respect  to  the  spin  state  for  each  r-space. 
Each  subspace  is  expanded  by  spaces  x^  and  ^2. 
For  the  triplet  ground  state,  there  is  one  node  on 
x^  =  X2  line.  This  means  that  two  electrons  are  not 


TABLE  II _ 

Total  energy  and  average  distance. 


Singlet 

Initial  wave  function 

Final  wave  function 

Energy 

-1.251748 

-1.459917 

De-e 

4.262350 

2.190121 

De-n 

2.577675 

1.669181 

Dn-n 

2.0 

2.0 

TABLE  III  _ 
Parameters. 


L 

Ax 

Wave- 
packet  1 

Wave- 
packet  2 

Nucleus 

13.1 

0.1 

0 

II 

0 

X 

0 

CO 

II 

C\J 

0 

X 

X01  =  58,  X02  =  74 

0 

11 

0 

0 

II 

<M 

0 

=  13.1 

0-2  =  13.1 

Spin-up 

Spin-down 

in  the  same  place.  Thus,  spin  space  is  symmetric, 
and  real  space  becomes  antisymmetric.  On  the 
other  hand,  it  is  found  that  there  is  one  node, 
which  is  different  from  that  for  the  triplet,  for  the 
singlet  first  excited  state.  This  node  is  on  x-^  +  X2 
=  L  line,  where  L  is  the  length  of  the  grid,  i.e., 
L  =  33.1  a.u.  And  then,  the  density  of  the  electron 
is  more  widely  expanded  than  that  of  the  ground 
state,  the  total  energy  for  the  singlet  ground  state 
is  -1.452865  a.u.,  and  that  for  the  triplet  ground 
state  becomes  -1.198768  a.u.  We  show  the  poten¬ 
tial  curves  for  the  hydrogen  molecule  by  the 
MEWP  method  in  Figure  2.  It  is  found  that  in  the 
region  where  the  internuclear  distance  is  short,  the 
difference  of  the  curvature  between  the  MEWP 
and  well-known  results  by  Kolos  and  Wolniewicz 
[34]  appears,  and  that  in  the  region  where  the 
internuclear  distance  becomes  long,  the  inequality 
almost  vanishes.  However,  the  qualitative  shape  of 
the  potential  curves  obtained  by  the  MEWP  is 
nearly  equal  to  that  of  Kolos  et  al. 

These  results  indicate  the  validity  of  our  scheme 
and  numerical  calculations  for  qualitative  studies. 

NONLIIXEAR  OPTICAL  PROPERTIES  VIA  THE 

FirVITE-FIELD  MEWP  METHOD 

Next,  we  calculate  the  total  energy  by  the  imag¬ 
inary  time  evolution  under  static  external  electric 
field.  The  same  parameters  shown  in  Table  I  are 
used.  For  each  intensity  of  the  electric  field,  we 
obtain  the  properties  shown  in  Table  IV.  Following 
our  definition,  the  polarizability  and  the  hyperpo¬ 
larizability  are  calculated  straightforwardly.  How¬ 
ever,  since  we  consider  the  spatial  one-dimen¬ 
sional  system,  they  will  correspond  to  (3^^^, 
and  y^xxx'  From  these  results,  the  polarizability 
becomes  12.2395  a.u.,  and  the  hyperpolariz¬ 
ability  y^cxxx  is  306.6565  a.u.  Certainly,  the  value  of 
the  (3^^^  vanishes.  In  order  to  confirm  the  applica- 
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FIGURE  1 .  Square  amplitudes  for  the  singlet  and  triplet  states.  The  distance  between  each  nucleus  is  fixed  by  1 .6  a.u. 
The  amplitude  for  the  singlet  ground  state  is  shown  in  Figure  1  (a).  Figures  1  (b)  and  1  (c)  display  those  for  the  triplet 
ground  state  and  the  singlet  first  excited  state. 


bility  of  the  FF-MEWP  method  to  the  calculation 
of  hyperpolarizabilities,  we  should  further  investi¬ 
gate  the  qualitative  behaviors,  e.g.,  variations  in 
hyperpolarizabilities  during  the  bond  dissociation, 
in  comparison  with  ab  initio  results. 

OTHER  EXAMPLE 

Finally,  we  show  the  results  of  the  chemical 
reaction  of  an  H3  system  which  is  a  spatial  one¬ 
dimensional  three-electron  system.  Parameters  are 
summarized  in  Table  V.  Grid  size  is  20.4  a.u.,  and 
the  number  of  the  intervals  is  51.  Therefore,  the 


grid  interval  becomes  0.4  a.u.  Calculations  are  car¬ 
ried  out  under  a  condition  such  that  both  inter¬ 
atomic  distances  from  the  central  nucleus  are  equal. 

We  depict  the  wave  function  of  the  H3  system 
in  Figure  3.  Figure  3  shows  that  the  wave  function 
of  three  electron  system  consists  of  eight  sub¬ 
spaces.  In  a  subspace  (x^  p  -  X20i  -  x^a),  the  ini¬ 
tial  wave  function  and  the  final  wave  function 
obtained  by  the  relaxation  method  are  illustrated 
in  Figures  3(b)  and  (c),  respectively.  These  figures 
indicate  that  the  initial  wave  function  is  modified 
into  the  shape  expected  from  electron-electron 
coulombic  repulsion. 
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0.0  1.0  2.0  3.0  4.0  5.0  6.0  7.0 

Inter  atomic  distance  (  a.u. ) 

FIGURE  2.  Potential  curve  of  a  one-dimensional  hydrogen  molecule.  The  potential  curves  for  the  singlet  and  triplet 
spin  states  are  shown. 


Figure  4  shows  a  potential  curve  of  the  H3 
system.  When  the  interatomic  distance  is  2.0  a.u., 
the  total  energy  takes  the  lowest  value:  —2.12039 
a.u.  On  the  other  hand,  total  energy  of  the  one-di¬ 
mensional  hydrogen  atom  is  —0.67111  a.u.,  and 
that  of  the  one-dimensional  hydrogen  molecule  is 
—  1.45434  a.u.  Therefore,  because  the  sum  of  these 
total  energies  is  -2.12546  a.u.,  the  energy  barrier 
for  the  reaction  becomes  0.00506489  a.u.  The  result 
of  Liu  [35]  by  an  exact  ab  initio  calculation  gives 
0.01593600102  a.u.  Our  result  is  lower  than  that  of 
B.  Liu,  but  these  results  suggest  that  a  qualitative 
calculation  for  three  electron  systems  or  more  elec¬ 
tron  systems  will  be  able  to  be  performed. 


TABLE  IV _ 

Total  energy  and  average  distance  for  each 
electric  field. 


Electric 

field 

Energy 

Oe-e 

De-n 

Dn-n 

0.000 

- 1 .45992 

2.19012 

1.66918 

2.00000 

0.005 

- 1 .46007 

2.19070 

1.68804 

2.00000 

0.010 

- 1 .46053 

2.19248 

1.70797 

2.00000 

0.015 

-1.46130 

2.19551 

1.72905 

2.00000 

Concluding  Remarks 

In  this  study,  we  have  extended  the  wavepacket 
method  to  many-electron  systems  in  a  molecule 
and  have  proposed  some  wavepacket  approaches 
to  study  of  the  nonlinear  optical  properties.  From 
some  examples  presented  here,  the  MEWP  method 
is  shown  to  be  applied  to  the  qualitative  studies 
for  energies  and  properties  for  small  molecular 
systems. 

The  MEWP  method  includes  the  solving  pro¬ 
cess  of  the  spatially  three-dimensional  or  mathe¬ 
matically  3N-dimensional  Schrodinger  equation. 
Therefore,  it  is  difficult  to  calculate  properties  of  a 
such  system  which  is  composed  of  a  great  number 


TABLE  V 


Parameters  for  H3.® 

L  Ax 

Wavepacket  1 

Wavepacket  2  Wavepacket  3 

20.4  0.4 

^01 

X02  —  26 

^03 

0 

II 

0 

^02  “  ^ 

0 

II 

CO 

0 

=  5.1 

(T2  =  5.1 

tTg  =  5.1 

Spin-up 

spin-down 

Spin-up 

^Distance  XqAx  =  (xo2  - 

26)Ax  =  (26  - 

XqJAx. 
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b 


c 


FIGURE  3.  Space  and  spin  coordinates  for  one-dimensional  three-electron  system  is  shown  in  Figure  3(a).  Figures 
3(b)  and  3(c)  show  the  initial  wave  function  and  the  final  wave  function  is  a  subspace  (x^  /3  -  Xga  -  Xga). 
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A 
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-2.14 

] 

1 

1  t 

1 
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3  4 

5  6 

Distance  (a.u.) 

FIGURE  4.  Potential  curve  of  a  one-dimensional  H3  system  with  equal  H  -H  bond  lengths. 


of  particles.  But  we  will  apply  this  method  to  a 
wide  variety  of  problems  which  include  the  dou¬ 
ble-ionization  threshold,  the  dynamics  is  highly 
excited  states,  electronic  nonlinear  optical,  and 
magnetic  properties  by  external  static  and  dy¬ 
namic  optical  and  magnetic  fields. 
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ABSTRACT _ _ _ 

A  stochastic  path-integral  (SPI)  technique  for  chemical  reaction  dynamics  is  explored.  It 
is  shown  that  this  technique  enables  the  direct  computation  of  the  transition  amplitude 
with  a  finite  space-time  range,  by  generating  a  set  of  classical  paths  subject  to 
simultaneous  stochastic  differential  equations.  The  numerical  values  of  the  Boltzmann 
matrix  elements  for  a  harmonic  potential  are  in  good  agreement  with  the  analytical  ones. 
Within  the  quantum  transition  state  theory,  the  flux-flux  autocorrelation  function  is  also 
evaluated  at  630  K  for  the  H  +  H2  exchange  reaction  and  is  found  to  give  a  satisfactory 
agreement  with  the  previous  studies.  To  appraise  the  influence  of  the  dimensionality, 
both  one-dimensional  Eckart  potential  and  a  full  three-dimensional  (3D) 
Liu-Siegbahn-Truhlar-Horowitz  (LSTH)  potential  calculations  have  been  performed. 
The  calculated  values  of  the  Boltzmann  matrix  elements  for  the  colinear  and  the  full  3D 
cases  are  found  to  deviate  slightly  from  each  other  in  the  lower  temperature  range.  The 
3D  thermal  rate  constant  is  in  very  good  agreement  with  the  previous  one.  ©  1996  John 
Wiley  &  Sons,  Inc. 


Introduction 

So  far,  there  have  been  a  number  of  studies  in 
the  field  of  chemical  reaction  dynamics  where 
the  quantum  mechanical  rate  constant  is  defined 
through  a  path-integral  formulation.  In  doing  so, 
the  relevant  flux-flux  autocorrelation  functions 


consist  of  complex  time  propagators  with  a  finite 
space-time  range  [1,  2].  Path-integral  approaches 
suggested  to  date  for  the  estimation  of  the  desired 
thermodynamic  or  dynamic  quantities  generally 
involve  either  iterated  "short-time''  approximation 
using  the  Trotter  formula  or  the  use  of  Fourier 
expansions  of  the  path  involved  [3].  However,  it  is 
still  a  matter  of  great  concern  for  theoretical 
chemists  to  treat  realistic  chemical  phenomena 
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where  quantum  and  classical  degrees  of  freedom 
exist  inevitably  at  the  same  time  or,  in  other  words, 
to  overcome  the  multidimensionality  problem. 

The  stochastic  quantization  method  (SQM)  [3] 
has  succeeded  in  solving  a  variety  of  problems  in 
high-energy  physics  [4-7].  The  prescription  of  SQM 
to  obtain  the  usual  quantum  mechanics  depends 
on  the  fact  that  propagators  and  related  quantities 
can  be  reduced  to  n-point  functions  of  the  opera¬ 
tors  [8,  9].  Also,  the  SQM  is  based  on  the  equality 
between  the  path-integral  measure  and  the  Fok- 
ker-Planck  measure.  The  latter  can  be  obtained  as 
an  equilibrium  distribution  for  dynamical  vari¬ 
ables  by  solving  a  stochastic  differential  equation 
(SDE)  (e.g.,  the  Langevin  equation)  [10]. 

In  the  field  theory  for  elementary  particles, 
where  SQM  has  been  developed,  the  Feynman 
boundary  condition  is  implicitly  assumed  [9]; 
therefore  the  two-point  functions  O.e.,  the  vacuum 
expectation  values  of  field  operators)  are  nothing 
but  the  transition  amplitudes  themselves  [8,  9].  On 
the  contrary,  in  quantum  statistical  mechanics  for 
finite  temperature  and  nonrelativistic  quantum 
mechanics,  i.e.,  the  realm  of  chemical  reactions,  it 
is  not  appropriate  to  treat  a  transition  amplitude  as 
a  vacuum-to-vacuum  expectation  value  in  the 
presence  of  a  source  [8].  Instead,  the  boundary 
conditions  for  the  motion  of  classical  particles 
should  be  such  that  x{tjr)  =  Xjr  and  x(t,)  =  x-,  i.e., 
boundary  conditions  with  a  finite  space-time  range 
(the  Dirichlet  boundary  condition  [8,  9]). 

Therefore,  in  this  article,  a  new  possible  proce¬ 
dure  of  the  path-integral  method  to  obtain  a  transi¬ 
tion  amplitude  with  a  finite-time  range  is  pro¬ 
posed  by  combining  the  quantum  transition  state 
theory  with  SQM,  namely,  the  stochastic  path-in¬ 
tegral  (SPI)  method  for  chemical  reaction  dynam¬ 
ics.  Simple  examples  are  demonstrated  by  using  a 
harmonic  potential,  a  one-dimensional  (ID)  Eckart 
potential  and  a  full  three-dimensional  (3D)  Liu- 
Siegbahn-Truhlar-Horowitz  potential  for  the  ther¬ 
mal  exchange  reaction  of  the  H  +  H2  system. 


Theoretical  Treatments 


(the  fifth  dimensional  parameter  with  the  dimen¬ 
sion  T^/M)  and  is  introduced  just  to  describe  the 
quantum  mechanical  fluctuation  around  the  classi¬ 
cal  most  probable  path  as  a  hypothetical  stochastic 
process: 

— ^ +'n(u,T),  (1) 

where  x  denotes  a  set  of  Cartesian  coordinates 
describing  the  dynamics  and  depends  not  only  on 
the  inverse  of  temperature  (or  the  Euclidean  time) 
u  (0  <  ii  <  fih/l)  but  also  on  the  fictitious  time  r; 
S£  is  an  Euclidean  action  functional  that  enables 
us  to  calculate  the  partition  function  and  is  a 
random  number  vector  whose  components  are 
Gaussian  white  noises  subject  to 

<17^(1/,  t)>  =  0,  (2) 

r'))  =  2h8ij8(u  -  i08{t  ~  r). 

(3) 


The  brackets  (...)  stand  for  the  average  over  x]'s. 
By  solving  Eq.  (1)  under  the  most  suitable  bound¬ 
ary  conditions  for  each  problem,  we  obtain  a 
Markov  process  x(  u,  r )  reflecting  not  only  thermal 
but  also  quantum  mechanical  characteristics  of  the 
system.  Then,  one  can  retrieve  quantum  mechanics 
in  the  thermal  equilibrium  limit  of  the  hypothetical 
stochastic  process  with  respect  to  the  parameter  r. 

The  probability  distribution  functional  P(x;t) 
with  respect  to  x  satisfies  the  Fokker-Planck  equa¬ 
tion  [11,  12]; 


dP(x;  t) 
St 


8  1  8S^ 
8x  h  8x 


P(x;t),  (4) 


which  yields 

P(x)  =  lim  P(x;  r)  =  N  •  expf-Sg/^)  (5) 

r— >  ^ 

at  the  thermal  equilibrium  limit  with  respect  to  r, 
N  being  the  normalization  constant  [4,  11,  12]. 
Thus,  one  can  find  that  the  Fokker-Planck  mea¬ 
sure  at  the  infinite  fictitious  time  limit. 


STOCILISTIC  OTArSTIZ ATION  METHOD  Ai\D  ,  =p  {  \  a 

THE  BOLTZIVIANA  iVlATRIX  ELEMENTS 


For  such  problems  in  quantum  statistical  me¬ 
chanics  at  temperature  T,  SQM  assumes  the  fol¬ 
lowing  simultaneous  SDE  with  respect  to  a  new 
parameter  r  which  is  called  "the  fictitious  time'' 


is  nothing  but  the  path-integral  measure  [7]. 

One  typical  procedure  for  extracting  quantum 
mechanical  information  is  to  calculate  the  two- 
point  function  A^jiu  -  i/),  i.e.,  two-time  Green 
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function,  by  using  the  following  relationships  [6,  9, 

10], 

A,y(w  -  w')  =  N jD[\]Xi(u)Xjiu')exp(-S2/fi) 

=  Dijiu  -  u';0),  (7) 

Dfjiu  -  u'-,T  -  t') 

=  lim  <x,(m;  t)x  (m';  t')>.  (8) 

3C 

|t-  r'|  =  fixed 

However,  in  this  article,  we  will  pursue  a  direct 
method  which  can  reproduce  the  path-integral 
measure  under  the  Dirichlet  condition  [Eqs.  (11) 
and  (12)].  This  is  a  most  profitable  method  for 
describing  time-dependent  phenomena  in  low-en¬ 
ergy  sciences  as  chemical  reaction  dynamics.  In 
that  case,  the  transition  amplitude  of  the  Boltz¬ 
mann  operator  (the  Euclidean  time  evolution  oper¬ 
ator)  is  expressed  as  follows: 


boundary  conditions: 

(  x(w^-,  r)  -  x(w^,0)  =  x^-  (12) 

i  x(m^,t)  =  x(i/y,0)  =  Xj:  (12) 

throughout  the  fictitious  time  generation.  Only  if 
the  equilibrium  is  reached,  can  we  construct  an 
ensemble  of  paths  {x(w,  r)}  for  the  boundary  condi¬ 
tion  with  two  fixed  Euclidean  space-time  bound¬ 
aries  [4,  11,  12].  The  pictorial  illustration  for  the 
scheme  is  shown  in  Figure  1. 

THERMAL  RATE  COIVSTANT  EXPRESSION 

In  the  chemical  reaction  rate  theory,  it  is  well 
known  that  the  thermal  rate  constant  is  expressed 
as 

HT)  ^  r dtCAt),  (13) 

•'o 


<x^,  Uf\x-,  w-> 

=  <x^|exp(-wH)|xy> 


.  r  1  riif  I  dxiu)\ 

=  N/D[x]exp|--(^^  duL,\xiu),-^j 

(9) 

where  denotes  an  Euclidean  Lagrangean: 

/  dx(u)\  1  dx(u)  dx(u) 

lJx(i/),  — -  =  -  - - - M— -  +  y(x(w)), 

du  j  2  du  du 

(10) 


where  Qj^  is  the  partition  function  normalized  at 
the  reactant  state  per  unit  volume  and  Cjr(t)  is  the 
flux-flux  autocorrelation  function  defined  as 


=  Tr[F  expiu''*H)F  expi-u'^H)] 


- - —  I  <x^|exp(  -  u‘^H)\x  ;>  1^ 

dXf  dX:  ‘  ^ 


with  the  complex  time 


and  M  is  a  mass  matrix.  For  our  purpose,  one  has  ^  ^ _  (15) 

to  solve  the  SDE  (1),  imposing  the  following  2  h 


FIGURE  1 .  Schematic  diagram  for  the  stochastic  path-integral  method.  The  space  axis  is  represented  here  by  one 
dimension.  The  boundary  condition  is  x(Uf)  =x(Uj)  =Xf(=Xj)  =  0. 
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and  the  flux  operator 


F 


8(x) 


(16) 


In  Eq.  (14),  the  transition  amplitude  is  defined  as 
<x^-|exp(“Z/‘'H)|x^-> 


•  A(x/-,x,;^/2  +  it/fi).  (17) 


The  left-hand  side  is  the  analytic  continuation  of 
the  transition  amplitude  of  the  Boltzmann  operator 
[Eq.  (9)], 


piXfrXi)  =  {Xj,  I3fi/2\x^,0) 

=  <x^|exp(-ySH/2)|x,>,  (18) 

which  is  related  to  the  Boltzmann  rate  constant  if 
the  Euclidean  time  is  analytically  continued  to  a 
complex  time  f}  f3/2  +  it  [1].  As  a  result,  for  esti¬ 
mation  of  the  rate  constant,  one  finally  needs  the 
analytic  continuation  (a.c.)  of  the  SQM  value  for 
the  potential  factor  A(x^,  x,;  (3/2)  of  the  transition 
amplitude  as  follows: 

A(x^-,X;;  (3/2  +  it/h)  =  a.c.  A(x.,x^;  (3/2).  (19) 


The  H  +  Exchange  Keaction 

SIMPLE  i;\/V\IPLE:  FIIE 

HARMONIC  BARRIER 

In  the  present  study,  we  employed  two  ID  and 
one  3D  analytic  potentials  for  the  proton  exchange 
reaction  in  the  H  +  H2  system.  The  first  one  was  a 
harmonic  potential: 

V{x)  =  (20) 

where  the  parameters  were  co  ^  4.00  X  10^^  (s“Q 
and  ffi  =  1.67  X  10  (g).  These  values  were  opti¬ 

mally  chosen  to  model  a  colinear  H3  molecular 
system. 

For  each  numerical  simulation,  the  Euclidean 
time  axis  was  discretized  into  50  segments  be¬ 
tween  0  to  13  ft/ 2  and  that  of  the  fictitious  time  into 
10,000  segments.  By  setting  an  arbitrary  initial 
condition  for  the  fictitious  time,  Eq.  (1)  was  solved 
and  the  latter  5000  paths  were  chosen  to  make  an 
ensemble  where  /x  and  v  are  integral 


suffixes  in  the  ranges  between  0  and  50  and  be¬ 
tween  1  and  5000,  respectively.  It  must  be  noted 
that  each  path,  specified  by  p,  satisfies  strictly  the 
Dirichlet  boundary  condition  x(tiQ,  r^)  =  x(u^q,  t^) 
=  0.  To  know  statistical  error  estimation,  we  per¬ 
formed  a  set  of  10  simulations  for  each  tempera¬ 
ture  by  generating  10  sets  of  random  numbers.  For 
each  calculation  at  different  temperatures,  the  same 
element  of  the  same  set  of  random  numbers  was 
assigned  to  the  same  lattice  points.  Each  run  de¬ 
manded  about  10  s  of  CPU  time  in  the  IBM 
RISC/ 6000  355  workstation. 

In  Table  I,  the  calculated  values  of  the  matrix 
elements  of  the  Boltzmann  operator  (0|exp(  — /3H/ 
2)|0)  (the  Euclidean  time  transition  amplitude)  for 
the  harmonic  barrier  are  shown  along  with  the 
averages,  variances,  and  standard  deviations,  ab¬ 
breviated  to  Avg.,  Var.  and  S.d.,  respectively.  The 
exact  values  of  the  matrix  elements  obtained  by 
analytic  expressions  are  also  presented  for  compar¬ 
ison  [13].  The  agreement  is  very  good  especially 
for  temperatures  higher  than  500  K  in  spite  of  the 
naive  center-difference  algorithm.  Although  we 
originally  used  the  first-order  difference  scheme  in 
the  numerical  algorithm  and  obtained  slightly 
worse  values,  a  second-order  difference  scheme 
was  later  found  to  improve  those  values  quite 
well.  However,  as  is  well  known  in  the  quantum 
mechanical  simulation,  the  systematic  error  be¬ 
came  larger  as  the  temperature  was  lowered  [14]. 
At  300  K,  the  systematic  error  in  Avg.  results  in 
~  -^40%  in  comparison  to  the  exact  value,  al¬ 
though  the  accidental  error  is  found  to  be  small 
and  with  almost  the  same  magnitude,  i.e.,  2.6- 
6.4%,  throughout  the  whole  temperature  range. 

In  Figure  2,  the  temperature  dependence  of  the 
matrix  element  is  plotted  for  the  Avg.  and  Avg.  + 
S.d.  values  by  comparison  to  the  exact  ones.  Since 
the  transition  amplitude  itself  becomes  smaller  as 
the  temperature  becomes  lower,  the  absolute  val¬ 
ues  of  accidental  errors  at  low  temperature  become 
small  due  to  the  uniformity  of  percentage  of  the 
accidental  error. 


ONE-D!^IP\SIO^AL  ECK.\RT  POTENTIAL 
AND  FELL  3-DIMENSI()N  AL 
LSTIl  POTENTIAL 

For  demonstration  of  the  SPI  method,  we  em¬ 
ployed  two  more  realistic  potentials  for  describing 
the  H  -h  H2  exchange  reaction.  In  the  first  case,  we 
adopted  a  ID  Eckart  potential,  which  is  often  em¬ 
ployed  as  a  model  potential  in  a  symmetric  Sisj2 
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TABLE  I _ _ _ _ _ _ 

Matrix  elements  of  the  Boltzmann  operator  <s,|exp(-H/2fcT)|s,>  in  the  case  of  s,  =  s,  =  0  for  the  harmonic 
oscillator  (all  values  were  divided  by  10°). 


T(K) 


Data 

300 

400 

500 

600 

700 

800 

900 

1000 

1 

0.5768 

0.8106 

1.0498 

1 .2781 

1.4948 

1 .6993 

1.8915 

2.0718 

2 

0.6052 

0.9184 

1.0112 

1.1967 

1 .3739 

1 .5470 

1.7171 

1 .8836 

3 

0.5807 

0.8322 

1.1541 

1 .3950 

1.6152 

1.8175 

2.0046 

2.1783 

4 

0.6372 

0.8082 

1.0693 

1.2955 

1.5114 

1.7157 

1.9079 

2.0881 

5 

0.5604 

0.8900 

1.1727 

1.4061 

1.6218 

1.8217 

2.0074 

2.1803 

6 

0.5565 

0.8073 

1.0384 

1.2460 

1.4360 

1.6146 

1.7851 

1 .9494 

7 

0.5965 

0.8875 

1.1597 

1 .4070 

1.6306 

1 .8342 

2.0212 

2.1943 

8 

0.5845 

0.8644 

1.1314 

1.3801 

1.6094 

1.8198 

2.0130 

2.1911 

9 

0.6476 

0.9375 

1.2010 

1 .4368 

1 .6490 

1.8416 

1 .9454 

2.1826 

10 

0.5666 

0.7725 

1 .0847 

1 .3034 

1 .5032 

1 .6903 

2.0067 

2.0371 

Avg. 

0.5912 

0.8529 

1.1072 

1 .3345 

1 .5445 

1 .7402 

1 .9300 

2.0957 

Var. 

0.0010 

0.0030 

0.0042 

0.0066 

0.0089 

0.0107 

0.0112 

0.0123 

S.d. 

0.0310 

0.0547 

0.0648 

0.0810 

0.0941 

0.1033 

0.1060 

0.1110 

Avg.+  S.d. 

0.6222 

0.9076 

1.1721 

1.4155 

1 .6386 

1 .8434 

2.0360 

2.2066 

Avg.  -  S.d. 

0.5602 

0.7982 

1 .0424 

1.2535 

1 .4504 

1 .6369 

1 .8240 

1 .9847 

Analytical^ 

0.3523 

0.6659 

0.9764 

1 .2620 

1.5190 

1 .7490 

1 .9560 

2.1450 

T(K) 

Data 

1100 

1200 

1300 

1400 

1500 

1600 

1700 

1800 

1 

2.2409 

2.3997 

2.5494 

2.6908 

2.8250 

2.9525 

3.0743 

3.1908 

2 

2.0456 

2.2023 

2.6353 

2.4988 

2.6383 

2.7723 

2.9009 

3.0246 

3 

2.3404 

2.4923 

2.5651 

2.7705 

2.8990 

3.0213 

3.1384 

3.2507 

4 

2.2571 

2.4158 

2.6370 

2.7062 

2.8398 

2.9669 

3.0881 

3.2042 

5 

2.3421 

2.4939 

2.4084 

2.7723 

2.9009 

3.0234 

3.1406 

3.2530 

6 

2.1080 

2.2610 

2.6488 

2.5501 

2.6864 

2.8174 

2.9434 

3.0646 

7 

2.3555 

2.5066 

2.6540 

2.7833 

2.9110 

3.0327 

3.1491 

3.2609 

8 

2.3561 

2.5099 

2.6175 

2.7897 

2.9182 

3.0404 

3.1571 

3.2689 

9 

2.3360 

2.4806 

2.4992 

2.7480 

2.8726 

2.9922 

3.1072 

3.2181 

10 

2.1985 

2.3524 

2.5369 

2.6393 

2.8726 

3.0239 

3.1417 

Avg. 

2.2580 

2.4114 

2.5752 

2.6949 

2.8364 

2.9577 

3.0723 

3.1877 

Var. 

0.0123 

0.0119 

0.0063 

0.0104 

0.0094 

0.0095 

0.0080 

0.0073 

S.d. 

0.1108 

0.1090 

0.0794 

0.1022 

0.0971 

0.0976 

0.0894 

0.0852 

Avg.+  S.d. 

2.3688 

2.5204 

2.6545 

2.7971 

2.9335 

3.0553 

3.1617 

3.2729 

Avg.  -  S.d. 

2.1472 

2.3025 

2.4958 

2.5927 

2.7393 

2.8601 

2.9829 

3.1026 

Analytical^ 

2.3170 

2.4770 

2.6250 

2.7650 

2.8960 

3.0210 

3.1390 

3.2530 

®  Analytical  values  are  calculated  by  the  analytic  formula  in  Feynman  and  Hibbs  (Ref.  [13]). 


type  of  chemical  reaction  [1-3]:  Siegbahn,  Truhlar,  and  Horowitz  (LSTH),  was  uti¬ 

lized  [15-17].  This  has  been  extensively  used  for 
V(x)  =  VoSech'^(x/a),  (21)  investigating  the  H  -F  exchange  reaction  when 

checking  accuracy  of  different  quantum  mechani- 
where  Vq  =  7.50  (kcal/mol)  and  a  =  3.15  (A).  In  cal  treatments. 

the  second  case,  to  understand  better  the  influence  In  Figure  3,  the  temperature  dependence  of  the 

of  dimensionality  for  this  quantum  mechanical  matrix  element  is  plotted  for  the  ID  Eckart  poten- 

process,  the  full  3D  potential  prepared  by  Liu,  tial  and,  in  Table  II,  nine  sets  of  fhe  matrix  ele- 
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♦  Avg.+S.d.; 
■  Avg.-S.d. 

A  Avg. 

X  Analytical 


T(/K) 


FIGURE  2.  Temperature  dependence  of  the  transition 
amplitude  for  the  harmonic  potential  [Eq.  (20)]. 


merits  relating  to  the  potential  factor  A(  =  0, 
X-  =  0;  wO  and  nine  corresponding  values  of  the 
time  correlation  function  Cy(f)  are  shown  from  0 
to  20  fs  at  630  K  in  Figure  4.  The  numerical  value 
estimated  by  C^(f)  is  3.317  X  10^^  s“^  for  the  ther¬ 
mal  rate  constant  multiplied  by  Qj^  (Table  III)  and 
was  found  to  give  a  reasonable  value  when  the 
previous  values  in  the  temperature  range  between 
200  and  400  K  [3]  are  extrapolated  (Fig.  5). 

On  the  other  hand,  the  temperature  dependence 
of  the  Boltzmann  matrix  element  for  the  LSTH 
potential  is  plotted  in  Figure  6  for  both  the  colin- 
early  restricted  and  the  full  3D  cases  at  the  same 
time.  The  corresponding  C^(f)'s  at  630  K  are  ob¬ 
tained  as  shown  in  Figure  7  and  result  in  6.272  X 
10^  and  2.492  X  lO"^  s~^  for  /cQ^,  respectively.  For 


t  (/fs) 


FIGURE  4.  Flux -flux  autocorrelation  function  for  the 
Eckart  potential,  Eq.  (21),  with  Vq  =  7.50  kcal/mol  and 
a  =  3.15  A,  under  a  temperature  of  630  K. 


the  sake  of  direct  comparison  of  the  rate  constant 
values  themselves,  the  reactant  partition  function 
per  unit  volume  for  the  3D  case  [18~20]: 


Qr 


(22) 


was  calculated  for  distinguishable-atom  H2  with¬ 
out  nuclear  spin  [18,  19,  211,  M  is  the  reduced 
mass,  j  is  the  vibrational-rotational  energy  of 
the  reactant  H2  diatom  [19].  With  the  numerical 
value  1.858  X  10^^  molecule  cm“^  of  for  the 
parameter  set  A  in  Ref.  18(a),  the  present  3D  ther- 
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FIGURE  3.  Temperature  dependence  of  the  transition 
amplitude  for  the  Eckart  potential  [Eq.  (21)]. 


T  (/K) 


♦  Ref.  3(a) 
■  Present 


FIGURE  5.  Temperature  dependence  of  the  thermal 
rate  constant  for  the  Eckart  potential.  The  previous  values 
(♦)  are  shown  also  with  the  present  one  (■). 
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STOCHASTIC  PATH-INTEGRAL  METHOD 


irAL^iub.  II  - - - - - - ; - ;; - -  7;  ,  »  _ 

Matrix  elements  relating  to  the  potential  factor  A(Sf ,  s,-;  u^)  and  the  time  correlation  function  Cf{t)  for 
s,  =  s,  =  0  (T  =  630  K).  _ _ _ 


w  X. 

t  (fs) 

ReA(0,0:uO 

(xlO-2) 

A(0,0;u‘') 

lmA(0,0:u‘') 

(X10"2) 

|A(0,0;u“)|^ 

(xIO"®) 

c,{t) 

(Xl0^''s  ^) 

0.00 

5.023 

0.000 

2.5227 

5.463 

1.00 

4.424 

-2.378 

2.5228 

5.247 

2.00 

2.771 

-4.771 

2.5231 

4.679 

3.00 

4.573 

-5.004 

2.5249 

3.936 

4.00 

- 1 .967 

-4.631 

2.5312 

3.187 

5.00 

-3.931 

-3.168 

2.5484 

2.534 

10.00 

-0.024 

5.523 

3.0503 

0.920 

15.00 

6.249 

0.585 

3.9386 

0.449 

20.00 

4.840 

-4.661 

4.5154 

0.239 

TABLE  III _ - _ 

Thermal  rate  constant  as  kQf,  for  the  Eckart, 
colinear  and  3D  models  of  the  H  +  exchange 
reaction  (T  =  630  K). 


kOf)  (s 

r(K) 

Eckart 

Colinear 

3D 

630 

3.31 7E  -H  10 

6.272E  -1-  09 

2.492E  -t  09® 

^  The  corresponding  rate  constant  k  (T  =  630  K)  is  1 .341  x 
10"'®  (cm®  molecule"'  s“'). 


mal  rate  constant  k  (T  =  630  K)  is  converted  to 
1.341  X  10“’^  cm^  molecule”^  s“S  which  is  in 
very  good  agreement  with  the  previous  result  in 
Table  IX  in  Ref.  [21].  However,  the  value  is  about 
10  times  larger  than  that  (~  2.000  X  10'^^  cm^ 


0.00  5.00  10.00  15.00  20.00  25.00  30.00 


t  (/fs) 

FIGURE  7.  Flux -flux  autocorrelation  function  for  the 
LSTH  potential  in  the  colinearly  restricted  and  the  full 
three-dimensional  cases,  under  a  temperature  of  630  K. 


o  (2olinearH3 
a3DH3 


T(/K) 


FIGURE  6.  Temperature  dependence  of  the  transition 
amplitude  for  the  LSTH  potential  in  the  colinearly 
restricted  and  the  full  three-dimensional  cases. 


molecule'^  s'^)  in  Ref.  [18(a)]  since  we  have  uti¬ 
lized  the  same  LSTH  potential  as  Ref.  [21]  al¬ 
though  Ref.  [18(a)]  adopted  the  double  many-body 
expansion  surface  [18(b)]. 

Furthermore,  comparing  each  matrix  element 
value  at  the  same  temperature  (Fig.  6),  one  can 
understand  that  the  smaller  value  for  the  3D  case 
than  for  the  colinear  case  (Table  III)  is  originating 
from  its  relative  smaller  values.  Because  both  are 
based  on  the  same  LSTH  potential,  the  relative 
difference  should  come  from  the  dimensionality  in 
the  two  cases.  Namely,  one  should  notice  that,  in 
the  colinear  case,  there  are  fewer  bath  coordinates 
orthogonal  to  the  reaction  coordinate  and,  there¬ 
fore,  each  trajectory,  which  determines  the  quan¬ 
tum  mechanical  transition  amplitude  as  a  whole, 
stays  for  relatively  shorter  time  than  in  the  3D 
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case.  Accordingly,  the  difference  in  the  time  of 
stay  results  in  relatively  larger  matrix  elements 
and  rate  constant  for  the  colinear  potential. 


Concluding  Remarks 

In  the  light  of  the  previous  examples,  we  sug¬ 
gest  that  the  stochastic  path-integral  method  can 
become  a  promising  strategy  for  many  applica¬ 
tions  in  the  field  of  low-energy  science,  as  demon¬ 
strated  here  for  a  chemical  reaction.  Furthermore, 
it  should  be  noted  that  there  exist  some  merits  in 
this  method  because  it  is  based  on  the  equation  of 
motion  [13].  In  addition,  we  are  also  convinced 
that  the  present  method  will  be  regarded  as  an 
important  milestone  in  the  progress  of  SQM  in  the 
low-energy  material  sciences. 

Although  one  might  regard  the  present  method 
just  as  one  of  the  importance  sampling  methods,  in 
the  sense  that  the  ensemble  is  constructed  from  the 
solutions  of  a  stochastic  differential  equation, 
which  reflects  the  equation  of  motion  of  the  sys¬ 
tem,  the  method  needs  not  assume  any  approxi¬ 
mation  with  respect  to  the  potential  functional 
forms,  in  comparison  to  usual  treatments.  There¬ 
fore,  this  approach  should,  in  principle,  work  for 
most  of  the  systems  with  any  analytical  or  numeri¬ 
cal  force  fields  for  solving  the  stochastic  equation 
of  motion.  It  is  the  recent  development  of  quantum 
chemistry  which  enabled  us  to  realize  analytic 
potential  functions  for  various  realistic  chemical 
reactions  that  motivate  us  to  do  the  present  study. 
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ABSTRACT  _ _ _ _ _ _ 

Dirac  bra-ket  notation  is  introduced  for  the  Whittaker  cardinal  (Sine)  functions  and  a 
previously  unreported  completeness  relation  for  these  quantities  is  presented  and  derived. 
With  the  use  of  this  completeness  relation  it  becomes  simple  to  transform  to  a  Sine -basis 
the  eigenvalue  equations  arising  from  a  light-cone  quantization  of  field  theory  or  the 
similar  equations  occurring  in  nonrelativistic  quantum  mechanics.  The  simplicity  and 
power  of  Sinc-function  expansions  is  illustrated  by  computation  of  the  eigenvalues  and 
eigenfunctions  of  the  position-space  planar  Coulomb  equation,  a  problem  for  which 
convergence  has  not  been  achieved  by  a  variety  of  other  computational  methods.  ©  1996 
John  Wiley  &  Sons,  Inc. 


Introduction 

Expansion  methods  based  on  Whittaker  cardi¬ 
nal  functions  (usually  referred  to  as  Sine 
functions)  are  becoming  increasingly  recognized  as 
very  powerful  tools  for  solving  various  problems 
within  applied  physics  and  engineering  [1].  How¬ 
ever,  there  has  been  little  use  of  Sine  functions  in 
quantum  chemistry,  and  it  has  not  been  clear  how 
to  apply  Sine  methods  within  the  context  of  the 
operator  formalism  of  quantum  field  theory.  In  a 
recent  contribution  one  of  us  (V.G.K.)  has  shown 
how,  with  the  use  of  Dirac's  bra-ket  notation  and 
with  the  derivation  of  appropriate  mathematical 
*  To  whom  correspondence  should  be  addressed. 


theorems,  a  Sinc-function  approach  can  be  made 
more  transparent  and  flexible  enough  to  become 
applicable  to  problems  in  a  general  occupation- 
number  Fock  space  [2]. 

The  use  of  Sine  functions  constitutes  a  second 
step  in  the  direction  indicated  by  our  contribution 
to  the  1995  Sanibel  Symposium  [3].  That  work 
started  from  the  premise  that  light-cone  quantiza¬ 
tion  (LCQ)  of  quantum  field  theory  was  a  promis¬ 
ing  method  for  solving  nonperturbative  problems 
in  quantum  chemistry,  condensed-matter  physics, 
and  particle  physics,  since  it  led  naturally  to  a 
rational,  closed-form,  and  relativistically  covariant 
Hamiltonian  formulation  with  a  convenient  vac¬ 
uum  state.  In  particular,  a  discretized  LCQ  formu¬ 
lation  permits  gauge  theory  to  be  reduced  to  an 
eigenvalue  problem  (for  the  square  of  M,  the  in- 
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variant  mass)  involving  the  light-cone  Hamilto¬ 
nian  Hlc  and  a  discrete  and  covariantly  regular¬ 
ized  Fock  space  of  functions  |n>: 

XI  <K|HLclm><mi'^f>  =  M^<h|^>.  (1) 

m 


Sine  kets  and  Sine  bras 

If  >  0  and  m  is  an  integer,  the  Sine  function, 
S(m,  a)(x),  is  defined  by  [1] 


Attempts  to  solve  Eq.  (1)  with  plane  waves  \n} 
have  produced  interesting  results,  but  led  to  con¬ 
vergence  difficulties  in  problems  modeling  bound 
states  [4].  Our  previous  contribution  showed  how, 
starting  from  a  plane-wave  basis  in  which  nonper- 
turbative  renormalization  can  be  properly  imple¬ 
mented,  one  could  transform  to  a  localized  Gauss¬ 
ian  basis.  The  main  drawback  of  that  procedure  is 
the  lack  of  orthogonality  of  the  individual  Gauss¬ 
ian  functions.  The  use  of  a  Sine  basis,  whose  mem¬ 
bers  are  orthogonal,  provides  a  route  to  circum¬ 
vent  that  drawback.  We  have  not  yet  applied  Sine 
expansions  to  a  fully  realistic  LCQ  problem.  How¬ 
ever,  we  can  now  report  results  illustrating  the 
power  of  Sinc-function  methods  by  applying  them 
to  a  model  problem  whose  solution  has  not  been 
satisfactorily  obtained  by  other  means. 

Even  though  the  numerical  computation  of  this 
model  is  not  of  direct  interest  to  quantum  chemists, 
the  essential  steps  which  lead  to  its  solution  are 
very  relevant.  The  model  is  rigorously  formulated 
from  a  fully  relativistic  light-cone  formulation  of 
quantum  electrodynamics  (QED)  in  2  +  1  dimen¬ 
sions.  We  then  outline  how  to  derive  the  planar 
Coulomb  Schrodinger  equation  by  taking  the 
weak-coupling  and  nonrelativistic  limits.  A  similar 
need  to  use  a  full-fledged  quantum  field  theoretic 
model  as  a  starting  point  for  solving  relativistic 
many-electron  atomic  problems  is  now  well  estab¬ 
lished  [5].  These  atomic  models  are  also  initially 
formulated  within  a  rigorous  quantum  field  theo¬ 
retic  framework  and  appropriate  low-energy  limits 
are  then  taken  to  facilitate  computation.  Novel 
technical  difficulties  brought  about  by  such  an 
approach  have  created  a  need  for  new  computa¬ 
tional  paradigms  and  their  tests  on  simple  models. 

In  this  spirit,  the  remainder  of  the  present  study 
consists  of  (1)  a  condensed  overview  of  Sine  meth¬ 
ods  in  Fock  space,  expressed  in  Dirac  bra-ket  nota¬ 
tion  and  including  a  completeness  theorem  not 
previously  exhibited  in  the  mathematics  literature, 
followed  by  (2)  a  Sine-method  application  relevant 
in  quantum  field  theory  and  in  nonrelativistic 
quantum  mechanics;  namely,  the  position-space 
radial  Coulomb  equation  in  two  spatial  dimen¬ 
sions. 


S(m,  a)(x)  = 


sin  —(x-  ma) 
L  a 

77 

—  (x  -  ma) 
a 


Let's  define  the  Sinc-ket  and  the  Sinc-bra  to  be, 
respectively,  \m,a)  and  (m,al  We  may  then  ex¬ 
press  the  Sine  function  as 

a)  =  S(m,  a)(x)  =  <m,  a\x),  (3) 

with  the  second  equality  above  following  from  the 
fact  that  S(m,  a)(x)  is  real.  In  order  to  exploit  the 
full  power  of  the  Dirac  bra  and  ket  notation  [6],  we 
must  first  establish  the  completeness  relation, 

oc 

X  \m,a)(m,a\  =  1,  (4) 

}}1=  -  DC 

where  the  1  on  the  right-hand  side  is  understood 
to  be  the  identity  operator.  To  establish  this,  we 
first  prove  the  following  lemmas: 


Lemma  1.  The  cardinal  series  for  the  plane  wave, 
fa/l'ire'^'^,  zvith  \k\  <  rr/a  is 

I  a  [~d~  == 

] h;~  ^  ^  ]hr~  H  S{m,a){x)e‘^'“\  (5) 


Proof  Suppose  that  yja/lrre'’'^  is  represented  by 
the  power  series 


XC,„(x)e'''""’.  (6) 


Multiply  both  sides  of  Eq.  (6)  by  fa/lve^'’""'  and 
integrate  over  k: 


dke'^ 


277  ♦'-w, 


x;  CJx)—f‘‘  '  (7) 


We  obtain 


S(n,a)(x)=  X  =  C„(x),  (8) 


and  this  completes  the  proof. 
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We  denote  the  plane  wave,  ■\/a/2Tre  in  bra 
and  ket  notations  as 


<x|fc>  =  i/  — 


(9) 


Analogously,  we  also  define 

i/  —  g-ikma  ^  (fn,a\k}  =  (k\m,ay.  (10) 
V  277 

Equation  (5)  may  now  be  rewritten  as 

<x|fc>  =  ^{x\m,  a)(m,  a\k}.  (11) 


We  now  state  the  next  lemma. 


Lemma  2. 

Y,(k\m,aXm,a\k')  =  (k\k'}  =  B(k-k').  (12) 

m 

Proof.  By  definition,  we  reexpress  the  left  side  of 
Eq.  (12)  as 

—  =  a'^8{a(k  -  k')  +  27rn) 

m  n 

=  8(k  -  W),  (13) 

where  the  last  equality  follows  because  \k  -  k'\  < 
Itt/u.  This  completes  the  proof. 

It  has  already  been  established  that  [6] 

\k)  dk{k\  =  1  (14) 

—  ir/a 

and 

(  |x>  dx(x\  =  1.  (15) 

—  X 

We  now  use  the  above  closure  relations  to  prove  a 
third  lemma. 

Lemma  3. 


S(m,  a)(x)S{m,  a)iy) 


sin 


77 

—  ix  -  y) 
a 


77 

—  (x  -  y) 
a 


~  h8{x  -  y)  -  <  x|y> 


(16) 


Proof.  The  proof  of  this  theorem  demonstrates  the 
power  of  the  Dirac  bra  and  ket  notation  as  it 
makes  self-evident  the  decomposition  of  the  sums 
and  integrals  necessary  to  prove  it: 


X 

Y.  S(m,  a){x)S(m,  a)(y) 

ni=  -  “ 

m 


m  -'^/a-TT/a 


Xdk'(k'\y) 


(ffj 

r  j 

\  277/  1 

-7T/n‘' 

—  TT/a 

m 

xe-'^^  dkdk' 

\  Irr  j  J  ^  'Tr/rt*'  —  TT  / a 
a 


-  'ir/a''  —  TT/a 


iTTlJ-Tr/a 


sin 

> 

1 

1 

L  a  j 

77 

-(x  -  y) 

a 


(17) 


It  turns  out  that  sin[7r/a(x  -  y)]/[7r/a(x  -  y)]  is 
an  exact  delta  function  for  a  large  class  of  square- 
integrable  functions;  namely,  the  Wiener  space  of 
functions  [1].  For  other  functional  spaces,  the  error 
between  it  and  a  delta  function  is  exponentially 
damped  [1,  7].  This  completes  the  proof. 

The  bra  and  ket  notation  for  a  general  square- 
integrable  function  is 

{x\f)  =/(x).  (18) 

In  this  notation,  the  sampling  of  fix)  at  the  Sine 
points  ma  is  given  by 

(m,a\f)  ^f(ma).  (19) 

We  may  now  state  the  lemma  for  the  cardinal 
series  representation  of  a  function  in  Wiener  space 
by  using  the  bra  and  ket  notation. 


Lemma  4.  Let  a  >  0,  and  let  WiTr/a)  denote  the 
Weiner  space  of  functions  [1].  Every  f  in  Wfrr/fl)  has 
the  cardinal  series  representation 

X 

(x\f)  =  (x\ma)(ma\f).  (20) 

m=  -  oc 


INTERNATIONAL  JOURNAL  OF  QUANTUM  CHEMISTRY 


1313 


KOURES  AND  HARRIS 


Proof.  By  the  Paley-Wiener  theorem  [1],  f(x)  has 
a  Fourier  transform: 

fix)  =  —  e-‘'‘^F(k)  dk.  (21) 

27TJ-^/a 

The  function  F  may  be  represented  on  the  interval 
( -  7r/a,  ir/a)  by  its  Fourier  series  expansion, 

P(x)  =  (22) 

m 

with 

c,u  =  f-  e-‘’^"“'Fik)dk  =  afinta),  (23) 

where  the  last  identity  follows  form  Eq.  (21).  Upon 
substituting  Eqs.  (22)  and  (23)  into  Eq.  (21)  and 
using  the  fact  that 


we  get 

cc 

fix)  =  ^  Sim,  a)ix)fima).  (25) 

JU  —  cc 

This  completes  the  proof.  Notice  that  this  proof 
demonstrates  that  the  cardinal  series  (20)  for  a 
function  in  Wiener  space  will  reproduce  the  func¬ 
tion  with  zero  error.  However,  even  for  more  gen¬ 
eral  function  spaces  the  error  is  exponentially 
damped  [1,  7]. 

These  four  lemmas  are  necessary  and  sufficient 
to  prove  the  closure  relations  (4).  The  upshot  of  all 
this  is  that  Eq.  (4)  may  be  inserted  into  any  second- 
quantized  Hamiltonian  to  obtain  that  Hamiltoni¬ 
an's  Sine-basis  representation.  In  turn,  we  can  now 
rigorously  derive  the  Sine-basis  representation  of 
the  QED  light-cone  Hamiltonian. 

For  example,  in  the  bra  and  ket  notation,  the 
momentum  space  representation  of  the  light-cone 
Hamiltonian  may  be  written  as 

hlc- r"  ■■■/""  r"  -  r  \k> dkw- 

—  7r/a  — rr/a'^  —  TT/a  '^~7r/n 

-  \k')dk'{k'\Hi^0^")dk"{k"\- 
■■■\k"')dk"'{k’"\.  (26) 


We  obtain  the  Sine-basis  representation  by  insert¬ 
ing  the  identity  operator  (4): 


/  '£,\m,  a)im,  a\k)  dkik\--- 

V 

•••  |m',  a){m' ,  a\k')  dk'ik'lHu^^lk") 

Xdk"{k"\n,  a){n,  a\ 

-  \k'")  dk'"(k’"\n',a}(n',a\  (27) 

=  E  •••  E  E  •••  ll\m,a}im,a\---  \ni,a) 

in  m'  n  n' 

X  {ni ,  a\Hy^\n,  a){n,  a\‘-  \r{ ,  a){ri ,  a\, 

(28) 

where  the  appropriate  limits  of  summation  and 
integration  are  understood.  If  we  now  use  (28)  to 
operate  on  a  bound  state  I'T' ),  we  see  by  inspection 
of  (1)  that  we  would  be  dealing  with  an  eigenvalue 
and  matrix  diagonalization  problem.  The  bound- 
state  samples  are  obtained  at  the  Sine  points;  we 
can  then  use  the  cardinal  series  to  get  the  relativis- 
tically  invariant  bound-state  amplitudes  through¬ 
out  all  space. 


SoMng  the  Planar  Coulomb  Equations 

We  will  now  demonstrate  the  power  of  the  Sine 
collocation  method  by  solving  the  radial  Coulomb 
equation  in  rf  =  2  +  1.  The  motives  for  studying 
QED  in  2  -j-  1  dimensions  are  numerous  [7,  8].  The 
lower  dimensions  allow  a  smaller  number  of  de¬ 
grees  of  freedom,  but  the  model  still  possesses 
independent  photon  degrees  of  freedom,  unlike 
the  (1  +  l)-dimensional  model.  The  model  is  su- 
per-renormalizable  and,  when  formulated  with 
four-component  spinors,  it  exhibits  confinement  [8, 
9]. 

The  nonrelativistic  Coulomb  Schrbdinger  equa¬ 
tion  is  derived  from  the  LCQ  formalism  as  follows 
[7,  8].  One  first  derives  the  discretized  light-cone 
Hamiltonian  for  (2  +  l)-dimensional  QED  with 
four-component  spinors.  A  Tamm-Dancoff  integral 
equation  is  then  obtained  for  the  "positronium" 
bound  states.  Taking  the  weak-coupling  limit,  one 
gets  a  nonrelativistic  integral  equation  which  is  the 
momentum-space  Coulomb  Schrodinger  equation; 
the  infrared  divergences  cancel  between  the  self¬ 
mass  and  one-photon  exchange  diagrams  [8].  The 
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position-space  result  is  then  obtained  by  a  Fourier 
transform. 


1 

- H - (y  +  In  mr) 

m  '  Itt 


>F(r)  =  E^(r),  (29) 


where  y  is  the  Euler-Mascheroni  constant,  m  is 
the  mass,  and  g  is  the  coupling  constant.  It  is 
convenient  to  rewrite  Eq.  (29)  in  terms  of  the 
dimensionless  variables  x  =  ]/mg^/2Trr  and  Y  = 
(27r/g^)E  +  ln}/2g^/m7T .  We  get 

(-V^  +  In  X  +  7  +  ln2)^(x)  =  A'^(x),  (30) 


or 


(-V^^  -h  In  x)^(x)  =  A'P(x),  (31) 

with 

A  =  A'  —  7  —  In  2.  (32) 

The  separation  of  variables,  ^(x)  ==  E(x)0(0), 
leads  to  0(^)  =  exp(±z70),  where  /  is  the  angular 
momentum  quantum  number.  We  are  left  with  a 
radial  differential  equation  for  R  which,  after  us¬ 
ing  the  substitution  [9] 

R(x)  =  x^^^^fix),  (33) 

transforms  into  a  differential  equation  for  /: 


-fix)  + 


+  In  X 


fix)  ==  Xf(x).  (34) 


W  denotes  the  set  of  complex  numbers.  It  turns  out 
that  the  absolute  error  of  Sine  interpolation  and 
quadrature  on  such  functions  is  exponentially 
damped  [1]. 

For  the  problem  of  interest  to  us  we  need  to  use 
Sine  methods  on  a  function  /  whose  domain  is 
(0,  oo).  The  more  general  class  just  described,  how¬ 
ever,  has  a  domain  which  includes  the  whole  real 
line.  This  conflict  is  elegantly  circumvented 
through  the  use  of  conformal  maps  [1].  For  exam¬ 
ple,  let  </>  be  a  one-to-one  conformal  map  from 
some  domain  D  to  domain  and  let  ijj  denote 
the  inverse  map  which  is  also  conformal.  If  /  is 
analytic  in  D  then  f  °  ijj  is  analytic  in  Dg.  So,  if  a 
numerical  process  has  been  developed  in  a  domain 
containing  the  whole  real  line,  then  this  process 
can  be  carried  over  to  a  new  domain  containing 
only  a  proper  subset  of  the  real  line. 

We  will  shortly  see  that  the  domain  of  interest 
to  us  is 

D  =  l^zv  ^  W:  |arg(sinh(iz;))|  <  d  <  —  (36) 

This  domain  is  conformally  mapped  onto  the  infi¬ 
nite  strip  D5  by  the  function  [1] 

z  =  (f^izv)  =  ln(sinh(zz;)).  (37) 

If  we  let 

TV  =  i//(z)  =  (/)“^(z)  =  ln(^e^  +  Vl  -(-  ),  (38) 

then 


This  equation  for  /  represents  a  singular  Sturm- 
Liouville  system,  and  it  can  be  solved  by  Sine 
collocation  [1]. 

The  theory  of  Sine  series  (cardinal  functions)  on 
the  entire  real  line  has  been  thoroughly  developed. 
As  mentioned  earlier,  for  a  class  of  functions 
known  as  the  Wiener  class,  the  Sine  interpolation 
and  quadrature  formulas  are  exact  [1].  However,  a 
more  practical  application  of  Sine  approximation 
of  functions  which  are  in  a  much  less  restrictive 
class  has  also  been  developed  and  the  absolute 
errors  have  been  derived  via  contour  integration 
[1].  In  a  nutshell,  the  Wiener  functions  are  entire 
but  a  more  practical  class  of  functions  should  have 
specific  growth  restrictions  on  the  real  line  and 
should  be  analytic  only  on  an  infinite  strip  cen¬ 
tered  about  the  real  line, 

D5  -  {z  e  r:  2  -  X  +  iyAy\  <  d],  (35) 


r  =  ipi^)  =  (0,«)),  (39) 

as  desired.  For  a  >  0  and  m  an  integer  we  define 
the  Sine  points 

=  Ipima)  =  +  Vl  +  (40) 

One  may  further  verify  that 

<^"(x„,)  =  (41) 

where  the  primes  denote  differentiation  with  re¬ 
spect  to  X. 

Now,  to  solve  (34)  we  note  that  [7] 

N 

fix)-  12  Sim,a)i4>ix))fix^J,  (42) 

m=  — M 
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N 


/"(.V)-  L 

///  =  -  M 


where 


J 

n  /  4/2  -  1  \ 

m  ~  ^ti! 

/(■t,„). 

(43) 

n,}U  1  4  m 

\  ^ m 

Since  we  would  like  our  solution  to  be  normalized, 
we  will  also  make  use  of  the  Sine  quadrature 
formula  [7], 


jO)  = 

n,;/;  ^  ^n,m 


/(2)  = - (1  + 

iti  ^  2  Jfi  • 


(49) 


cc  N  ) 

F{x)dx^a  E  (44) 

m= -M  ^ 

To  evaluate  these  expansions  at  a  general  Sine 
point  X,,  note  that 


L 


—  S(m,^7)(0(x))|,.,  =  - 

d-  1 

-—iS{m,a){4>{x))\,  =  (45) 

ci<P~  a 

where  is  the  Kronecker  delta  function  and 


6">  = 
^n,  m 


^0,  ni  =  n 

.  m  -  n 


(-1) 


III  —  n 


m  ¥=  n. 


(46) 


77“ 

2(-l) 


m  =  n 


m  -  d  +  1 


(47) 


{m  -  n) 


m  n 


Using  these  results  along  with  (41)  we  may  now 
approximate  (34)  via  Sine  collocation, 

L  [(f,,, +  //,!!,, +  C,]/(x,,,)  =  A/(x,,),  (48) 

ui  =  -  M 


In  the  tradition  of  LCQ,  we  have  formulated  the 
problem  as  an  eigenvalue  and  matrix  diagonaliza- 
tion  problem.  The  components  of  the  eigenvector 
/(x,,^)  can  be  substituted  into  (42)  to  compute  the 
eigenfunction  at  arbitrary  x.  Finally,  we  use  (33)  to 
acquire  R(x)  and  we  normalize  this  function  by 
computing  the  norm,  ^ fQR^(x)dx ,  using  the  Sine 
quadrature  formula  (44). 

In  Table  I  we  list  the  first  five  eigenvalues  for  / 
ranging  from  0  to  4.  The  diagonalization  was  per¬ 
formed  with  MATLAB  [10].  The  convergence  was 
excellent  for  all  /  values  but  we  needed  consider¬ 
ably  higher  M  values  for  /  =  0  than  for  /  0  [7]. 

In  Ref.  [7]  the  first  five  /  =  0  eigenvalues  were 
computed  in  momentum  space: 

A'o  =  1.7969,  A'  =  2.9316,  A'^  =  3.4475, 

A'3  =  3.7858,  A'4  =  4.0380,  (50) 

where  A'  is  related  to  A  in  Table  I  by  (32).  These 
momentum  space  results  do  not  agree  with  the 
position-space  computations  in  the  literature  [9] 
and  additional  repeated  attempts  to  get  conver¬ 
gence  in  position  space  have  failed  [8].  It  was 
concluded  in  Ref.  [8]  that  'The  previous  position 
space  calculation  [9]  was  inaccurate,  due  to  the 
slow,  logarithmic  behavior  of  the  potential ....  The 
momentum  space  calculation  is  much  more  rapidly 
convergent." 


TABLE  I _ 

First  five  eigenvalues  for  /  ranging  from  0  to  4. 


/=  0 

l=^ 

1  =  2 

1  =  3 

1  =  4 

^0 

0.52643626 

1.3861862 

1 .8443720 

2.1578468 

2.3962798 

Ai 

1.6619365 

2.0094748 

2.2758614 

2.4881158 

2.6638815 

A2 

2.1870578 

2.3943387 

2.5800522 

2.7390550 

2.8772701 

A3 

2.5153639 

2.6726676 

2.8144703 

2.9409664 

3.0543788 

A4 

2.7677810 

2.8906069 

3.0049630 

3.1096821 

3.3373990 
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FIGURE  1 .  Plot  of  normalized  function  Rg  q(x)  for  /  =  0 
and  A'o  =  1 .7968. 


On  the  other  hand,  if  we  use  (32)  to  convert  the 
first  column  of  Table  I  to  the  parameterization 
used  in  Ref.  [7]  we  see  that 

A'o  =  1.7967991,  A',  =  2.9322993,  A'j  =  3.4574206, 

A'o  =  3.7857268,  A'4  =  4.0381439.  (51) 

These  values  are  in  very  good  agreement  with 
(50),  they  converge  faster,  and  they  are  more  accu¬ 
rate.  Note  further  that  the  eigenvalues  for  /  #  0  are 
actually  in  good  agreement  with  the  previously 
reported  position  space  results  [9].  (The  momen¬ 
tum  space  calculations  in  [8]  were  not  carried  out 
for  I  ¥=  0.)  This  leads  us  to  conclude  that  the  slow 
convergence  of  the  previous  position  space  results 


FIGURE  2.  Plot  of  normalized  function  for  /  =  0 

3nd  =  2.9323. 
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FIGURE  3.  Plot  of  normalized  function  for  /  =  0 

and  A2  =  3.4574. 


was  not  due  to  the  slow  logarithmic  behavior  of 
the  potential;  rather,  it  was  due  to  a  minor  instabil¬ 
ity  caused  by  the  sign-flip  of  the  "centrifugal"  part 
of  the  potential  when  /  =  0. 

As  promised,  it  is  now  very  easy  to  use  (42) 
along  with  (33)  to  compute  any  eigenfunction; 
these  eigenfunctions  are  easily  normalized  through 
the  use  of  the  Sine  quadrature  result  (44).  As  a 
representative  sample,  in  Figures  1  to  3  we  display 
the  first  three  normalized  eigenfunctions  for  I  =  0; 
note  that  the  number  of  bumps  increases  incre¬ 
mentally  as  we  go  from  n  =  0  to  n  =  2.  Figures  4 
to  6  show  the  first  three  normalized  eigenfunctions 
for  I  =  4;  note  that  the  "centrifugal"  barrier  causes 
the  bumps  to  move  away  from  the  center,  relative 
to  the  /  =  0  plots. 


FIGURE  4.  Plot  of  normalized  function  for  /  =  4 

and  Aq  =  3.6666. 
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FIGURE  5.  Plot  of  normalized  function  ^{x)  for  /  =  4 
and  =  3.9342. 


FIGURE  6.  Plot  of  normalized  function  4(x)  for  /  =  4 
and  A's  =  4.1476. 


Conclusion 

We  have  introduced  a  Dirac  bra-ket  formulation 
of  the  Sine  basis  along  with  an  important  com¬ 
pleteness  relation.  It  is  now  straightforward  to 
incorporate  a  Sinc-function  representation  into  any 
problem  which  is  posed  in  a  second-quantized 
format  such  as  the  light-cone  Hamiltonian.  Fur¬ 


thermore,  we  have  explicitly  demonstrated  that 
Sine  methods  provide  a  very  powerful  tool  for 
solving  the  radial  Coulomb  Schrodinger  equation 
in  d  =  2  +  1,  These  methods  are  very  accurate  and 
converge  very  fast.  This  point  is  strongly  demon¬ 
strated  by  the  fact  that  convergence  for  the  /  =  0 
values  could  not  be  attained  in  position  space  by 
other  well-known  numerical  techniques.  It  is  our 
hope  that  Sine  methods  will  be  found  useful  in 
attacking  many  other  nonperturbative  problems  in 
quantum  chemistry,  condensed  matter  physics,  and 
related  disciplines. 
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ABSTRACT _ _ _ 

We  recently  investigated  some  features  of  atomic-electron  distributions  which  are 
products  of  atomic  orbitals  without  linear  redundancies.  Then,  the  atomic-electron 
distributions  were  employed  as  expansion  functions  of  molecular  charge  distributions  in 
the  three-center  expansion  method  for  electron  repulsion  integrals.  It  turned  out  that  the 
method  is  very  accurate  even  after  substantial  reductions  of  required  integrals.  We  shall 
illustrate  the  method  and  discuss  its  applicability  to  many  electron  theories.  ©  1996  John 
Wiley  &  Sons,  Inc. 


Introduction 

The  fundamental  scaling  properties  in  many 
types  of  ab  initio  electronic  structure  calcula¬ 
tions  are  governed  by  generating  and  manipulat¬ 
ing  electron  repulsion  integrals  (ERIs), 

(ab  I  cd) 

=  j  dt^dr2X„(dr^'>Xh(‘^ri^^uXc(dr2^Xd(dt2)- 

(1) 

In  spite  of  a  great  deal  of  effort  toward  theoretical 
and  algorithmic  developments  coupled  with  in¬ 
creasingly  faster  electronic  computers,  the  tractable 

*To  whom  correspondence  should  be  addressed. 


quality  of  a  wave  function  is  strongly  limited  with 
respect  to  the  size  of  molecules  and  the  quality  of 
basis  functions.  To  overcome  this  obstacle,  there 
has  been  a  recent  resurgence  of  attempting  to 
approximate  ERIs.  The  pseudospectral  approach 
developed  by  Friesner  and  co-workers  [1]  chooses 
a  spatial  grid  and  transforms  basis  set  and  spatial 
grid  representations.  This  approach  scales  as  N'^M^ 
in  the  context  of  the  Hartree-Fock  approximation 
and  in  the  localized  formulation  of 

Moller-Plesset  perturbation  theory  (LMP2)  [2], 
where  N,  n,  and  are  numbers  of  basis  func¬ 
tions,  occupied  orbitals,  and  grid,  respectively,  and 
is  fixed  virtual  space  size. 

Alternative  ways  to  approximate  ERIs  employ 
model  distributions  to  represent  products  over  ba¬ 
sis  functions,  and  the  resolution  identity  (RI)  over 
model  distributions  is  substituted  into  the  expres- 
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sion  of  ERIs  [3].  Some  of  these  approaches  are 
based  on  the  similar  ideas  used  for  approximating 
direct  coulombic  energy  in  the  linear  combination 
of  atomic  orbital  (LCAO)  Xa  method  [4,  5],  and 
charge  and  transition  densities  of  molecules  are 
expanded  into  model  distributions  of  auxiliary  ba¬ 
sis  functions.  Although  these  methods  gave  almost 
satisfactory  results,  the  optimal  ways  to  generate 
expansion  basis  functions  were  not  well-addre¬ 
ssed. 

Most  recently,  we  focused  our  attention  upon 
atomic-electron  distributions  [6,  7],  in  which  all  of 
the  linear  redundancies  of  orbital  products  are 
removed,  and  approximated  molecular  charge  dis¬ 
tributions  with  linear  combination  of  atomic-elec¬ 
tron  distributions  (LCAD).  The  LCAD  approxima¬ 
tion  has  following  features: 

1.  Neither  independent  sets  of  auxiliary  func¬ 
tions  nor  numerical  parameters  are  needed  in 
approximating  ERIs. 

2.  The  number  of  atomic-electron  distributions 
grows  almost  linearly  not  only  with  respect 
to  the  size  of  molecules  but  also  to  the  qual¬ 
ity  of  basis  functions. 

3.  The  approximation  is  very  accurate  since  no 
error  occurs  in  the  intra-atomic  distributions, 
and  the  accuracy  is  systematically  improved 
according  to  the  quality  of  basis  set  used. 

The  present  study  will  show  how  this  approach 
can  be  used  to  provide  particularly  convenient 
procedures  for  the  wave  functions  of  molecules 
and  what  new  aspects  are  possible  to  enhance  the 
electronic  structure  theory  within  the  method.  We 
shall  give  a  brief  review  of  the  method  and  then 
explore  new  possibilities  to  the  many-electron  the¬ 
ory,  focusing  our  attention  on  a  multiconfiguration 
self-consistent  field  (MCSCF)  procedure,  where  or¬ 
bital  Hessian  is  not  explicitly  set  up,  and  direct 
configuration  interaction  (Cl)  methods. 


RI-LCAD  method 

The  notion  to  employ  auxiliary  functions  to 
reexpand  the  distributions  is  not  quite 

new.  In  these  methods,  the  auxiliary  functions  are 
usually  chosen  to  be  proportional  to  N,  N  being 
the  number  of  LCAO  basis  functions.  In  the 
Roothaan-type  expansion,  the  required  rank  of  dis¬ 
tributions  should  be  much  smaller  than  N(N  -\- 
l)/2  for  large  N,  since  the  convergence  of  distribu¬ 


tions  is  doubly  faster  than  that  of  molecular  basis 
functions.  Several  approximations  are  conceivable 
with  a  given  set  of  auxiliary  functions  [3].  Vahtas 
et  al.  [8]  investigated  Hartree~Fock  energies  of 
different  approximations  based  on  RI  or  the  spec¬ 
tral  resolution, 

{ah  \cd)=Y.  (cib  I  (2) 

K 

or 

(ab  led)  =Y.  Cf  (k  I  A)Q^  (3) 

kA 

where  k,  X.,... ,  denote  auxiliary  functions.  The 
expansion  coefficients  are  determined  locally, 

cf  =  E  (S-’)KA(flbA),  (4) 

A 

or  nonlocally, 

Cf  =  E  I  A),  (5) 

A 

where  {ah\)  and  are  three-  and  two-center 
overlap  integrals,  and  (ab  i  X)  and  1^^  ==  (/c  |  A)  are 
three-  and  two-center  ERIs.  The  use  of  nonlocally 
determined  coefficients  leads  to  the  same  expres¬ 
sion  of  Eq.  (2)  and  (3).  For  the  approximation  of 
ERIs  in  the  LCAO  Xa  and  Hartree-Fock  calcula¬ 
tions  [4,  8],  the  authors  concluded  that  the  nonlo¬ 
cally  determined  coefficients  (5)  give  results  supe¬ 
rior  to  a  locally  determined  one  (4),  since  Eq.  (5) 
can  be  obtained  by  minimizing  the  self-repulsion 
of  the  residual.  In  order  to  realize  the  computa¬ 
tional  efficiency  and  the  reduction  in  disk  storage 
requirement  of  decomposed  quantities  in  Eqs.  (2) 
or  (3),  the  approximate  ERIs  are  rewritten  as, 

(ab  icd)  (ab\K]lK\cd),  (6) 

K 

where 

(<71;|k]  =  E  I  k).  (7) 

K 

This  expression  is  used  in  the  CCSIXT)  (singles 
and  doubles  coupled-cluster  plus  a  perturbational 
estimate  of  the  effects  of  connected  triple  excita¬ 
tions)  models  [9]  to  avoid  the  input/output  (I/O) 
and  storage  bottlenecks.  In  the  RI-MP2  [10]  and 
the  coupled-cluster  methods  with  approximate  in¬ 
tegrals,  the  scaling  properties  of  the  calculations 
are  hardly  improved,  viz.  Oin^N^rn)  and  Oiri^N"^), 
m  being  the  number  of  auxiliary  functions,  since 
they  explicitly  form  four-indexed  integrals  from 
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three-indexed  integrals.  Since  the  usual  ab  initio 
methods  are  based  on  the  four-indexed  ERIs,  addi¬ 
tional  research  is  needed  to  explore  optimal  ways 
to  use  three-indexed  integrals.  However,  the  com¬ 
putational  effort  shifts  from  I/O  to  matrix  multi¬ 
plications,  which  can  be  handled  more  efficiently 
than  I/O.  We  also  employ  the  same  expression  (6) 
and  (7)  in  the  approximate  second-order  MCSCF 
procedure  [6,  7],  but  the  three-indexed  integrals 
are  used  as  much  as  possible  in  place  of  the 
four-indexed  integrals.  In  this  case,  the  most  time- 
consuming  part  of  the  molecular  orbital  (MO)  inte¬ 
gral  transformation  step  is  much  more  improved 
and  maximum  a  factor  of  8.4  speedup  is  attained 
for  one  MCSCF  iteration  [7]. 

The  choice  of  auxiliary  functions  is  apparently 
dominant  over  the  quality  of  three-center  approxi¬ 
mation.  Previous  researchers  tried  to  construct  an 
independent  auxiliary  set  as  compact  as  possible. 
Some  of  them  used  an  even-tempered  set  and 
obtained  accurate  self-consisted  field  (SCF)  ener¬ 
gies  [8].  However,  no  systematic  contraction 
scheme  has  been  proposed.  Although  auxiliary  sets 
have  also  been  used  to  approximate  coulombic 
potentials  in  the  density  functional  theory,  prelimi¬ 
nary  experience  suggests  that  the  exchange  inter¬ 
actions  are  more  complicated  than  the  direct 
coulombic  one.  So  such  expansion  sets  are  not 
necessarily  transferable  to  the  general  two-electron 
interactions.  Furthermore,  an  optimal  choice 
should  require  one-to-one  correspondence  of  the 
auxiliary  basis  sets  with  a  given  molecular  basis 
set  which  varies  with  the  purpose.  For  these  rea¬ 
sons,  we  have  fallen  back  on  the  idea  of  using 
atomic-electron  distributions  (ADs)  consisting  of 
each  atomic  basis  set, 

intra 

cc' 

where  the  summation  runs  over  intra-atomic  dis¬ 
tributions.  In  each  set  of  atomic-electron  distribu¬ 
tions,  they  include  two  different  kinds  of  linear 
dependencies,  viz.  the  angular  and  coulombic  re¬ 
dundancies  (AR  and  CR)  [6,  7].  It  should  be  noted 
that  atomic  blocks  are  dominant  in  the  matrix  V. 
Both  of  the  redundancies  can  be  efficiently  re¬ 
moved,  since  we  can  use  full  symmetry  of  each 
atom,  which  is  independent  from  the  shape  of 
molecules  to  be  calculated.  Numerical  results  indi¬ 
cate  that  ADs  grow  proportionally  not  only  with 
the  size  of  molecules  but  also  with  the  quality  of 
basis  functions,  if  AR  and  CR  are  removed.  We 


call  the  RI  method  plugged  into  ADs,  where  the 
charge  distributions  are  reexpressed  by  linear  com¬ 
binations  of  ADs,  the  RI-LCAD,  or  simply  LCAD 
approximation.  The  accuracies  of  the  RI-LCAD  are 
automatically  and  systematically  improved  by  ex¬ 
tending  the  set  of  basis  functions.  Our  previous 
calculations  suggest  that  typical  errors  of  the  ap¬ 
proximation  for  molecules  consisting  of  first  row 
atoms  are  a  few  10 with  d  functions  (^-ADs) 
and  less  than  IjitE^  with  /  functions  (f-ADs)  [7]. 
To  our  knowledge,  the  present  LCAD  is  the  most 
accurate  way  to  approximate  ERIs, 


MCSCF  Method  without  Setting  up 
Orbitai  Hessian 

We  combined  the  RI-LCAD  method  with  the 
approximate  second-order  MCSCF  method  [6,  7]. 
Since  the  MCSCF  methods  include  Cl  and  orbital 
optimization,  the  time-consuming  steps  are 
strongly  dependent  on  the  number  of  basis  func¬ 
tions  and  the  size  of  Cl  expansion.  We  therefore 
shall  not  give  general  inspections  on  the  MCSCF 
method  in  this  section  but  focus  our  attention 
upon  the  orbital  improvement  within  the  frame¬ 
work  of  second-order  MCSCF  theory.  Direct  Cl 
method  combined  with  the  RI-LCAD  approxima¬ 
tion,  which  bear  some  resemblance  to  the  direct 
MCSCF  method,  will  be  given  in  the  next  section. 
For  a  large  majority  of  MCSCF  calculations,  the 
most  time-consuming  step  is  the  integral  transfor¬ 
mation  required  in  each  MCSCF  iteration.  It  is  the 
most  apparent  that  integral  transformation  time 
can  be  much  reduced  in  the  three-center  approxi¬ 
mation,  since  the  total  number  of  ERIs  is  much  less 
than  that  of  full  integrals.  In  this  case,  the  integrals 
transformation  is  efficiently  achieved  without  dif¬ 
ficulty  using  a  code  for  similarity  transformations 
of  symmetric  matrices.  So  if  the  transformed 
(three-indexed)  integrals  can  be  used  in  the  re¬ 
maining  steps  as  efficiently  as  those  of  four- 
indexed,  the  MCSCF  method  will  be  more  en¬ 
hanced  by  the  three-center  expansion. 

In  the  second-order  MCSCF  method,  solving  the 
Newton-Raphson  linear  equations  in  each  itera¬ 
tion  commonly  reduces  to  an  iterative  procedure, 
where  the  Hessian  is  multiplied  by  a  trial  vector 
and  a  sequence  of  vectors  is  built  [11,  12].  Usually, 
the  number  of  active  orbitals  n  is  much  smaller 
than  the  entire  part  N,  and  the  linear  equations  are 
solved  quickly  in  comparison  with  the  integrals 
transformation  step.  We  therefore  set  up  the  or- 
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bital  Hessian  matrix  explicitly  in  the  approximate 
second-order  MCSCF  method  with  RI-LCAD  [6,7]. 
There  coarse  quantities  of  ERIs  are  accumulated 
into  Hessian,  and  four-indexed  integrals  are  not 
generated  explicitly  for  the  gradient  and  the 
coulombic-like  part  of  the  Hessian, 

E  (9) 

K-l 

and 

dfj=  (10) 

kl 

where  /,  j,k, ,  and  a,h,c, . . . ,  refer  to  occupied 
and  virtual  orbitals,  respectively,  F  denotes  two- 
particle  reduced  density  matrix,  and  is  a  coarse 
threshold  of  two-electron  contributions  to  the  Hes¬ 
sian.  Our  previous  experience  suggests  that  em¬ 
ploying  the  coarse  threshold  efficiently  simplifies 
the  evaluation  of  Hessian  without  any  loss  of  con¬ 
vergence  [7].  It  is,  however,  not  completely  re¬ 
solved  to  generate  four-indexed  integrals  for  the 
exchange-like  contributions,  and  this  step  and  the 
step  in  Eq.  (9)  scale  as  where  is  a 

number  of  distributions  within  the  coarse  thresh¬ 
old.  One  can  expend  a  nontrivial  amount  of  com¬ 
putational  effort  in  these  steps,  if  the  active  space 
becomes  large  both  in  the  LOAD  and  the  usual 
algorithms. 

We  can  proceed  with  an  alternative  method  to 
solve  linear  equations,  where  the  update  vectors 
are  evaluated  directly  from  the  elements  of  the 
Hessian  matrix.  Instead  of  using  four-indexed 

coulombic-like  and  exchange-like  operators,  whose 
one  index  is  transformed  to  be  a  restricted  one,  the 
basic  contributions  can  be  written  in  the  forms: 

E  E  dfjKj  (11) 

i<=i  i 

and 

Jn(vc) 

E  L  (12) 

K=1  / 

where  L  denotes  three-indexed  operator,  one  in¬ 
dex  of  which  is  transformed  by  an  arbitrary  vector, 

^i=  i:(ab\K]Rj,  (13) 

b 


and 

(14) 

kl 

The  above  procedure  scales  at  most  nN^miv^X  and 
it  is  more  independent  of  the  size  of  the  active 
orbitals  than  setting  up  Hessian  explicitly.  So  it 
would  be  most  efficient  in  the  restricted  active 
space  (RAS)  SCF.  It  should  be  noted  that  the  pre¬ 
sent  algorithm  is  entirely  based  on  the  three- 
indexed  integrals,  in  which  four-indexed  quanti¬ 
ties  do  not  appear.  The  operations  in  the  entire 
part  consist  of  simple  linear  algebra,  and  we  can 
expect  that  the  method  with  LCAD  approxima¬ 
tion  can  handle  many  degrees  of  orbital  rotations. 
The  Newton- Raphson  procedure  requires  that  the 
starting  orbitals  be  in  a  local  region  close  to  the 
global  minimum,  and  it  is  very  stable  in  such 
region.  However,  if  one  starts  from  ill-conditioned 
orbitals,  there  is  no  way  to  attain  the  global  con¬ 
vergence  with  information  of  first-  and  second- 
order  derivatives  since  the  energy  function  in¬ 
cludes  infinite-order  terms  with  respect  to  the  anti¬ 
symmetric  exponent  R  in  the  unitary  matrix. 
Werner  and  Meyer  [13,  14]  have  introduced  a 
method  in  which  the  energy  is  expanded  through 
second  order  in  T(R)  -=  -  1  and  the  method  is 

more  reliable  than  the  Newton-Raphson  proce¬ 
dure.  Since  the  microiteration  requires  one-index 
transformation,  it  is  as  cheap  as  the  update  scheme 
in  the  Newton-Raphson  procedure.  We  note  that 
the  above  mentioned  procedure  employing  three- 
indexed  integrals  is  applicable  to  the  method  with 
an  entirely  parallel  argument. 


Slater  Determinant-Based  Direct 
Cl  Method 

The  Cl  method  is  one  of  the  most  popular  tools 
to  treat  electron  correlation  effects  of  ground  and 
excited  states.  When  we  use  the  direct  Cl  algo¬ 
rithm  where  the  Davidson  scheme  is  used,  the 
time-consuming  step  is  the  evaluation  of  a  of  the 
two-electronic  part: 

cti=E  (ij  I  kl)E  {I\E,jij}}ci,  (15) 

ijkl  } 

where  c  is  a  trial  vector  and  Eij  are  generators  of 
unitary  group.  Siegbahn  [15,  16]  proposed  an  algo¬ 
rithm  for  full  Cl  calculations,  which  can  make 
good  use  of  pipeline  computers,  introducing  the 
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resolution  of  the  identity  over  CSFs, 

<I|4.e;,|/>  =  E  0\E, j\K)(K\Ej}}.  (16) 

K 

Then  the  evaluation  of  a  in  Eq.  (15)  can  be  written 
as  follows,  with,  loop  over  K: 

=  E  (17) 

where  are  one-electron  coupling  coefficients, 

=  </|E^|K>,  (18) 

whose  matrix  elements  over  Gelfand  states  are 
ordinarily  calculated  in  the  graphical  unitary  group 
approach  [17,  18],  and 

Et;  =  L  (y  IE/)D,^„  (19) 

kl 

D,^=Eryc,.  (20) 

j 

Although  this  method  turns  out  to  be  computa¬ 
tionally  efficient,  there  remains  a  demand  on  I/O 
facilities  for  the  one-electron  formula  tape.  Aban¬ 
doning  the  use  of  CSFs,  Knowles  and  Handy  [19] 
extended  Siegbahn's  idea  to  the  use  of  simple 
Slater  determinants.  Although  there  are  two  or 
three  times  as  many  determinants  as  CSFs  in  a  Cl 
expansion  and  accordingly  the  matrix  multiplica¬ 
tion  in  Eq.  (19)  increases,  the  treatment  of  the 
coupling  coefficients  is  much  more  simplified. 

We  can  expect  that  the  RI-LCAD  method  pro¬ 
vides  an  efficient  alternative  to  the  Slater  determi¬ 
nant  Cl  illustrated  above.  In  the  three-center  ex¬ 
pansion,  the  part  of  Hamiltonian  including  two- 
electron  generators  can  be  expanded  into  a  combi¬ 
nation  of  square  of  one-electron  operators, 

EE7.£;,(y  |fc/)=  E4'.  (21) 

ijkl  K 

where 

4=  (22) 

Accordingly,  the  major  matrix  multiplication  step 

in  Eq.  (19)  can  be  rewritten 

ee  =  E(y1«]D^''  (23) 


and 

=  E  <-f(|(3tl/>c,-  (24) 

/ 

Then  the  scaling  property  in  Eq.  (19)  changes  from 
rp-  to  m,  asymptotically.  However,  since  the  size  of 
full  Cl  grows  rapidly  with  the  numbers  of  elec¬ 
trons  and  orbitals,  it  is  usual  that  a  set  of  basis 
functions  are  chosen  as  active.  The  original  algo¬ 
rithm  is  also  designed  with  the  problem  of  the 
complete  active  space  (CAS)  SCF  method  in  mind 
[19],  where  the  square  of  n  often  becomes  smaller 
than  m.  Thinking  from  a  different  point  of  view, 
we  shall  explore  the  possibility  of  restricted  Cl 
algorithms  with  the  Slater  determinant-based 
method. 

The  problem  to  be  solved  here  is  a  Cl  calcula¬ 
tion  of  a  single  reference  in  which  the  excitation  is 
limited  to  be  X.  The  most  obvious  difficulty  of 
introducing  intermediate  states  is  that  they  must 
span  extra  spin  states  which  at  least  interact 
through  the  operators  We  can  divide  the  linear 
contributions  to  a  into  two  parts,  viz.  intermedi¬ 
ate  states  are  within  or  out  of  the  range  of  the  Cl 
expansion  space: 

i:  <ii4i^><^i4i/> 

.y<x 

+  <J14|X  +  1><X  +  1|(5l|/>,  (25) 

where  we  used  abbreviated  notation  for  the  sum¬ 
mation  over  intermediates  and  k.  The  size  of  inter¬ 
mediates  out  of  the  Cl  space  makes  the  calculation 
wasteful.  It  is  also  possible  to  classify  the  second 
contributions  into  three  parts;  namely  the  rank  of 
orbital  difference  is  2,  1,  or  0.  For  rank  =  2,  we  can 
rewrite  it  to  a  by  introducing  X  —  1  electron 
excited  states  as;  loop  over  (X  -  1)  and  k; 

</|4lX+  1>E  <X+  l|4l/>C; 

/ 

bj 

j 

b 

+  Cjiail'Kf',  (26) 

where  a  and  i  indicate  created  and  annihilated 
orbitals  to  form  a  state  I  from  a  state  belonging  to 
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X  -  1,  and 

^  (iiijx  -  l)Xai\K].  (27) 

In  the  RI-LCAD  method,  the  main  contributions 
therefore  can  be  expressed  as  a  sum  of  products  in 
each  excited  state  of  X  -  1  and  k.  This  expression 
is  no  more  wasteful  and  similar  convenience  can 
be  provided  for  rank  =  0, 1  as  well.  The  present 
method  is  restricted  within  the  single  reference  Cl 
calculation.  However,  even  in  the  multireference 
case,  we  believe  that  efficient  code  can  be  gener¬ 
ated  based  on  the  RI-LCAD  method  with  a  tech¬ 
nique  similar  to  the  one  introduced  here. 


Conclusion 

We  have  illustrated  MCSCF  and  direct  Cl  meth¬ 
ods  incorporating  the  RI-LCAD  approximation.  In 
both  methods,  new  ideas  are  proposed  to  avoid 
explicitly  setting  up  four-indexed  integrals.  These 
methods  therefore  enable  us  to  handle  smaller  size 
of  data  and  lead  to  new  aspects  in  the  ab  initio 
methods  for  large  molecular  systems.  Our  experi¬ 
ence  suggests  that  the  LCAD  approximation  is 
very  accurate  not  only  in  the  single-determinant 
wave  function  but  also  in  the  large-scale  Cl  calcu¬ 
lations,  unless  one  uses  an  extremely  poor  basis 
set.  It  should  be  emphasized  that  we  do  not  think 
the  entire  part  of  the  ab  initio  method  can  be 
improved  by  adopting  three-center  expansion.  For 
instance,  the  SCF  or  energy  part  of  the  MP2  method 
scales  as  fourth  power  of  the  size  of  molecules.  It 
is  always  necessary  to  transform  an  index  in  the 
three-center  expansion  and  it  also  scales  as  N^. 
However,  the  bottleneck  of  the  computation  shifts 
from  the  necessity  to  handle  a  large  number  of 
data  to  the  algebraic  manipulations  of  matrices.  So 
the  three-center  expansion  would  be  much  more 
suited  on  parallel  computers. 
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ABSTRACT  - - - 

We  review  "Hilbert  space  basis-set  reduction"  (BSR)  as  an  approach  to  reduce  the 
computaHonal  effort  of  accurate  correladon  calculations  for  large  basis  sets.  We  parHtion 
the  single-particle  basis  into  a  small  "internal"  and  a  large  external  set.  We  use  the 
MRCI  method  for  the  calculation  for  that  part  of  configuration  space  in  which  only 
internal  orbitals  are  occupied  and  perturbatively  correct  for  the  remaining  configurations 
using  a  method  similar  to  Shavitt's  method.  The  present  implementation  approximates 
the  MRCI  result  for  the  unpartitioned  basis  set,  with  a  significantly  reduced  computational 
effort.  To  demonstrate  the  viability  of  the  method,  we  present  results  for  selected  states 
of  small  molecules  (Be^,  CHj,  O3).  For  the  examples  investigated,  we  find  that  relative 
energy  differences  can  be  reproduced  to  an  accuracy  of  approximately  1  kcal/mol  with  a 
significant  computational  saving.  ©  1996  John  Wiley  &  Sons,  Inc. 


Introduction 

In  recent  years  a  number  of  attempts  have  been 
made  to  introduce  approximations  for  the 
treatment  of  dynamical  correlation  effects  within 
the  framework  of  the  established  methodology  of 
accurate  ab  initio  methods  in  quantum  chemistry. 
The  impetus  for  such  developments  stems  from 
the  unfavorable  scaling  of  the  correlation  energy 

*To  whom  correspondence  should  be  addressed. 


with  the  basis-set  size,  which  arises  when  the 
many-body  wave  function  is  expanded  in  terms  of 
Slater  determinants.  Consequently,  methods  have 
been  proposed  to  improve  either  the  wave  func¬ 
tion,  e.g.  through  the  introduction  of  explicit  elec¬ 
tron-electron  cusps  [1],  or  to  formulate  an  effec¬ 
tive  Hamiltonian  [2,  3]  to  reduce  the  size  of  the 
basis  set  which  must  be  treated  within  the  estab¬ 
lished  methodology.  The  latter  approach  benefits 
from  the  realization  that  perturbative  methods  cope 
adequately  with  the  treatment  of  purely  dynamical 
correlation  effects.  Additionally  there  are  a  num¬ 
ber  of  comprehensive  multireference  perturbation 
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theory  approachs  [4-6]  which  account  for  both 
dynamical  and  nondynamical  correlations  effects. 

One  common  objective  of  such  treatments  is  an 
adequate  description  of  electronic  fluctuations  for 
a  zeroth-order  wave  function  which  is  more  com¬ 
plex  than  the  single-determinant  closed-shell 
Hartree-Fock  function.  It  is  well  known  that  the 
systematic  approximation  of  the  electronic  fluctua¬ 
tions  requires  the  use  of  a  hierarchical  sequence  of 
basis  sets  of  increasing  size  [7].  For  nonperturba- 
tive  methods  this  increase  quickly  leads  to  a  pro¬ 
hibitive  computational  cost  even  for  molecules  of 
moderate  size.  The  underlying  assumption  for  the 
development  of  multireference  perturbation  meth¬ 
ods  is  the  existence  of  an  orbital  hierarchy,  so  that 
only  a  small  subset  of  the  orbitals  is  occupied  in 
the  zeroth-order  wave  function.  This  observation 
suggests  a  partition  of  the  orbitals  basis  into  two 
subsets:  a  small  "internal"  subset  required  for  the 
construction  of  the  zeroth-order  wave  function  and 
a  larger  "external"  one  for  the  treatment  of  dy¬ 
namical  electronic  fluctuations  [3]. 

Introducing  projectors  P  and  Q  =  1  -  P  for  the 
internal  and  external  many-body  space,  respec¬ 
tively,  an  effective  Hamiltonian  [2]  for  the  internal 
many-body  space  can  be  written  as 

=  PHP  +  PHQ(  E  ~  QHQ)  “  ’  QHP,  (1) 

where  the  inversion  of  the  many-body  operator 
(E  -  QHQ)  generates  a  perturbation  expansion. 
Given  such  an  effective  Hamiltonian,  the  internal 
wave  function  =  PT^  can  be  generated  by  an 
established  method  using  only  the  small  subset  of 
internal  configurations  and  then  expanded  to  yield 
the  full  wave  function  through  the  wave  operator: 

^  =  [i  +  (E-QHQ)^']t^,,  (2) 

which  recovers  all  fluctuation  effects  implicit  in 
the  effective  Hamiltonian. 

A  scaling  analysis  in  the  orbital  momentum  k 
demonstrates  that  only  terms  in  lowest-order  per¬ 
turbation  theory  to  Eq.  (1)  yield  corrections  of 
order  k  ^  to  the  correlation  energy,  justifying  a 
termination  of  the  perturbation  expansion  at  that 
level.  Unfortunately,  in  addition  to  one-particle 
and  two-particle  terms,  this  approach  generates  a 
three-particle  interaction  in  the  effective  Hamilto¬ 
nian.  The  corrections  arising  from  the  three-par¬ 
ticle  interaction  are  of  equal  importance  as  those  of 
the  two-particle  terms.  As  Wang  and  Freed  ob¬ 
served  early  on,  an  omission  of  this  interaction 


expansion  seriously  affects  the  quality  of  the  ap¬ 
proximation  [8].  Its  treatment,  on  the  other  hand, 
necessitates  a  computational  effort  which  tends  to 
overwhelm  the  possible  savings  gained  by  this 
approach. 


Description  of  the  Method 


Based  on  these  observations,  we  previously  de¬ 
veloped  a  formalism  [9]  which  avoids  the  explicit 
introduction  of  the  effective  Hamiltonian  in  favor 
of  a  configuration-based  approach  similar  to 
Shavitt's  B).  method  [6].  Using  an  orbital  partition¬ 
ing,  we  divide  the  many-body  state  vector  into 
an  internal  and  an  external  part:  %  =  P"^  and 
"4^^:  =  (1  -  P)'i'.  The  internal  wave  function  will  be 
treated  with  the  MRCI  method.  We  do  not  require 
the  knowledge  of  the  effective  Hamiltonian,  but 
only  the  internal  components  of  the  many-body 
field  H  =  These  arise  from  the  coupling  to 
both  the  internal  and  the  external  wave  function: 


ni  =  P{H%  +  H%).  (3) 

The  approximate  evaluation  of  the  second  term  in 
the  many-body  field  corresponds  to  the  implicit 
evaluation  of  the  second  term  in  the  effective 
Hamiltonian  in  Eq.  (1).  In  the  following  we  de¬ 
velop  a  formalism  to  compute  this  contribution 
approximately,  a  method  we  refer  to  as  Hilbert 
space  basis-set  reduction  (BSR). 

As  noted  above,  lowest-order  perturbation  the¬ 
ory  captures  the  most  important  contributions  to 
the  correlation  energy.  The  truncation  of  the  per¬ 
turbation  expansion  at  this  level  decouples  the 
external  configurations  from  one  another.  There¬ 
fore  we  can  determine  approximate  coefficients  a,, 
for  the  each  of  the  external  configurations  4>x  inde¬ 
pendently  and  their  contribution  to  the  effective 
internal  field  as  In  determining  we  go 

beyond  perturbation  theory  in  a  fashion  which 
avoids  the  intruder  state  problem:  for  each  exter¬ 
nal  configuration  we  diagonalize  the  Hamiltonian 
in  the  Hilbert  space  spanned  by  (f)^  and  the  inter¬ 
nal  wave  function  i.e.,  we  solve  the  eigenvalue 
problem 


We  note  that  a  substitution  of  the  full  many-body 
wave  function  T"  for  and  accounting  for 
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nonorthogonality  would  yield  the  exact  coefficient 
for  [10].  An  expansion  in  the  energy  difference 

recovers  all  lowest- 
order  terms  in  the  effective  Hamiltonian  of  Eq.  (1), 
including  the  crucial  three-particle  contribution. 
The  sum  of  the  individual  energy  contributions  of 
all  the  external  configurations  is  a  perturbative 
estimate  of  the  external  energy  contribution, 

AE  =  i;(e^  -  (5) 

AT 

where  E,  is  the  energy  of  the  internal  wave  func¬ 
tion.  The  lowest-order  expansion  A  E  yields  the 
energy.  A  similar  approach,  DCPT2,  has  recently 
been  introduced  as  an  alternative  to  MP2  [11]. 

As  the  first  step  in  the  calculation  we  omit  any 
external  contributions  to  fi,  and  converge  the 
MRCI  wave  function  in  the  internal  space,  yielding 
a  zeroth-order  approximation  to  the  internal  state 
For  each  external  configuration,  this  state  is 
substituted  into  Eq.  (4)  and  we  obtain  a  first  ap¬ 
proximation  for  the  external  state, 

,p(i)  =  (6) 

which  is  used  to  compute  the  first  correction  to  the 
internal  many-body  field,  We  then 

reconverge  the  internal  calculation  in  the  presence 
of  this  field,  completing  conceptually  one  iteration 
of  the  single-state  B,^  method,  arriving  at  a  cor¬ 
rected  internal  state  ^P\  The  energy  of  this  wave 
function  is  given  as 

-l-  +  h.c.)  +  E^ 

Ebsr  =  ' 

(7) 

which  accounts  for  the  relaxation  of  the  internal 
wave  function  in  the  presence  of  the  field  gener¬ 
ated  by  the  external  configurations.  In  the  expres¬ 
sion  for  the  BSR  energy  we  have  introduced 

and 

N,  -  ») 

X 

for  the  purely  external  contributions  to  energy  and 
norm.  The  most  serious  error  of  this  approxima¬ 
tion  arises  from  the  decoupling  of  the  external 
configurations,  which,  on  the  other  hand,  is  re¬ 


sponsible  for  the  significant  reduction  of  the  com¬ 
putational  effort  of  this  method.  We  note  that  the 
energy  in  Eq.  (7)  is  not  invariant  under  unitary 
transformations  of  the  external  orbitals.  The  effort 
scales  as  the  second  power  of  the  number  of  exter¬ 
nal  orbitals,  compared  to  a  fourth-power  depen¬ 
dence  for  the  equivalent  MRCI. 


Results 

The  underlying  hypothesis  for  the  introduction 
of  this  method  is  that  it  is  possible  to  separate  out 
a  sufficiently  large  part  of  the  basis  for  an  approxi¬ 
mate  treatment  without  the  introduction  of  serious 
differential  errors  in  the  potential  energy  surface. 
In  the  following  we  present  a  comparison  between 
MRCI/FCI  and  BSR  energies  for  three  small 
molecules,  Be2,  CH2,  and  O3.  These  molecules 
were  chosen  because  the  treatment  of  dynamical 
and  nondynamical  correlation  effects  in  each  ex¬ 
ample  presents  a  significant  challenge  to  the 
method. 

Be2  is  among  the  prototypical  examples  of  a 
molecule  which  requires  an  accurate  treatment  of 
both  dynamical  and  nondynamical  correlation  ef¬ 
fects  in  a  large  basis  set  and  has  been  widely 
studied  for  that  reason.  A  section  of  the  FCI 
ground-state  potential  energy  surface  in  a  large 
basis  [12]  is  shown  as  the  long-dashed  line  in 
Figure  1.  The  minimum  of  the  potential  energy 
surface  (PES)  is  found  at  r,  =  4.78  a.u.  in  good 
agreement  with  an  experimental  value  of  =  4.72 
a.u.  [13].  If  we  reduce  the  basis  set  for  another  FCI, 
calculation  to  the  lowest  22  orbitals,  selecting  on 
the  basis  of  the  Hartree-Fock  energies,  the  FCI-PES 
shows  no  binding  in  the  relevant  region  (short- 
dashed  line);  hence  the  entire  differential  effect  arises 
from  the  proper  treatment  of  the  contribution  of 
the  neglected  external  orbitals.  The  state-selected 
FCI-BSR-PES  (full  line)  captures  the  important  dif¬ 
ferential  contribution  of  the  external  orbitals,  while 
introducing  an  overall  absolute  error  of  1.8  mH. 
The  binding  distance  rf  =  4.82  a.u.  is  in  good 
agreement  with  that  of  the  FCI  calculation. 

The  computation  of  the  singlet-triplet  splitting 
in  methylene  is  another  example  where  a  high-level 
method  and  a  large  single-particle  basis  are  re¬ 
quired  to  obtain  the  desired  accuracy.  In  Table  I 
we  present  the  results  for  MRCI  and  BSR  calcula¬ 
tions  for  the  singlet  and  triplet  states  of  methylene 
at  their  respective  miruma,  which  were  deter- 
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FIGURE  1 .  FCI  and  FCI-BSR  energies  for  the  beryllium 
dimer.  The  FCI  energy  (long  dashed  line)  in  the  basis 
used  by  Harrison  and  Handy.  The  short  dashed  line 
shows  the  FCI  potential  energy  surface  for  a  22-orbltal 
subset  of  this  basis  —  it  has  been  shifted  down  by  1 2 
mH.  The  fuli  line  is  the  BSR  curve  including  relaxation. 

mined  by  Shavitt  [14]  in  a  QZP  quality  basis.  All 
BSR  and  MRCI  calculations  presented  here  were 
performed  in  the  frozen-core  interacting-space 
Hilbert  space  without  Davidson  correction.  The 
internal  orbitals  were  generated  by  an  all-electron 
CASSCF  calculation,  and  the  remaining  orbitals 
were  K-orbitals  [15]. 

Table  I  demonstrates  the  absolute  energy  con- 


vergence  of  the  BSR  method.  The  top  row  shows 
the  results  of  a  six-electron  CAS-MRCI  calculation 
at  their  respective  minima.  The  following  three 
blocks  of  data  correspond  to  different  partitions  of 
the  orbitals,  with  6,  14,  and  48  orbitals  in  the 
internal  set,  respectively.  The  first  entry  in  each 
block,  labeled  £,,  gives  the  MRCI  energy  in  the 
internal  basis,  the  second  entry  the  perturbative 
energy  of  Eq.  (5),  and  the  last  entry  the  BSR  en¬ 
ergy.  The  difference  between  the  last  two  energies 
is  a  measure  of  the  importance  of  the  relaxation 
effects  in  the  last  step  of  the  BSR  procedure.  The 
last  column  of  each  block  shows  the  relative  reduc¬ 
tion  in  the  number  of  matrix  elements  which  must 
be  calculated  in  the  BSR  vs.  the  MRCI  calculation. 
The  first  calculation  restricts  the  internal  space  to 
the  CAS  reference  space  and  gives  a  predictably 
large  total  error;  on  the  other  hand  the  computa¬ 
tion  effort  was  reduced  by  a  factor  of  1430.  The 
perturbative  correction  overshoots  the  MRCI  en¬ 
ergy  by  about  12  mH,  as  is  common  for  an  Aj^-like 
method.  About  half  of  this  error  is  corrected  for  in 
the  final  relaxation  step. 

One  can  now  systematically  improve  upon  this 
calculation.  Based  on  the  orbital  occupation  in  the 
single-particle  density  matrix  or  their  energy  con¬ 
tribution  in  the  first  calculation  we  can  select  the 
next  important  set  of  orbitals  for  inclusion  into  the 
internal  space.  A  typical  threshold  for  the  orbital 
occupation  is  about  0.01,  but  a  systematic  investi¬ 
gation  for  an  automated  selection  of  the  orbitals 


TABLE  I 


Electronic  energies  for  lowest  triplet  and  singlet  states  of  methylene  in  the  frozen-core  full-valence 
complete  active-space  MRCI  and  BSR  methods  at  the  geometries  determined  by  Shavitt  [12].^ 

Treatment 

Ni/N, 

®B,^ 

(a.u.) 

Error 

(mH) 

'A," 

(a.u.) 

Error 

(mH) 

Comp. 

speedup^ 

MRCI 

82/0 

-39.0831 

___ 

-39.0683 

1 

6/76 

-38.9721 

111.8 

-38.9529 

115.4 

E;  +  ^E 

6/76 

-39.0940 

-10.9 

-39.0815 

-13.2 

^BSR 

6/76 

-39.0903 

-6.7 

-39.0771 

-8.8 

1430 

14/68 

-39.0016 

83.0 

-38.9856 

82.7 

E;+  AE 

14/68 

-39.0853 

-7.0 

-39.0727 

-4.4 

^BSR 

14/68 

-39.0846 

-1.5 

-39.0713 

-3.0 

95 

48/34 

-39.0463 

36.8 

-39.0309 

37.4 

E,-  + AE 

48/34 

-39.0824 

0.7 

-39.0684 

-0.1 

^BSR 

48/34 

-39.0826 

0.5 

-39.0684 

-0.1 

4 

"N;  (A/,)  denote  the  number  of  internal  (external)  orbitals  in  the  BSR  framework.  Six  of  the  internal  orbitals  are  treated  in  the 
complete  active  space.  £,  is  to  the  energy  of The  perturbative  energy  correction  AE  and  the  BSR  energy  Boor  are  defined  in 
Eqs.  (5)  and  (7),  respectively. 

'’Geometry:  R  =  1.0779  A,  t»=  132.9°. 

"Geometry:  R  =  1.1086  A,  e=  102.0°. 

The  computational  speedup  is  the  fraction  of  MRCI  matrix  elements  evaluated  vs.  those  used  for  the  BSR  calculation. 
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for  the  internal  space  has  yet  to  be  undertaken.  In 
this  set  of  14  orbitals  the  original  six  active  orbitals 
will  be  treated  as  internal  in  the  MRCI  calculation, 
while  the  remaining  eight  will  be  external  in  the 
sense  of  the  MRCI  calculation.  While  the  total 
error  of  the  internal  calculation  is  still  quite  large, 
the  BSR  method  now  recovers  more  than  96%  of 
this  error,  the  computational  effort  is  reduced  by  a 
factor  of  100.  The  last  partition,  with  48  internal 
orbitals,  demonstrates  convergence  of  the  absolute 
error,  though  the  computational  saving  is  no  longer 
significant. 

An  equivalent  set  of  calculations  was  carried 
out  for  the  stretched  molecule  (75  A)  to  determine 
the  dissociation  energy.  In  Table  II  we  summarize 
the  results  for  singlet-triplet  splitting  and  the  dis¬ 
sociation  energy  for  the  various  partitions.  For 
even  the  smallest  internal  basis,  BSR  recovers  the 
full  MRCI  singlet-triplet  splitting  and  the  dissocia¬ 
tion  energy.  Note  that  none  of  the  energies  have 
been  corrected  for  zero-point  vibrations.  In  the 
second  calculation  both  energy  differences  are  con¬ 
verged  to  1  kcal / mol.  The  last  set  of  results 
demonstrates  the  convergence  of  the  method. 

As  the  final  example  we  present  results  of  an 
exploratory  calculation  of  the  ground  state  Aj 
energy  surface  of  ozone.  The  importance  of  this 
molecule  to  atmospheric  chemistry,  as  well  as  its 
interesting  properties  have  spurred  a  number  of 
theoretical  investigations  in  recent  years  [16].  While 


a  number  of  properties  could  be  adequately  de¬ 
scribed,  some  features,  most  notably  the  dissocia¬ 
tion  process,  remain  to  be  elucidated.  Here  we 
focus  on  the  two  minima  of  the  ground-state  en¬ 
ergy  surface  to  test  the  validity  of  the  BSR  approxi¬ 
mation.  Calculations  at  the  primary  minimum  were 
carried  out  at  the  experimental  geometry;  the  sec¬ 
ondary  minimum  was  determined  using  an  11 
reference  MRCI  calculation  in  CASSCF  orbitals 
constrained  to  symmetry.  Dunning's  cc-pVTZ 
basis  [17]  was  used  for  all  calculations.  Both  the 
energy  of  the  primary  minimum  and  the  location 
of  the  secondary  minimum  compare  well  with 
previous  studies  [16].  The  location  of  the  minima 
and  their  MRCI  energies  are  given  in  Table  III.  We 
determined  the  most  important  orbitals  using  a 
BSR  calculation  in  which  all  nonreference  orbitals 
at  both  minima  were  treated  in  the  BSR  approxi¬ 
mation,  yielding  a  set  of  22  internal  orbitals.  Using 
only  the  internal  basis,  we  incur  a  large  differential 
error  in  the  separation  of  the  two  minima.  The 
perturbative  step,  labeled  E,-  -t-  AE,  reduces  the 
relative  error  to  4.4  mH,  while  the  relaxation  step 
reduces  it  further  to  0.4  mH.  The  BSR  computation 
cost  of  the  BSR  calculation  was  7%  of  that  of  the 
corresponding  MRCI  calculations. 


Conclusions 

For  the  examples  investigated  here  the  BSR  ap¬ 
proximation  resulted  in  an  adequate  energy  reso- 


Singlet-triplet  splitting  and  dissociation  energy  of  the  methylene  radical  in  the  frozen-core  full-valence 
complete  active-space  MRCI  and  BSR  methods.® _ _ _ 


Treatment 

A/,/W, 

Splitting 

(kcal  /  mol) 

Error 

De 

(kcal  /  mol) 

error 

MRCI 

82/0 

9.3 

— 

189.7 

— 

E: 

6176 

12.0 

2.7 

169.5 

-20.2 

E,-  + AE 

6/76 

7.8 

-1.5 

191.9 

2.2 

^BSR 

6/76 

8.3 

-1.0 

191.3 

1.6 

E; 

14/68 

10.4 

+  1.1 

173.1 

-16.6 

E,.  + AE 

14/68 

7.9 

-1.4 

190.1 

0,4 

^BSR 

14/68 

8.3 

-1.0 

189.3 

-0.4 

E; 

48134 

9.7 

+  0.4 

184.3 

-5.4 

Ej  +  AE 

48/34 

8.7 

-0.6 

189.7 

0.0 

^BSR 

48/34 

8.9 

-0.3 

189.8 

0.1 

®For  symbols  and  geometries  see  Table  I. 
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TABLE  III  _ _ _ _ _ 

Electronic  MRCI  and  BSR  energies  for  the  primary  (X^A.,)  and  secondary  minimum  of  the  ground-state 
potentiai  energy  surface  of  ozone  in  a  cc-pVTZ  basis  using  CASSCF  orbitals  in  CzqV  symmetry.® 


Treatment 

Ni/N, 

Primary 

(a.u.) 

Error 

{mH) 

Secondary*^* 

(a.u.) 

Error 

(mH) 

^PS 

(mH) 

Error 

(mH) 

MRCI 

84/0 

-225.0482 

-224.9848 

63.4 

22/62 

-224.7880 

260.2 

-224.7346 

250.2 

53.4 

10.0 

E,  + AE 

22/62 

-225.0721 

-23,9 

-225.0131 

-28.3 

59.0 

4.4 

^BSR 

22/62 

-225.0669 

-18.7 

-225.0031 

-18.1 

63.8 

0.4 

®A/,  (NJ  denote  the  number  of  internal  (external)  orbitals  in  the  BSR  framework.  There  are  six  (active)  valence  orbitals  E  refers  to 
the  energy  of  The  perturbative  energy  correction  AE  and  the  BSR  energy  Ebsr  are  defined  in  Eqs.  (5)  and  (7),  respectively. 
Aps  denotes  the  energy  difference  between  the  two  minima. 

‘’Geometry:  R  =  2.413at,,  0=116.8°. 

‘’Geometry:  R  =  2.6683^,  e  =  60.0°. 


lution  of  the  potential  energy  surfaces  with  a  sig¬ 
nificantly  reduced  computational  effort.  Building 
on  the  effective  valence  shell  Hamiltonian  method, 
we  have  been  able  to  show  that  a  configuration- 
based  approach  can  successfully  treat  the  three- 
particle  terms  which  arise  in  lowest-order  pertur¬ 
bation  theory  for  the  effective  Hamiltonian.  BSR 
offers  a  systematic  approach  to  formulate  a  mul¬ 
tireference  perturbation  theory.  Based  on  an  initial 
run  with  only  a  few  internal  states,  the  important 
orbitals  can  be  selected  on  the  basis  of  either  their 
perturbative  energy  contribution  or  according  to 
their  occupation.  The  approximation  allows  the 
generation  of  a  single-particle  density  matrix  and 
thus  the  computation  of  properties.  The  computa¬ 
tion  of  forces  and  frequencies  is  a  more  challeng¬ 
ing  problem.  The  present  implementation  was 
explicitly  designed  to  approximate  an  MRCI  calcu¬ 
lation  of  the  full  basis,  as  such  it  lacks  size  consis¬ 
tency.  This  approach  allowed  us  to  test  the  most 
significant  approximation  of  the  method,  namely 
the  neglect  of  the  off-diagonal  terms  in  the  external 
Hamiltonian,  against  a  firmly  established  method. 
In  the  future  the  requirement  to  approximate  the 
MRCI  will  be  relaxed  to  arrive  at  a  size-consistent 
version  of  the  method. 
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ABSTRACT  _ _ _ 

An  extensive  investigation  of  the  basis-set  effect  on  the  predicted  geometry  of  the  redox 
pair  is  presented.  Basis  sets  where  the  core  electrons  have  been 

replaced  with  a  relativistic  core  potential  as  well  as  all-electron  basis  sets  were  tested. 
Best  agreement  with  observations  was  obtained  with  the  all-electron  basis  set  MIDI 
augmented  with  a  set  of  /-type  polarization  functions  on  the  metal  center.  Other 
properties  such  as  the  vibration  spectrum,  the  relative  energy  of  the  high-spin  and 
low-spin  states,  and  geometry  changes  upon  oxidation/reduction  of  the  central  metal  are 
discussed.  The  importance  of  electron  correlation  on  the  predicted  geometry  was 
estimated  at  the  MP2,  MP3,  MP4(SDQ),  CCSD,  and  CCSD(T)  levels  of  theory.  The 
MIDI(/)  basis  set  is  then  used  for  other  octahedral  second-row  transition-metal  complexes 
and  some  other  related  complexes.  The  electronic  spectrum  of  [Ru(NH3)6]^^  is  also 
calculated  using  two  different  Cl  computational  schemes.  Surprisingly  good  agreement 
between  the  predicted  electronic  spectrum  and  the  observed  spectrum  are  obtained  using 
one  of  the  Cl  computational  schemes.  ©  1996  lohn  Wiley  &  Sons,  Inc. 


Introduction 

Ruthenium  complexes,  especially 
[Ru(bpy)3]^^,  are  among  the  most  experi¬ 
mentally  studied  metalorganic  complexes,  due  to 
their  unique  combination  of  chemical  stability,  re¬ 
dox  properties,  electron  and  energy-transfer  prop¬ 
erties,  and  excited-state  reactivity  [1],  despite  that 
few  quantum  chemical  investigations  of  ruthe¬ 
nium  complexes  have  been  published  [2-4].  The 


list  of  computational  work  on  ruthenium  com¬ 
plexes  is  certainly  not  complete  and  many  works 
have  probably  escaped  our  attention.  Until  just 
recently,  only  semiempirical  methods  [2,  4a]  and 
density  functional  theory  methods  [3]  have  been 
used.  Part  of  the  problem  in  applying  ab  initio 
quantum  chemical  methods  to  second-row  transi¬ 
tion  metals  is  the  relative  large  size,  in  terms  of 
basis  functions,  of  the  metal  complexes.  Another 
aspect  is  the  importance  of  relativistic  effects  on 
the  electron  structure  of  the  metal,  which  is  not 
properly  accounted  for  in  a  traditional  Hartree- 
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Fock  calculation.  Yet  another  important  factor  is 
the  so  far  unclear  role  of  electron  correlation  on  the 
geometry  and  binding  energy  of  second-row  tran¬ 
sition  metals  [5].  However,  attempts  have  been 
made  to  clarify  the  role  of  electron  correlation  on 
the  geometry  of  second-row  metalorganic  com¬ 
plexes  [6].  The  aim  of  this  article  was  to  further 
study  the  effect  of  electron  correlation  and  the 
effect  of  the  basis  set  on  geometry  and  geometry 
changes  when  the  formal  oxidation  state  is  changed 
for  the  metal  ion.  This  work  is  concentrated  mainly 
on  ruthenium  complexes  with  a  nearly  octahedral 
arrangement  of  the  ligands  surrounding  the  cen¬ 
tral  metal  ion.  Several  basis  sets  and  computa¬ 
tional  methods  are  used.  A  compromise  between 
accuracy  and  computational  resource  requirements 
are  recommended  to  be  used  in  future  studies. 


Computational  Details 

The  ligand  field  around  the  central  ruthenium 
ion  has  octahedral  (O;^)  symmetry  or  close  to  O;^ 
symmetry  in  most  of  the  ruthenium  complexes  in 
this  study.  The  ruthenium  ion  has  either  a  4 
electron  configuration,  Ru(II),  or  a  4^^^  electron 
configuration,  Ru(III).  Three  d  orbitals,  xz,  j/z,  and 
xy,  are  occupied  and  two,  x^-y"^  and  z^,  are 
unoccupied.  In  a  perfect  ligand  field,  the  occu¬ 
pied  d  orbitals  have  t2g  symmetry  and  the  unoc¬ 
cupied  d  orbitals  have  symmetry.  When  the 
perfect  symmetry  is  broken  by  less  symmetric 
ligands,  the  occupied  d  orbitals,  as  well  as  the  two 
unoccupied  metal  orbitals,  will  be  separated  in 
energy.  All  the  Ru(II)  complexes  are  assumed  to 
have  a  low-spin  d^  metal  configuration,  since  the 
ligands  included  in  this  study  are  known  to  pro¬ 
duce  rather  large  ligand  fields  [7].  The  Ru(III) 
complexes  are  assumed  to  have  a  low-spin  d^ 
metal  configuration,  which  is  also  observed  in  most 
Ru(III)  complexes.  However,  the  corresponding 
high-spin  states  were  calculated  in  some  cases.  The 
highest  symmetry  a  M(NH3)6  complex  can  have  is 
In  the  symmetry,  all  the  M-N  bonds  are 
equal.  However,  we  would  like  to  be  able  to  study 
Jahn-Teller  distortions  upon  oxidation-state 
changes.  Thus,  all  geometry  optimizations  were 
performed  under  the  C2-,  symmetry  restriction.  In 
the  C2y  symmetry,  three  different  metal -ligand 
(M-L)  bond  distances  are  possible.  When  the  C2J, 
symmetry  restriction  is  used,  the  t2g  orbitals  will 
be  transformed  in  to  an  ^2  orbital,  an  orbital. 


and  a  orbital.  The  orbitals  will  be  trans¬ 
formed  into  an  orbital  and  a  i?2  orbital.  The 
geometries  and  a  simple  molecular  orbital  (MO) 
diagram  of  the  studied  complexes  are  foimd  in 
Figure  1. 

All  the  ab  initio  geometry  optimizations  were 
performed  with  the  GAMESS  set  of  programs  or 
the  Gaussian94  program  packages  [8].  A  rather 
large  number  of  basis  sets  were  tested.  The  used 
basis  sets  are  summarized  in  Table  I. 


Results  and  Discussion 

[Ru(rVH3)6l2  + 

The  most  extensive  basis-set  study  was  per¬ 
formed  for  the  [Ru(NH3)6]^'^  complex.  The  geom¬ 
etry  of  [Ru(NH3)5]^^  was  optimized  using  basis 
set  A-I  at  the  Hartree-Fock  (HF)  level  of  theory. 
The  results  are  summarized  in  Table  II.  It  is  evi¬ 
dent  that  the  HF  method  predicts  the  Ru — N  bond 
length  to  be  much  too  long  compared  to  what  is 
found  in  the  crystallographic  geometry  of 
[Ru(NH3)5]l2  [17].  This  type  of  over  estimation  of 
the  M-L  bond  length  was  also  found  in  many 
M(H20)6  complexes  studied  at  the  HF  level  of 
theory  by  Akesson  et  al.  [4h].  In  the  work  by 
Akesson  et  al.,  both  first-  and  second-row  transi¬ 
tion  metals  were  considered.  They  discussed  possi¬ 
ble  sources  to  the  rather  large  error  in  the  pre¬ 
dicted  bond  lengths  and  argued  that  hydrogen 
bonding  to  the  second  solvation  shell  is  one  possi¬ 
ble  source  of  error.  Another  possible  source  of 
error  put  forward  by  Akesson  et  al.  was  that  the 
observed  bond  lengths  are  temperature-dependent 
and  crystal  effects  might  also  contribute  to  the 
differences  between  theory  and  observations  [4h]. 

Even  though  we  do  not  expect  that  the  geome¬ 
try  will  be  well  reproduced  at  the  HF  level  of 
theory,  we  investigated  the  possible  effect  of  the 
surrounding  media  on  the  predicted  geometry  of 
the  [Ru(NH3)g]^^  complex.  The  geometry  of  the 
[Ru(NH3)g]^^  complex  was  optimized  in  the  pres¬ 
ence  of  two  counterions  to  form  a  noncharged 
complex.  Two  different  counterions  were  consid¬ 
ered:  F”  and  Cl”.  The  effective  core  potential  basis 
set  B  was  used  in  this  study.  The  counterions  were 
placed  in  between  the  equatorial  ligands  (see 
Fig.  1).  The  Ru(NH3)6F2  and  the  Ru(NH3)6Cl2 
complexes  were  optimized  under  the  €2^  symme¬ 
try  constraint.  In  both  complexes,  three  differ¬ 
ent  Ru — N  bond  lengths  were  obtained.  For 
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Simple  MO  diagram 
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FIGURE  1 .  A  schematic  picture  of  the  geometries  of  some  of  the  metal  complexes  studied  in  this  work.  It  is  also 
shown  in  a  simple  MO  diagram  of  the  4of  orbitals  on  the  metal.  The  symmetry  labels  of  the  MOs  in  different  point 
groups  are  also  indicated. 


Ru(NH3)6F2,  the  Ru— N  bond  lengths  are  pre¬ 
dicted  to  be  2.271,  2.247,  and  2.246  A,  and  for 
Ru(NH3)(iCl2,  the  Ru — N  bond  lengths  are  2.275, 
2.258,  and  2.257  A,  compared  to  the  Ru — N  bond 
length  2.286  A  predicted  for  the  "free"  complex 
using  basis  set  B.  Thus,  a  considerable  bond-length 
decrease  is  induced  by  the  counterions.  It  is  not 
clear  whether  the  bond-length  change  is  due  to 
sterical  effects  or  electronic  effects,  but  it  is  likely 
that  the  sterical  effects  are  the  most  important 
contribution  to  the  observed  bond-length  decrease. 

The  geometry  of  [Ru(NH3)J^^  was  optimized 
with  different  basis  sets  using  the  second-order 
perturbation  theory  (MP2)  to  account  for  some  of 


the  dynamic  electron  correlation.  The  predicted 
geometries  are  found  in  Table  II  together  with  the 
HF-predicted  geometries.  In  the  MP2  calculations, 
the  core  electrons  were  left  imcorrelated.  This 
means  that  with  the  ECP  basis  sets  the  12  core 
electrons  on  the  ligands  were  left  uncorrelated, 
and  for  the  all-electron  calculations,  the  core  elec¬ 
trons  of  the  ligands  and  the  Is,  2s,  2 p,  3s,  3 p,  4s, 
4p,  and  3d  electrons  on  ruthenium  were  left  uncor¬ 
related.  With  the  ECP  basis  set,  the  Ru — N  bond 
length  is  still  predicted  to  be  too  long  compared  to 
what  is  observed.  Electron  correlation  at  the  MP2 
level  of  theory  decreases  the  M-L  bond  length 
about  0.04  A  for  the  ECP  basis  sets.  The  Ru — N 
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TABLE  I  _ _ 

Summary  and  description  of  the  basis  sets  used  in  this  work. 


Basis  set 

Abbreviation 

Description 

A 

SBK  -t  STO-3G 

Relativistic  effective  core  potential  on  Ru  where  the  28  most 
inner  electrons  have  been  repalced  with  a  core  potential  [8]  and 
single  zeta  on  N  and  H  [9] 

B 

SBK-I-6-31G 

Relativistic  effective  core  potential  on  Ru  where  the  28  most 
inner  electrons  have  been  replaced  with  a  core  potential  [8]  and 
split  valence  double  zeta  on  N  and  H  [9] 

C 

HW-t6-31G 

Relativistic  effective  core  potential  on  Ru  where  the  28  most 
inner  electrons  have  been  replaced  with  a  core  potential  [10]  and 
split  valence  double  zeta  on  N  and  H  [9] 

D 

HW  -1-  D95 

Relativistic  effective  core  potential  on  Ru  where  the  28  most 
inner  electrons  have  been  replaced  with  a  core  potential  [10]  and 
double  zeta  on  N  and  H  [1 1  ] 

E 

3-21 G 

Double  zeta  [12] 

F 

3-21  G(3f) 

Double  zeta  [13]  augmented  by  three  Mype  polarization 
functions  on  Ru  (a  =  1.6,  0.4,  0.1) 

G 

MIDI 

Double  zeta  [14] 

H 

MIDKf) 

Double  zeta  [14]  augmented  by  a  set  of  Atype  polarization 
functions  on  Ru* 

I 

MIDI(fp,d) 

Double  zeta  [14]  augmented  by  a  set  of  Atype  of  ploarization 
functions  and  a  set  of  diffuse  p-type  functions  on  Ru*  and  a  set  of 
d-type  polarization  functions  on  N 

®  The  f-type  polarization  function  is  a  three-component  contraction  which  mimic  a  Slater-type  function.  The  contraction  is  due  to  the 
scheme  outlined  by  Stewart  [15]  and  Walsh  et  al.  [16]. 


bond  length  obtained  at  the  MP2  level  using  the 
all-electron  basis  set  G  is  not  altered  much  relative 
to  what  is  obtained  at  the  HF  level.  Polarization 
functions  of  /-type  on  the  metal  ion  seem  to  be 
more  important  and  the  geometry  is  predicted  in 
good  agreement  with  observations  at  the 
MP2/MIDI( /)  level  of  theory.  When  the  basis  set 
is  extended  with  a  set  of  d-type  polarization  func¬ 
tions  on  the  nitrogen  and  a  set  of  diffuse  p-type 
functions  on  the  metal,  the  M-L  bond  length  is 
increased.  A  similar  pattern  was  also  found  at  the 
HF  level  of  theory.  The  effect  of  a  higher  level  of 
electron  correlation  on  the  geometry  will  be  dis¬ 
cussed  further  in  the  Property  sub-section.  The 
relativistic  effect  on  the  geometry  was  not  explic¬ 
itly  investigated  since  all  the  ECPs  used  here  in¬ 
clude  relativistic  corrections  in  the  core  potential 
and  the  all-electron  basis  sets  do  not  include  any 
relativistic  corrections.  However,  the  expected  im¬ 
pact  on  the  predicted  geometry  when  the  relativis¬ 
tic  effects  are  sufficiently  included  in  the  calcula¬ 
tions  is  a  shortening  of  the  M— L  bond;  Thus,  the 
ECP  basis-set  calculations,  which  include  some  of 
the  relativistic  effects,  should  produce  shorter  bond 
lengths  than  do  the  all-electron  basis  sets,  pro¬ 


vided  that  the  quality  of  the  basis  sets  is  the  same 
in  the  valence  region.  Clearly,  this  is  not  the  case 
here  and  we  conclude  that  the  ECP  basis  sets  are 
not  sufficiently  flexible  to  be  used  for  this  molecule. 

An  other  important  issue  is  whether  the  correct 
spin  state  is  obtained  or  not.  The  high-spin/low- 
spin  separations  is  investigated  by  calculating  the 
energy  of  the  first  low-spin  state  in  each  irre¬ 
ducible  representation  and  the  energy  of  the  corre¬ 
sponding  high-spin  states  of  [RuCNHj)^]^'^.  Basis 
set  B  was  used  in  a  set  of  CASSCF  calculations. 
The  rational  for  using  basis  set  B  is  that  Akesson 
et  al.  used  a  similar  ECP  basis  set  in  an  investiga¬ 
tion  of  the  high-spin/low-spin  separation  for  some 
transition-metal  hexaaqua  complexes  with  modest 
success  [4h].  We  compared  the  present  results  with 
theirs  on  an  equal  basis.  In  the  CASSCF  calcula¬ 
tions,  six  electrons  were  distributed  in  all  possible 
ways  among  nine  active  orbitals.  The  low-spin 
states  were  geometry-optimized.  The  high-spin 
states  were  assumed  to  have  all  the  Ru — N  bond 
lengths  equal  to  2.26  A  and  the  geometry  of  the 
NH3  groups  was  kept  as  predicted  for  the  ’A, 
state.  The  corresponding  energies  and  Ru — N  bond 
lengths  are  summarized  in  Table  III.  The  ’Aj  low- 
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TABLE  II _ 

HF  and  MP2-predicted  geometries  of  [Ru(NH3)6]^+  using  different  basis  sets;  Qru,  q^,  and  Qh  denotes  the  Mulliken 
atomic  charge  on  the  ruthenium  atom,  the  nitrogen  atoms,  and  the  hydrogen  atoms,  respectively. 


GEOMETRY  OF  SECOND-ROW  TRANSITION-METAL  COMPLEXES 
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spin  state  is  found  as  the  ground  state  and  is 
favored  by  about  170  kj/mol  over  the  state. 
Thus,  the  CASSCF  calculations  predict  the  high- 
spin/low-spin  energy  ordering  in  accord  with  ob¬ 
servations  [7].  Akesson  et  al,  [4h]  predicted  the 
high-spin  state  to  be  the  most  stable  state  of  the 
[Ru(H20)6]^^  complex  at  both  the  HF  and  CASSCF 
levels  of  theory  in  contrast  to  what  is  found  here 
for  [Ru(NH3)6]^'^.  However,  ammonia  is  known  to 
produce  a  larger  ligand  field  splitting  than  does 
water  [7].  The  high-spin/low-spin  separation  will 
be  further  discussed  for  the  [Ru(H20)6]^'^  com¬ 
plex  later. 

lRu(rVH3)J^^ 

The  three  valent  oxidation  state  of  ruthenium 
has  five  d  electrons.  The  low-spin  groimd  state  is  a 
doublet.  The  "f2g"  orbitals  have  ^2/ 
symmetry  in  the  €2^  point  group.  Thus,  three 
different  doublets  can  be  constructed  with  C2.^ 
space  symmetry.  The  ^A2,  and  states  were 
geometry-optimized  with  the  CASSCF  method 
with  five  active  electrons  in  seven  active  orbitals 
As  for  the  Ru(Il)  ions,  the  basis  set  B  was  used  in 
the  CASSCF  calculations.  Three  almost  identical 
M-L  bond  lengths  are  found  for  all  three  doublet 
states  (see  Table  111).  The  M-L  bond  lengths  are 
predicted  to  be  much  too  long  compared  to  the 
observed  one  of  2.104  A  in  the  [Ru(NH3)6](BF4)3 
[17].  The  ^A2  state  is  found  to  be  the  most  stable 
state,  favored  over  the  state  by  28.2  kJ/mol 
and  over  the  ^A^  state  by  37.8  kj/ mol. 

The  HF/H  and  MP2/H  model  reproduced  the 
observed  geometry  of  the  2  +  complex  best.  Thus, 
the  same  basis  set  was  used  to  predict  the  geome¬ 
try  of  the  3  +  complex  at  the  unrestricted 
Hartree-Fock  level  (UHF),  restricted  open-shell 
Hartree-Fock  (ROHF)  level,  and  the  second-order 
perturbation  corrected  level  (UMP2).  The  opti¬ 
mized  geometries  of  [Ru(NH3)5]^^  are  summa¬ 
rized  in  Table  IV.  In  the  ROHF  calculations,  the 
state  symmetry  was  allowed  to  change  during  the 
SCF  iterations.  This  resulted  in  that  only  two  dou¬ 
blet  states  were  geometry-optimized.  The  ^A^  state 
was  never  accessed  in  the  ROHF  calculations,  while 
the  UHF  and  UMP2  geometry  optimizations  al¬ 
ways  ended  in  an  ^A  state. 

lRu(H20)6l"  + 

The  geometry  of  the  hexaamine  complexes  were 
best  reproduced  with  all-electron  basis  sets  G  and 
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TABLE  III _ 

CASSCF-predlcted  Ru^N  bond  lengths  of  the  first  root  in  each  symmetry  with  low-spin  of  [Ru(NH3)6]^^ 
compared  with  the  high-spin  state  of  each  symmetry. 


Energy 


state 

Ru— N  (A) 

Ru— N  (A) 

Ru— N  (A) 

(-429.  au) 

[Ru{NH3)e]^^ 

1  ^ 

V 

Exp.  2.144 

2.292 

2.292 

2.292 

.93774 

2.319 

2.322 

2.605 

.87733 

2.337 

2.337 

2.524 

.87136 

2.268 

2.478 

2.490 

.87495 

P' 

2.26 

Nonoptimized 

.87340 

h 

2.26 

.86186 

2.26 

rf 

.87314 

% 

2.26 

tf 

.77476 

[Ru(NH3)6]®^ 

Exp.  2.104 

2.228 

2.208 

2.205 

.39563 

^A 

2  2 

2.221 

2.214 

2.209 

.41004 

% 

2.228 

2.221 

2.212 

.39931 

The  high-spin  states  were  not  geometry-optimized,  the  geometry  of  the  ammonia  was  extracted  from  the  MP2  /  B  geometry,  and  the 
M-L  bond  distance  was  set  arbitrary  to  2.26  A.  Also  shown  is  the  CASSCF-predicted  Ru— N  bond  lengths  for  the  [Ru(NH3)6]3  + 
complex.  The  ECP  basis  set  B  was  used  in  the  CASSCF  calculations. 


H.  Thus,  those  basis  were  used  for  further  studies 
together  with  ECP  basis  set  B.  Even  though  the 
ECP  basis  sets  did  not  reproduce  the  geometry  of 
the  hexaamine  complexes  very  well,  they  are 
needed  for  studies  of  larger  systems  than  what  are 
considered  here.  Thus,  it  is  of  importance  to  obtain 
a  better  knowledge  of  how  well  ECP  basis  sets 
reproduce  the  geometry  of  second-row  transition- 
metal  complexes.  The  geometry  of  [Ru(H20)5]^^ 
was  optimized  assuming  symmetry.  The  pre¬ 


dicted  geometry  and  the  basis  set  and  method 
dependency  are  summarized  in  Table  V.  The 
Hartree-Fock  model  with  an  ECP  basis  set  pre¬ 
dicts  the  Ru — O  bond  length  to  be  too  long  com¬ 
pared  to  the  observed  bond  length,  as  was  also  the 
case  for  the  Ru — N  bond.  Akesson  et  al.  [4h] 
reported  a  Ru — O  bond  length  of  2.197  A  for  the 
[Ru(H20)5]^^  complex  at  the  Hartree-Fock  level 
of  theory  using  the  Hey-Wadt  ECP  basis  set  [10] 
on  ruthenium  and  a  double  zeta  basis  set  on  the 


TABLE  IV 


Predicted  geometries  of  [Ru(NH3)6P^  using  the  basis  sets  G  and  H;  and  qndenotes  the  Mulliken 

atomic  charge  on  the  ruthenium  atom,  the  nitrogen  atoms,  and  the  hydrogen  atoms,  respectively. 


Method/basis  set 

ROHF/G 

UHF/H 

UMP2/H 

"^2 

ROHF/H 

"A2 

ROHF/H 

Exp.  [1 7] 

Ru— N  (A) 

2.151 

2.159 

2.123 

2.149 

2.148 

2.104 

2.152 

2.149 

2.121 

2.149 

2.152 

2.158 

2.144 

2.118 

2.155 

2.156 

N— H  (A) 

1.024 

1.024 

1.042 

1.024 

1.024 

H— N— H 

105.3 

105.3 

105.1 

105.6 

105.3 

Qru 

1.59 

1.50 

1.51 

1.51 

Qn 

-0.93 

-0.92 

-0.92 

-0.92 

qh 

0.39 

0.39 

0.39 

0.39 

1336 


QUANTUM  CHEMISTRY  SYMPOSIUM  NO.  30 


GEOMETRY  OF  SECOND-ROW  TRANSITION-METAL  COMPLEXES 


The  predicted  geometry  of  [Ru(H20)6]^^  complex  compared  with  the  crystallographic  geometry  of 
[Ru(H20)6](C7H7S03)2  [21]  and  the  predicted  geometry  of  [Ru(H20)6]^^  complex  compared  with  the 
crystallographic  geometry  of  [Ru(H20)6](C7H7S03)3  [21]. _ _ 


Method  /  Basis  Set 


[Ru(H20)6]^*  _  [Ru(H20)6]^'^ 


HF/B 

HF/G 

HF/H 

MP2/H 

Observed 

ROHF/H 

Observed 

Ru— 0  (A) 

2.224 

2.134 

2.162 

2.094 

2.139 

2.105 

2.016 

2.224 

2.130 

2.160 

2.088 

2.121 

2.058 

2.030 

2.194 

2.108 

2.140 

2.066 

2.107 

2.040 

2.031 

2.037 

2.019 

2.031 

0— H  (A) 

0.952 

0.973 

0.973 

0.992 

0.982 

H— 0— H 

110.4 

107.8 

107.9 

110.7 

107.5 

108.2 

111.9 

109.7 

110.3 

Qru 

1.38 

1.07 

1.32 

1.39 

Qo 

-0.77 

-0.72 

-0.77 

-0.73 

Qh 

0.44 

0.44 

0.44 

0.49 

ligands.  Thus,  the  ECP  basis  sets  predict  the  Ru — O 
bond  to  be  too  long  compared  to  what  is  observed. 
In  contrast,  Sargent  and  Hall  predicted  the  geome¬ 
try  of  the  RUO4  complex  using  the  same  ECP  basis 
set  at  the  Hartree-Fock  level  of  theory  in  good 
agreement  with  observations  [4i].  In  fact,  all  basis 
sets  in  their  work  performed  remarkably  well  for 
the  RUO4  complex,  while  the  M-L  bond  lengths 
were  not  well  reproduced  at  all  for  the 
Ru(P(CH3)3)2(CO)3  complex  using  the  same  basis 
set  [4i].  The  Ru — P  bond  was  predicted  to  be  0.11 
A  too  long  and  the  Ru — C  bond  length  was  pre¬ 
dicted  0.10  A  too  long.  At  first  glance,  this  could 
appear  to  be  strange.  However,  the  ruthenium 
atom  has  a  different  formal  oxidation  state  in  the 
two  complexes.  The  Ru — O  bond  lengths  of  the 
RUO4  complex  are  predicted  to  be  1.664  A  at  the 
HF /H  computational  level,  which  is  close  to  what 
Sargent  and  Hall  found  for  all  their  basis  sets  [4i]. 

In  the  work  by  Akesson  et  al.  [4h],  only  partial 
geometry  optimization  was  performed  of  the 
[Ru(H20)6]^''‘  complex.  Furthermore,  symme¬ 
try  restrictions  were  used.  Thus,  only  one  Ru — O 
bond  length  was  obtained.  However,  the 
[Fe(H20)6]^^  was  optimized  under  both  and 
D21,  symmetry  restrictions  and  a  distorted  geome¬ 
try  was  obtained  for  the  D21,  complex 
=  0.043  A  [4h].  In  the  crystallographic-determined 
geometry,  three  different  Ru — O  bond  lengths 


were  found  [18].  Bernhard  et  al.  assumed  that 
hydrogen  bonding  was  responsible  for  the  inequal¬ 
ity  of  the  Ru — O  bond  lengths  [18].  However,  all 
the  geometries  of  the  [Ru(H20)e]^''  complex  re¬ 
ported  here  have  three  different  Ru — O  bond 
lengths,  two  almost  equal  (equatorial)  and  one  a 
little  shorter  (axial).  By  allowing  the  symmetry  to 
be  reduced  0ahn-Teller  distortion),  we  repro¬ 
duced  the  observed  bond  length  and  the  small 
deviation  from  perfect  symmetry.  The  bond 
length  difference  was  calculated  to  be 

0.02  A  for  the  [Ru(H20)6]^^  complex  at  the  HF/H 
level.  Furthermore,  Akesson  et  al.  was  not  able  to 
reproduce  the  relative  ordering  of  the  high-spin 
and  the  low-spin  state  of  the  [Ru(H20)5]^^  com¬ 
plex,  not  even  with  an  MRCI  -1-  Davidson  correc¬ 
tion  calculation  [4h].  Using  the  HF/G  model  with 
the  HF/G  geometry,  which  is  closest  to  the  ob¬ 
served  geometry,  the  high-spin  state  is  predicted 
to  be  45.2  kj/mol  higher  in  energy  than  the  low- 
spin  state  in  accord  with  what  is  observed  [18]. 
Thus,  the  correct  ordering  of  the  high-spin  and 
low-spin  states  is  possible  to  reproduce  if  an  all¬ 
electron  basis  set  is  used,  but  the  correct  ordering 
is  not  obtained  with  an  ECP  basis  set.  The  reason 
for  the  somewhat  disappointing  performance  of 
the  ECP  basis  sets  might  be  that  the  core  potential 
is  not  flexible  enough  to  account  for  the  core  con¬ 
traction  due  to  the  ionic  state  of  the  metal.  The 
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ECPs  are  obtained  for  neutral  molecules  and  not 
for  the  ions  that  are  studied  here.  It  might  be 
necessary  to  develop  different  ECPs  for  different 
ion  charges  in  order  to  increase  the  usefulness  of 
the  ECP  basis  sets. 

In  Table  V  is  also  the  ROHF/H-predicted  ge¬ 
ometry  state  of  the  [Ru(H20)6]^"  complex 
compared  with  the  observed  geometry.  A  rather 
large  Jahn-Teller  distortion  is  found  for  the  Ru(III) 
hexaaqua  complex  that  was  not  found  for  the 
corresponding  hexaamine  complex.  The  predicted 
bond-length  change  upon  reduction  of  the  metal 
ion  AR  ^  0.086  A  is  in  good  accord  with  the 
observed  A  R  =  0.093  A.  It  has  been  argued  that 
the  different  bond-length  change  for  the  hexaaqua 
and  hexaamine  complexes  when  the  ruthenium 
oxidation  state  is  changed  is  due  to  tt  interaction 
in  the  case  of  H2O,  which  is  not  possible  for  the 
NH3  ligands  [18,  19].  The  tt  back-bonding  effect 
should  be  manifested  by  an  increase  of  the  Mul- 
liken  charge  of  the  metal  ion.  If  the  Mulliken 
charge  of  the  ruthenium  ion  for  the  hexaamine  and 
hexaaqua  complexes  is  compared,  a  somewhat 
larger  Mulliken  charge  is  found  for  the  hexaamine 
complex.  Thus,  the  bonding  tt  orbital  on  the  water 
molecules  donates  a  fraction  of  about  0.1  elec¬ 
tron  charges  to  the  ruthenium  ion  (see  Tables  II,  IV 
and  V). 

Mo(CO)^^  AND  Ku(CO)3 

This  far,  only  ruthenium  complexes  with  small 
or  no  back-bonding  effects  have  been  considered. 
We  also  included  a  ruthenium  carbonyl  complex 
in  this  study  in  order  to  investigate  an  M-L  bond 
where  the  back-bonding  effects  are  very  important. 
The  molybdeniumhexacarbonyl  complex  was  also 
studied  since  it  has  octahedral  symmetry  and  was 
included  in  the  study  by  Sargent  and  Hall  [4i].  The 
formal  oxidation  state  of  the  central  metal  is  zero. 
The  MoCCO)^  complex  was  geometry-optimized 
under  the  symmetry  restriction.  Thus,  all  six 
CO  ligands  are  identical  and  all  the  Mo — C  bond 
lengths  are  equal.  At  the  HF/G  level  of  theory,  the 
Mo — C  bond  length  is  predicted  to  be  2.094  A, 
which  is  a  little  longer  than  the  observed  2.063  A 
[20].  The  C — O  bond  length  is  calculated  to  be 
1.137  A  compared  to  the  observed  1.145  A  [20]. 
Sargent  and  Hall  reported  Mo — C  bond  lengths 
between  2.122  and  2.092  A.  The  geometry  of 
Mo(CO)<3  is  reproduced  reasonably  well  already  at 
the  HF  level  of  theory  even  with  a  rather  compact 


all-electron  basis  set  and  with  the  ECP  basis  sets. 
The  geometry  of  the  RuCCO)^  complex  was  opti¬ 
mized  under  the  C2-^  symmetry  restriction  at  the 
HF/G  level  of  theory.  Three  different  Ru — C  bond 
lengths  are  possible.  However,  the  two  different 
equatorial  Ru — C  bond  lengths  were  found  to  be 
equal.  Thus,  the  Ru— C^^iai  bond  length  was  pre¬ 
dicted  to  be  2.010  A  and  the  equatorial  Ru — C 
bond  length  was  predicted  to  be  1.991  A,  To  the 
best  of  my  knowledge,  no  crystallographic  geome¬ 
try  is  available  for  the  Ru(CO)5  complex,  but  the 
geometry  of  the  related  Ru(P(CH3)3)2(CO)3  has 
been  reported  [21].  The  observed  Ru — C  is  1.890 
A,  which  is  0.11  A  shorter  that  what  was  predicted 
at  the  HF/G  computational  level.  That  is  the 
largest  difference  between  theory  and  observations 
in  this  work  at  the  HF/G  level  of  theory.  How¬ 
ever,  the  crystallographic  geometry  is  perhaps  not 
appropriate  for  a  detailed  comparison  since  the 
rather  large  P(CH3)3  might  affect  the  Ru — C  bond¬ 
ing  situation.  Sargent  and  Hall  reported  a  similar 
Ru — C  bond  length  for  the  Ru(P(CH3)3)2(CO)3 
complex  using  both  ECP  basis  sets  and  all-electron 
basic  sets  at  the  HF  level  of  theory. 

PROPERTIES 

An  interesting  question  is:  Which  level  of  theory 
is  necessary  to  be  used  to  reproduce  properties, 
such  as  vibration  spectrum  or  bond-length  changes 
upon  oxidation/reduction  of  the  central  metal  ion? 
Both  these  two  properties  are  of  importance  in 
electron-transfer  theory  since  the  bond-length 
change  and  the  force  constant  for  the  symmetric 
M-L  stretching  are  proportional  to  the  internal 
reorganization  energy,  in  the  Marcus  and  Hush 
model  for  electron  transfer  [22].  in  the  case  of 
an  ML^'  -ML^^  system  is  given  by  Eq,  (1): 

where  /2  and  are  the  force  constants  for  the 
breathing  mode  of  the  2  +  and  3  +  complex  and 
is  the  metal-ligand  bond-length  change 
upon  oxidation/reduction. 

The  observed  bond  length  changes  when  the 
[Ru(NH3)5]^^  complex  is  reduced  is  0.04  A  [17]. 
At  the  CASSCF/^B  level  of  theory,  a  bond-length 
change  of  0.075  A  is  predicted.  At  the  HF-UHF/ 
ROHF  level  of  theory  with  basis  set  H,  a  ARj^i-l 
of  0.058  A  is  predicted.  A  AR^^j^l  of  0.017  A  is 
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predicted  at  the  MP2/UMP2/H  level  of  theory. 
Thus,  a  rather  large  spread  in  the  predicted  ^Rm-l 
is  obtained  and  the  method  dependency  is  large. 
The  bond-length  change  upon  reduction  of  the 
central  metal  ion  is  best  reproduced  at  the 
HP /UHF /ROHF  level  of  theory.  Fiowever,  at  the 
MP2/H  level,  the  geometry  is  best  reproduced. 
For  the  [Ru(H20)6]^^^^^  redox  pair,  the  observed 
bond-length  change  was  better  reproduced  (see 
Table  V). 

To  calculate  the  full  vibration  spectrum,  the 
Hess  matrix  has  to  be  diagonalized.  It  is  a  very 
time-consuming  issue  if  the  basis  set  is  large 
and /or  at  a  high  level  of  electron  correlation  cor¬ 
rection.  A  full  vibrational  analysis  was  performed 
at  the  HF-ROHF/G  level  of  theory  for  the  two 
ruthenium  hexaamine  complexes  (see  Fig.  2  and 
Table  VI).  When  the  predicted  frequencies  associ¬ 
ated  with  vibrations  within  the  amino  group  are 
scaled  by  a  factor  of  0.9,  the  agreement  between 
the  observed  frequencies  and  the  predicted  fre¬ 
quencies  is  good.  A  scaling  factor  0.9  for 
Hartree-Fock  vibrational  frequencies  to  obtain  a 
good  agreement  between  theory  and  observations 
is  generally  accepted  nowadays.  The  reasons  for 
scaling  the  frequencies  are  the  lack  of  electron 


correlation  in  the  calculation  and  that  the  vibration 
analysis  neglect  the  anharmonicity  of  the  vibra¬ 
tions.  The  skeletal  vibrational  spectra  are  repro¬ 
duced  rather  well  without  scaling,  which  is  in 
accord  with  that  anharmonicity  is  less  important 
for  the  low-energy  mode  since  the  vibrational  am¬ 
plitude  is  smaller  than  for  the  high-frequency 
modes.  It  should  be  mentioned  here  that  in  the 
vibrational  analysis  four  imaginary  frequencies 
were  obtained.  The  imaginary  frequencies  are  due 
to  rotations  of  the  amino  groups  in  a  way  to 
increase  the  symmetry  to  the  highest  possible  sym¬ 
metry  This  small  deviation  from  the  highest 

possible  symmetry  causes  some  vibrational  mode 
to  be  slightly  mixed,  e.g.,  the  symmetric  and  asym¬ 
metric  breathing  vibrational  mode  for  the  Ru(III) 
complex  are  slightly  coupled. 

At  a  higher  level  of  theory,  the  breathing  vibra¬ 
tion  mode  was  explored  by  freezing  the  geometry 
of  the  ligands  at  the  MP2/G  geometry  and  vary¬ 
ing  the  Ru — N  bond  length.  The  breathing  vibra¬ 
tion  modes  are  summarized  in  Table  VII.  It  is 
interesting  to  note  that  the  Ru — N  bond  distance  is 
not  changed  when  higher  level  of  electron  correla¬ 
tion  is  included  in  the  calculation.  All  the  breath¬ 
ing  frequencies  predicted  with  the  Moller-Plesset- 


FIGURE  2.  The  vibrational  spectrum  of  [Ru(NH3)6]^^  calculated  at  the  HF  /  basis  set  G  level  of  theory. 
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TABLE  VI 


The  vibration  spectrum  of  [Ru(NH3)6]^"^^^+  as  predicted  with  the  HF/G  model;  the  displayed  vibration 
frequencies  are  unscaied. 


»'a 

5a 

5s 

p 

V 

^as 

5as 

8 

NH3 

NH3 

H— N— H 

I 

1 

1 

X 

NH3 

M^N 

M— N 

M— N 

N— M— N 

N— M— N 

HF/G 

-  3615 

3532 

1822- 

1499 

806 

436 

397 

381 

243 

240 

1867 

397 

382 

241 

384 

Obsd.® 

3315 

3210 

1612 

1220 

763 

409 

_ 

_ 

_ 

_ 

ROHF/G 

3540- 

3479 

1826 

1609 

885 

516 

462 

424 

278 

248 

3569 

3486 

1832 

1611 

890 

468 

435 

255 

Obsd.^ 

1852 

1616 

920 

436 

— 

— 

— 

— 

— 

500 

475 

463 

273 

248 

'“[Ru(NH3)e]Ci2,from  [23], 
'’[Ru(NH3)e]Cl3,from  [24], 


type  treatment  of  electron  correlation  are  predicted 
to  be  too  high  in  energy  compared  to  the  observed 
one,  while  the  coupled  cluster  calculations  predict 
too  low  breathing  frequencies.  A  similar  pattern 
was  observed  in  a  work  on  FeCO  by  Ricca  et  al. 
[25],  This  pattern  can  also  be  taken  as  an  indication 
on  the  lack  polarization  functions  on  the  transition 
metal,  which  was  shown  earlier  to  have  a  large 
effect  on  the  predicted  geometry  at  the  MP2  level 
of  theory.  However,  it  was  not  possible  to  use  a 
larger  basis  set  in  this  study  due  to  limitations  of 
the  local  computer  environment. 

m-\IS-SPECTR/\ 

As  a  final  test  of  the  basis  sets,  the  electronic 
spectrum  of  the  [RufNHj)^]^^  was  calculated.  In 
this  part,  only  basis  set  G  was  used,  Meyer  re¬ 
ported  the  electronic  spectrum  of  [RufNHj)^]^^  in 
his  thesis  work  and  found  a  very  weak  band 
centered  at  3.10  eV,  ^  =  30  M"’  cm“’,  and  a 

TABLE  VII _ _ _ _ 

Harmonic  breathing  mode  frequencies  of 
[RufNHglg]^^  as  predicted  at  different  ievels  of 
theory  using  the  MIDI  basis  set  (basis  set  G). 


Ru— N  (A) 

v(Ru~N) 

Experimental 

(cm”'') 

(cm  ”  ■')  [20] 

HF 

2.207 

436 

403  -423 

MP2 

2.182 

442 

MP3 

2.181 

448 

MP4(SDQ) 

2.183 

446 

CCSD 

2.185 

383 

CCSD(T) 

2.183 

382 

somewhat  more  intense  band  at  4.51  eV,  s  =  630 
M~’  cm~  ’  [26].  Both  bands  are  due  to  Ru  4d  Ad* 
(t2g  Sg)  transitions.  It  is  well  known  that  most 
ab  initio  methods  do  not  reproduce  transition  en¬ 
ergies  in  the  UV  and  visible  frequency  regions 
very  well.  Recent  developments  of  the  multirefer¬ 
ence  perturbation  theory  have,  however,  shown  a 
promising  future  in  this  area  [27].  In  most  ab  initio 
studies  of  the  absorption  spectrum,  the  transition 
energies  must  be  scaled  due  to  the  lack  of  a  good 
treatment  of  the  dynamic  electron  correlation  in 
the  excited  state  [28].  Two  different  types  of  elec¬ 
tronic  spectrum  calculations  are  used  here.  The 
first  method  uses  HF  orbitals  as  the  starting  or¬ 
bitals  in  a  Cl  calculation  where  all  possible  single, 
double,  and  triple  substitutions  within  a  restricted 
active  space  are  considered.  This  is  the  lowest 
level  of  configuration  interaction  which  brings  in 
electron  correlation  of  the  excited  states  [29].  This 
method  will  be  abbreviated  as  CI(SDT)  hereafter. 
The  second  method  starts  with  a  small  CASSCF 
calculation  where  the  general  density  matrices  are 
formed  from  an  average  of  the  ground  state  and 
the  two  lowest  excited  states.  The  final  CASSCF- 
optimized  orbitals  are  then  used  in  a  CI(SDT) 
calculation.  This  method  will  be  abbreviated  CAS- 
CI(SDT)  hereafter.  In  the  CI(SDT)  calculation,  the 
six  t2g  electrons  were  allowed  to  be  excited  into 
the  30  lowest  unoccupied  molecular  orbitals 
(LUMOs);  20,118  configuration-state  functions 
were  generated  in  this  way.  The  first  transition 
energy  was  calculated  to  be  4.79  eV,  corresponding 
to  a  "t2g  e^"  metal-centered  transition.  Due  to 
symmetry  restrictions  in  the  Cl  calculation,  only 
one  transition  energy  was  calculated;  however. 
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two  more  "t2g  e^"  transitions  are  to  be  found  at 
about  the  same  energy  but  with  different  symme¬ 
try.  The  second  transition  is  predicted  at  6.72  eV 
and  has  also  a  "t2g  e^'  character.  In  similarity 
with  ab  initio  Cl  calculations  of  the  absorption 
spectrum  of  the  DNA  bases  [28],  the  CI(SDT) 
method  predicts  the  transition  energies  to  be  too 
high  in  energy.  Petke  et  al.  used  a  scaling  factor  of 
about  0.7  to  obtain  qualitative  good  agreement 
with  the  observed  absorption  spectra  [28].  A  simi¬ 
lar  scaling  of  the  predicted  CI(SDT)  "t2g 
transition  energies  give  the  first  peak  at  3.35  eV 
and  the  second  peak  at  4.70  eV,  which  still  is  about 
0.2  eV  above  the  observed  transition  energies.  The 
CAS-CI(SDT)  method  predicts  the  first  band  at 
3.02-3.13  eV  ~  0.0)  that  contains  three  "t2g 
Bg”  transitions;  no  symmetry  restriction  was  used 
in  this  Cl  calculation.  The  second  band  with  three 
''t2g  Sg”  transitions  are  predicted  at  4.27-4.35 
eV  ~  10"®).  The  CAS-CI(SDT)  transition  ener¬ 
gies  are  in  excellent  agreement  with  the  observed 
spectrum.  However,  this  very  good  agreement  is 
likely  due  to  cancellations  of  errors  since  a  rather 
compact  basis  set  was  used.  However,  the  success 
of  the  CAS-CI(SDT)  treatment  indicates  that  it  is  of 
great  importance  to  optimize  the  virtual  orbital 
space  to  obtain  qualitative  agreement  between  cal¬ 
culated  and  observed  electronic  spectra.  This  issue 
is  currently  under  further  study  in  this  laboratory. 
Preliminary  results  indicate  that  electronic  spectra 
of  organic  molecules  could  be  reproduced  with 
high  accuracy,  without  a  scaling  of  the  transition 
energies,  using  the  computational  scheme  pro¬ 
posed  above  [CAS-CI(SDT)]  [30]. 


Summary 

The  basis-set  dependency  of  the  predicted  ge¬ 
ometry  of  some  octahedral  ruthenium  complexes 
was  reported.  Ab  initio  HP  calculations  with  effec¬ 
tive  core  potential  basis  sets  do  not  reproduce  the 
observed  metal  M-L  bond  lengths  very  well.  The 
geometry  is  better  reproduced  if  all-electron  basis 
sets  are  used.  Polarization  functions  of  /-type  on 
the  central  metal  ion  improve  the  description  but 
the  predicted  M-L  bond  lengths  are  still  too  long. 
Electron  correlation  at  the  second-order  perturba¬ 
tion  theory  level  in  combination  with  an  all-elec¬ 
tron  basis  set  augmented  with  a  set  of  /-type 
polarization  functions  [MP2/MIDI(/)]  is  the  mini¬ 
mum  level  of  theory  that  is  needed  to  reproduce 


the  geometry  of  octahedral  second-row  transition- 
metal  complexes  with  an  acceptable  accuracy.  A 
similar  level  of  theory  (MP2/6-31G*)  has  been 
proven  to  be  sufficient  to  accurately  reproduce  the 
geometry  of  small  and  medium-sized  molecules 
[28,  31,  32].  ECP  basis  sets  seem  to  be  too  rigid  in 
the  core  region  to  reproduce  observed  geometries 
of  ruthenium  complexes  when  the  metal  is  in  an 
ionic  state.  When  the  metal  has  mostly  atomic 
character,  the  observed  geometries  are  better  re¬ 
produced. 

Properties  such  as  geometry  changes  due  to 
reduction/oxidation  of  the  central  metal  ion  are 
reproduced  well  already  at  the  HE  level  of  theory. 
Inclusion  of  electron  correlation  does  not  improve 
the  results.  The  high-spin/low-spin  state  separa¬ 
tion  is  predicted  in  accord  with  observations  at  the 
HP  level  of  theory  with  the  all-electron  basis  set  H 
in  contrast  to  what,  e.g.,  Akesson  et  al.  observed 
for  the  [Ru(H20)6]^'®  complex  and  other  second- 
row  transition  metal  complexes  using  ECP  basis 
sets  [4i].  Thus,  we  conclude  that  to  reproduce  the 
observed  high-spin/low-spin  separation  all-elec¬ 
tron  basis  sets  are  needed  at  least  for  complexes 
where  the  metal  is  in  an  ionic  state. 

The  full  vibration  spectrum  of  [RuCNHj)^]^^/^'^ 
predicted  at  the  HF/MIDI  level  of  theory  agrees 
reasonably  well  with  the  observed  spectrum  if  a 
linear  scaling  factor  of  0.9  is  applied  on  the  calcu¬ 
lated  frequencies,  in  similarity  with  what  has  been 
found  for  small  organic  molecules.  The  skeletal 
vibrational  modes  are  accurately  reproduced  with¬ 
out  any  scaling.  At  a  high  level  of  electron  correla¬ 
tion,  only  the  breathing  vibrational  mode  has  been 
explored.  The  MPn  methods  all  predict  a  breathing 
mode  frequency  higher  in  energy  that  what  is 
observed  and  the  coupled  cluster  methods  predict 
the  breathing  mode  too  low  in  energy  compared 
with  the  observed  IR  spectrum.  Polarization  func¬ 
tions  are  probably  needed  to  reproduce  the  ob¬ 
served  breathing  vibrational  mode  at  the  electron- 
correlated  level  of  theory. 

The  observed  electronic  spectrum  {t2g  Sg 
transitions)  of  [RufNHj)^]^'*'  is  reproduced  sur¬ 
prisingly  well  using  basis  set  G  and  a  novel  strat¬ 
egy  to  do  Cl  calculations  on  excited  states.  The 
strategy  can  be  summarized  as  follows:  First,  the 
important  occupied  and  virtual  orbitals  are  opti¬ 
mized  in  a  small  complete  active  space  self-con¬ 
sistent  field  calculation  (CASSCF).  For  the  present 
complex,  this  means  that  the  occupied  t2g  orbitals 
and  the  lowest  unoccupied  orbitals  are  opti- 
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mized.  The  optimized  natural  orbitals  are  then 
used  in  a  Cl  calculation  where  all  single,  double, 
and  triple  excitations  in  a  restricted  active  space 
are  considered  [CI(SDT)].  In  this  case,  all  MOs  that 
have  large  metal  character  are  considered  in  the 
CI(SDT)  calculation.  Using  this  procedure,  electron 
correlation  is  introduced  in  a  democratic  way  in 
both  the  ground  state  and  the  excited  states. 
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ABSTRACT  _ _ 

The  lattice  constant,  cohesive  energy,  bulk  modulus,  and  band  gap  for  Si  were  calculated 
with  the  linear  combinations  of  Gaussian-type  orbitals-fitting  function  (LCGTO-FF) 
technique  using  three  distinct  types  of  charge  density  and  exchange-correlation  fitting 
function  basis  sets:  (1)  site-centered  s-type  GTOs  only;  (2)  site-  and  bond-centered  s-type 
GTOs;  and  (3)  site-centered  s-  and  /-type  GTOs.  All  three  basis  sets  produce  good  results 
for  the  lattice  constant  and  bulk  modulus  of  Si,  but  only  the  two  larger  basis  sets 
determine  the  cohesive  energy  and  LDA  band  gap  accurately.  The  numerical  results 
obtained  with  the  two  larger  basis  sets  are  in  good  quantitative  agreement  with  each 
other  and  with  results  from  other  techniques.  ©  1996  John  Wiley  &  Sons,  Inc."*" 


Introduction 

The  linear  combinations  of  Gaussian-type  or¬ 
bitals -fitting  function  (LCGTO-FF)  method 
is  an  all-electron,  full-potential,  density  functional, 
electronic  structure  technique  characterized  by  its 
use  of  three  independent  GTO  basis  sets  to  expand 
the  one-electron  orbitals,  charge  density,  and 
exchange  correlation  (XC)  integral  kernels.  The 
LCGTO-FF  technique  is  routinely  used  now  to 

*The  Los  Alamos  National  Laboratory  requests  that  the 
publisher  identify  this  article  as  work  performed  under  the 
auspices  of  the  U.S.  Department  of  Energy. 


perform  local  density  approximation  (LDA)  calcu¬ 
lations  on  molecules  [1-3],  polymers  [4-6],  ultra- 
thin  films  [7-9],  and  crystalline  solids  [10-12].  One 
of  the  primary  advantages  of  the  LCGTO-FF 
method  over  other  extant  LDA  electronic  structure 
techniques  is  this  ability  to  treat  localized  (molecu¬ 
lar)  and  periodically  extended  (crystalline)  systems 
on  an  equal  footing,  thereby  bridging  the  long¬ 
standing  gap  between  quantum  chemistry  and 
solid-state  physics.  To  exploit  this  advantage  of  the 
LCGTO-FF  technique  fully,  a  new  computer  code 
(GTOFF)  [10]  was  developed  recently  to  perform 
LCGTO-FF  calculations  on  general  systems  with 
3D,  2D,  and  ID  (not  yet  implemented)  periodici¬ 
ties. 


International  Journal  of  Quantum  Chemistry:  Quantum  Chemistry  Symposium  30,  1345-1349  (1996) 
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Although  the  LCGTO-FF  method  is,  in  princi¬ 
ple,  capable  of  producing  fully  consistent  results 
for  local  and  extended  systems,  in  practice,  there 
are  at  least  two  potential  obstacles  to  making  di¬ 
rect  quantitative  comparisons  between  results  for 
such  systems.  First,  there  are  a  number  of  algorith¬ 
mic  differences  between  the  implementation  of  the 
LCGTO-FF  method  generally  employed  for  the 
calculations  on  molecules  and  polymers  [2]  and 
that  used  for  most  of  the  calculations  on  films  and 
crystalline  solids  [8-10].  The  most  significant  dif¬ 
ference  between  the  two  implementations  is  in  the 
technique  used  for  fitting  the  XC  integral  kernels. 
In  the  molecular  formulation,  the  unfitted  XC  inte¬ 
gral  kernels  are  formed  from  the  exact  electron 
density,  whereas  GTOFF  forms  the  unfitted  XC 
kernels  from  the  fitted  density.  Since  formation  of 
the  exact  electron  density  is  a  time-consuming 
process  for  crystalline  solids  [10],  the  impact  of 
this  algorithmic  distinction  can  be  most  efficiently 
addressed  via  molecular  calculations  and,  hence, 
lies  beyond  the  scope  of  the  present  work  using 
GTOFF. 

The  second  potential  obstacle  to  direct  compari¬ 
son  of  LCGTO-FF  calculations  for  various  types  of 
systems  is  more  subtle.  LCGTO-FF  calculations 
for  molecules  occasionally  have  used  bond-center¬ 
ed  s-type  GTO  fitting  functions  to  achieve  accurate 
num.erical  results  without  including  large-/  GTO 
fitting  functions  in  the  site-centered  expansions 
[13].  In  contrast,  nearly  all  calculations  for  films 
and  crystalline  solids  have  used  site-centered  fit¬ 
ting  functions  only,  since  the  higher  symmetry  of 
these  systems  often  excludes  many  components  of 
the  larger  /-type  fitting  functions  and  the  lengthy 
Coulomb  lattice  sums  converge  more  rapidly  for 
the  large-/  charge  fitting  functions  than  for  the 
s-type  charge  fitting  functions.  This  systematic  dif¬ 
ference  between  the  fitting  function  basis  sets  used 
in  molecular  and  crystalline  LCGTO-FF  calcula¬ 
tions  raises  questions  about  the  relative  merits  of 
these  two  approaches  to  fitting  the  charge  density 
and  XC  kernels  and  the  extent  to  which  results 
obtained  using  the  two  approaches  can  be  com¬ 
pared  directly. 

In  this  investigation,  these  questions  are  ad¬ 
dressed  by  calculating  the  properties  of  crystalline 
Si,  in  the  cubic  diamond  structure,  with  GTOFF, 
using  three  types  of  fitting  function  basis  sets:  (1) 
site-centered  s-type  fitting  functions  only;  (2)  site- 
and  bond-centered  s-type  fitting  functions;  and  (3) 
site-centered  s-  and  /-type  fitting  functions.  This 
particular  test  case  has  been  chosen  for  several 


reasons:  Most  importantly,  cubic  Si  has  well-de¬ 
fined  tetrahedral  bonds,  which  ensure  that  the 
site-centered  s-type  only  basis  will  be  severely 
taxed.  In  addition,  crystalline  Si  has  been  studied 
recently  with  at  least  two  other  all-electron,  full- 
potential,  electronic  structure  techniques  [14,  15], 
which  provide  an  external  reference  for  the  present 
study.  Finally,  since  Si  routinely  appears  in 
molecules,  polymers,  surfaces,  and  crystals  that 
are  of  great  technological  importance,  consistent 
calculations  for  these  various  types  of  systems 
could  prove  to  be  invaluable. 

The  implementation  of  the  LCGTO-FF  method 
used  in  GTOFF  was  discussed  in  considerable  de¬ 
tail  elsewhere  [8-10].  That  discussion  will  not  be 
repeated  here.  Instead,  the  following  section  dis¬ 
cusses  only  those  methodological  details  that  are 
peculiar  to  this  particular  investigation.  Results  are 
presented  and  discussed  in  the  third  section.  A 
few  concluding  remarks  are  given  in  the  final 
section. 


Computational  Details 

The  orbital  basis  set  used  here  was  derived 
from  Huzinaga's  lls7p  atomic  basis  [16].  To  re¬ 
duce  the  size  of  the  secular  matrix,  the  atomic 
basis  set  was  contracted  into  a  6s4p  basis  set  using 
contraction  coefficients  derived  from  a  series  of 
atomic  calculations  using  the  same  LDA  parame¬ 
terization  as  was  subsequently  used  for  the  crystal 
(Hedin-Lundqvist  [17]).  The  orbital  basis  set  then 
was  augmented  with  a  single  ^^-type  GTO,  whose 
exponent  was  found  to  minimize  approximately 
the  total  energy  of  crystalline  Si  in  a  series  of  test 
calculations.  The  final  6s4:pld  orbital  basis  set  used 
here  is  shown  in  Table  1. 

The  site-centered  s-type-only  charge  and  XC 
fitting  function  basis  sets  used  here  each  consisted 
of  nine  GTOs  (see  Table  II).  The  exponents  for 
these  sets  were  selected  on  the  basis  of  previous 
experience  with  a  wide  range  of  materials  and 
were  fine-tuned  by  requiring  that  the  site-centered 
s-type  fitting  functions  should  produce  good  re¬ 
sults  for  an  isolated  Si  atom.  The  site-centered 
s-type  basis  sets  were  then  augmented  as  desired 
either  with  two  site-centered  /-type  functions,  with 
only  the  xyz  component  retained  due  to  symmetry 
considerations,  or  by  a  single  s-type  function  in 
the  center  of  each  bond  (see  Table  II).  Since  there 


1346 


QUANTUM  CHEMISTRY  SYMPOSIUM  NO.  30 


GAUSSIAN  FITTING  FUNCTION  BASIS  SETS  FOR  SILICON 


TABLE  I  _ _ _ _ ^ ^ ^ - 

Gaussian  exponents  (a)  and  contraction  coefficients  (C)  of  the  orbital  set  used  for  Si;  horizontal  lines 

separate  the  contractions. 


s-Type 

p-Type 

d-Type 

ct 

C 

a 

C 

a  C 

26737.389 

0.0010354 

163.73270 

0.0118853 

0.45  1 .00 

4076.3786 

0.0077158 

38.352081 

0.0805337 

953.29315 

0.0376661 

12.021114 

0.2705100 

274.58441 

0.1368397 

4.1845733 

0.4640613 

90.681987 

0.2433422 

1.4827170 

1.0000000 

90.681987 

0.0924999 

0.33498569 

1.0000000 

33.529057 

0.4327388 

0.09699384 

1.0000000 

13.457629 

0.1933028 

4.0506916 

1.0000000 

1.4841839 

1.0000000 

0.27044323 

1 .0000000 

0.09932161 

1 .0000000 

are  two  atomic  sites  and  four  bond  centers  in  each 
primitive  unit  cell  of  the  diamond  structure,  both 
augmentation  schemes  add  a  total  of  four  GTOs  to 
each  of  the  fitting  function  basis  sets.  No  effort 
was  made  to  optimize  the  exponents  of  these  addi¬ 
tional  fitting  functions. 

In  all  calculations  reported  here,  the  Brillouin 
zone  (BZ)  integrals  were  evaluated  over  the  entire 
BZ  on  a  uniform  6x6x6  mesh,  with  16  irre¬ 
ducible  points,  using  the  histogram  technique. 
One-electron  eigenvalues  were  calculated  at  three 
additional  /c-points  along  the  A-axis,  near  the  X- 
point,  to  allow  an  accurate  determination  of  the 
indirect  band  gap  for  Si.  The  SCF  portion  of  each 
calculation  was  iterated  until  the  total  energy  var- 

TABLE  II _ 

Gaussian  exponents  of  the  site-centered  s-type, 
site-centered  f-type,  and  bond-centered  s-type 
charge  and  XC  basis  sets  used  for  Si. 


Charge 

XC 

Site  s 

Site  f 

Bond  s 

Site  s 

Site  f 

Bond  s 

1000.00 

1.00 

1.00 

360.00 

1.00 

1.00 

260.00 

0.30 

90.00 

0.30 

72.00 

27.00 

24.00 

9.00 

8.10 

3.00 

2.70 

1.00 

0.90 

0.36 

0.35 

0.17 

0.13 

0.08 

ied  by  less  than  1  /xRy/atom.  All  the  calculations 
were  nonrelativistic. 


Results 

For  each  of  the  fitting  function  schemes  de¬ 
scribed  above,  the  total  energy  of  crystalline  Si  was 
calculated  at  six  lattice  constants:  10.0,  10.1,  10.2, 
10.26,  10.3,  and  10.4,  all  in  bohr.  Binding  energies 
were  then  obtained  by  subtracting  the  energy  of  an 
isolated,  spin-polarized  Si  atom,  —576.391596  Ry, 
as  determined  from  an  atomic  calculation  with  one 
additional  diffuse  p-type  GTO  added  to  the  orbital 
basis  set  to  compensate  for  the  effect  of  the  orbital 
basis  functions  on  neighboring  sites  during  the 
crystalline  calculations.  Finally,  the  equilibrium 
lattice  constants  (uq),  cohesive  energies  (E^),  and 
bulk  moduli  (B)  were  obtained  by  fitting  the  cal¬ 
culated  binding  energies  with  a  slightly  modified 
form  [18]  of  the  universal  equation  of  state  [19]. 
Each  fit  had  a  standard  deviation  of  less  than  0.01 
mRy/atom.  The  equilibrium  band  gap  (£^)  for 
each  series  of  calculations  was  then  determined 
via  linear  interpolation. 

In  Table  III,  the  equilibrium  properties  of  Si 
obtained  with  the  three  fitting  function  schemes 
discussed  earlier  are  compared  with  results  from 
recent  calculations  using  the  scalar-relativistic 
full-potential  linearized  muffin-tin  orbital  (FP- 
LMTO)  method  and  the  full-potential  Korringa- 
Kohn-Rostoker  (FKKR)  method  with  fully  rela- 
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TABLE  IN _ 

The  lattice  constant  (aQ,  in  bohr),  static  lattice 
cohesive  energy  (E^,,  in  eV  /  atom),  bulk  modulus 
(S,  in  GPa),  and  band  gap  {Eg,  in  eV)  obtained 
here  for  silicon  with  the  LCGTO-FF  method,  using 
three  different  fitting  function  basis  sets,  compared 
with  previous  results  from  FP-LMTO  [14]  and  FKKR 
[15]  calculations  and  experimental  data  [20];  except 
B,  which  is  taken  from  [21]. 


So 

Ec 

e 

Es 

FP-LMTO 

10,22 

5.18 

95 

FKKR 

10.24 

96 

0.49 

Site  s 

10.25 

5.26 

101 

0.76 

Site  s  +  bond  s 

10.25 

5.14 

97 

0.52 

Site  s  +  site  f 

10.25 

5.14 

99 

0.51 

Experiment 

10.26 

4.63 

98 

1.17 

tivistic  cores  and  nonrelativistic  valance  electrons. 
Table  III  also  lists  experimental  values  for  the 
various  parameters  [20,  21].  Several  interesting  fea¬ 
tures  are  readily  apparent  in  Table  III:  First,  the 
lattice  constants  and  bulk  moduli  obtained  here 
with  the  three  fitting  function  schemes  are  nearly 
identical,  with  the  lattice  constants  varying  by  less 
than  0.1%  and  the  bulk  moduli  varying  by  about 
4%,  a  negligible  variation  for  a  fitted  second 
derivative.  This  insensitivity  of  Uq  and  B  to 
changes  in  the  fitting  function  basis  sets  reflects 
the  overall  robustness  of  the  fitting  algorithms 
used  in  GTOFF,  especially  the  variational  charge 
fit  [1].  The  various  LCGTO-FF  results  for  Uq  and  B 
are  all  in  slightly  better  agreement  with  the  experi¬ 
mental  data  than  are  the  corresponding  FP-LMTO 
and  FKKR  results. 

Although  the  cohesive  energies  obtained  with 
the  different  fitting  function  schemes  vary  only  by 
0.12  eV/atom  (~  2%),  that  energy  difference  is 
significant.  The  cohesive  energy  from  the  s-type 
only  calculation  is  larger  than  are  the  cohesive 
energies  obtained  with  the  richer  basis  sets  be¬ 
cause  the  variational  charge  fit  procedure  ensures 
that  the  fitted  Coulomb  energy  is  a  lower  bound  to 
the  exact  Coulomb  energy  [1].  The  virtually  perfect 
agreement  between  the  cohesive  energies  obtained 
with  the  two  augmented  fitting  function  basis  sets 
once  again  demonstrates  the  robustness  of  the  fit¬ 
ting  function  algorithms  being  used  here,  i.e.,  any 
reasonable  extension  of  the  basis  set  beyond  the 
s-type-only  approximation  is  sufficient  to  achieve  a 
well-converged  result  for  the  cohesive  energy.  The 
cohesive  energies  obtained  with  the  augmented 


fitting  function  basis  sets  differ  from  the  FP-LMTO 
value  by  only  0.04  eV  (<  1%),  an  excellent  agree¬ 
ment  for  two  completely  different  electronic  struc¬ 
ture  techniques.  All  the  theoretical  results  exhibit 
the  usual  LDA  overbinding. 

The  parameter  in  Table  III  that  best  illustrates 
the  overall  inadequacy  of  the  s-type-only  approxi¬ 
mation  is  the  band  gap.  The  s-type-only  basis  set 
produces  a  band  gap  (0.76  eV)  that  is  roughly  0.25 
eV  larger  than  the  band  gaps  obtained  with  the 
two  richer  fitting  function  basis  sets  (0.51  and  0.52 
eV)  or  with  other  electronic  structure  methods, 
0.49  eV  [15]  ”>  0.52  eV  [22].  This  sensitivity  of  the 
band  gap  to  the  richness  of  the  fitting  function 
basis  is  not  surprising  given  the  variational  nature 
of  the  charge  fit  procedure.  The  one-electron  eigen¬ 
values  for  the  unoccupied  states  and  the  band  gap 
have  no  variational  significance  during  an  elec¬ 
tronic  structure  calculation  and,  thus,  will  be 
"sacrificed''  to  ensure  that  the  occupied  one-elec¬ 
tron  eigenvalues  are  well  represented.  As  was  true 
for  the  cohesive  energy,  either  augmentation 
scheme  provides  enough  additional  flexibility  in 
the  fitting  function  basis  set  to  allow  a  precise 
estimate  of  the  LDA  band  gap,  which  is,  as  usual 
[22],  much  smaller  than  the  experimental  gap  of 
1.17  eV  [20]. 


Conclusions 

It  has  been  demonstrated  here  that  the 
LCGTO-FF  technique  is  an  exceptionally  robust 
electronic  structure  method  for  determining  the 
ground-state  properties  of  crystals  within  the  con¬ 
text  of  the  LDA.  Even  the  crudest  fitting  function 
basis  set  used  here,  the  site-centered  s-type-only 
basis  set,  produces  excellent  results  for  the  lattice 
constant  and  bulk  modulus  of  Si  and  only  misses 
the  converged  cohesive  energy  by  0.12  eV/atom. 
This  is  a  remarkable  result  given  the  fact  that 
crystalline  Si  should  be  one  of  the  more  difficult 
systems  to  describe  with  GTO  fitting  functions, 
due  to  its  highly  directional  tetrahedral  bonding. 

The  LCGTO-FF  method  has  a  substantially 
more  difficult  time  describing  the  band  gap  of 
crystalline  Si,  since  the  gap  is  not  a  ground-state 
property.  For  this  reason,  the  site-centered  s-type- 
only  basis  set  proved  inadequate  for  determining 
the  band  gap  and  should  not  be  used  for  any 
application  in  which  the  conduction  states  of  a 
tetrahedral  semiconductor  must  be  described  accu- 
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rately.  The  band  gap  can  be  determined  accurately 
if  the  site-centered  s-type  basis  set  is  augmented 
with  either  site-centered  /-type  GTOs  or  bond- 
centered  s-type  GTOs. 

The  results  obtained  for  crystalline  Si  with  the 
two  types  of  augmented  fitting  function  basis  sets 
are  nearly  indistinguishable.  This  result  suggests 
that  LCGTO-FF  cluster  calculations  using  bond- 
centered  s-type  fitting  functions  should  be  fully 
consistent  with  LCGTO-FF  crystalline  calculations 
using  site-centered  /-t5rpe  fitting  functions,  so  long 
as  there  are  no  significant  algorithmic  differences 
between  the  codes  used  for  the  two  calculations. 
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ABSTRACT _ _ _ 

Reaction  energetics  of  the  highly  strained  tetrahedrane  molecule,  as  well  as  some  smaller 
hydrocarbons,  were  computed  at  the  ab  initio  MP4,  QCISD  (T),  Gl,  and  G2  levels  and 
also  by  several  density  functional  (DF)  approaches:  VWN(LDA),  PW86,  BLYP,  and 
B3LYP.  A  variety  of  basis  sets  were  used  in  the  DF  calculations.  For  atomization 
processes,  nonlocal  DF  procedures  are  superior  to  MP4  and  QCI.  For  the  other  reactions, 
B3LYP  is  the  most  reliable  DF  approach  and  is  overall  competitive  with  MP4  and  QCISD 
(T),  although  the  MP4/6-311G(2rf/,  p)  results  are  generally  the  closest  to  the  experimental 
and/or  the  G2  values.  There  is  no  consistent  correlation  between  the  accuracy  of  the  DF 
results  and  the  size  of  the  basis  set.  ©  1996  John  Wiley  &  Sons,  Inc. 
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Introduction 

While  a  considerable  amount  of  evidence  al¬ 
ready  attests  to  the  effectiveness  of  nonlo¬ 
cal  density  functional  methods  for  computing 
molecular  properties  and  reaction  energetics  [1- 
26],  there  remains  a  continuing  need  to  evaluate 
various  combinations  of  exchange  and  correlation 
functionals  and  basis  sets.  In  the  present  work,  we 
extended  such  comparative  investigations  to  the 
highly  strained  molecule  tetrahedrane,  1: 


H 


This  compound  has  not  yet  been  isolated  [27], 
although  it  has  been  claimed  to  be  an  intermediate 
in  several  reactions  [28].  Estimates  of  strain  energy 
[27]  and  a  quantitative  measure  of  bond  bending 
[29]  both  confirm  its  extremely  strained  nature. 
Our  present  objective  was  to  compare  the  results 
of  various  ab  initio  and  density  functional  treat¬ 
ments  of  this  molecule  and  its  reaction  energetics, 
as  well  as  those  of  some  smaller  hydrocarbons. 


Methods 

We  computed  the  geometries  and  energies  of 
tetrahedrane  and  some  other  hydrocarbons  with 
several  ab  initio  and  density  fimctional  (DF)  meth¬ 
ods.  The  former  included  Hartree~Fock  (HF), 
Moller-Plesset  perturbation  theory  (MP),  and 
quadratic  configuration  interaction  (QCI).  The  den¬ 
sity  functional  procedures  involved  the  Becke  (B) 
[30]  and  Becke  three-parameter-hybrid  (B3)  [31] 
exchange  functionals  and  the  Perdew-86  (P86)  [32] 


TABLE  I _ 

Ab  initio  energies  and  geometries  for  tetrahedrane. 


Method 

Energy 

(Hartrees) 

Bond  lengths  (A) 

C  — C  C  — H 

HF/6-31G(d) 

-153.59789 

1.463 

1.062 

HF/6-31G(d,p) 

-153.60704 

1.463 

1.062 

HF/6-311G(d,p) 

-153.63451 

1.465 

1.062 

MP2/6-31G(d,p) 

-154.14019 

1.478 

1.067 

MP2  =  Full/ 6-31  G(d,p) 

-154.16113 

1.476 

1.066 

MP2/ 6-311  G(d,p) 

-154.19211 

1.484 

1.072 

MP2  =  Full/ 6-31 1G(d,p) 

-154.26843 

1.482 

1.071 

MP2/6-311G(2d^p) 

-154.27430 

1.476 

1.070 

HF  /  6-31 1  G(d,  p)  //  MP2  /  6-31 1 G (d,  p) 

-153.63339 

1.484 

1.072 

HF /  6-31 1  +G(d.  p)  //  MP2  /  6-31 1  G(d,  p) 

-153.63499 

1.484 

1.072 

HF /  6-31 1  G(2d/,  p)  //  MP2  /  6-31 1  G(d,  p) 

-153.64308 

1.484 

1.072 

HF/6-31H-G(3d/,  2p)  // MP2/6-31 1G(d,  p) 

-153.64958 

1.484 

1.072 

MP2  /  6-31 1  -1-  G(d,  p)  //  MP2  /  6-31 1  G(d,  p) 

-154.19584 

1.484 

1.072 

MP3  /  6-31 1  G(d,  p)  //  MP2  /  6-31 1  G(d,  p) 

-154.21238 

1.484 

1.072 

MP3  /  6-31 1  -l-G(d,  p)  //  MP2  /  6-31 1  G(d,  p) 

-154.21543 

1.484 

1.072 

QCISD(T)  / 6-31 1  G(d,  p)  //  MP2  /  6-31 1  G(d,  p) 

-154.24319 

1.484 

1.072 

MP4  /  6-31 1  G(d,  p)  //  MP2  / 6-31 1  G(d,  p) 

-154.24359 

1.484 

1.072 

MP4  /  6-31 1  -I-  G(d,  p)  //  MP2  /  6-31 1  G(d,  p) 

-154.24719 

1.484 

1.072 

MP2  /  6-31 1  G(2df,  p)  //  MP2  /  6-31 1  G(d,  p) 

-154.27418 

1.484 

1.072 

MP3  /  6-31 1  G(2df,  p)  //  MP2  /  6-31 1  G(d,  p) 

-154.29230 

1.484 

1.072 

MP2  /  6-31 1  -FG(3df,  2p)  //  MP2  /  6-31 1  G(d,  p) 

-154.29308 

1.484 

1.072 

MP4  /  6-31 1  G(2df,  p)  //  MP2  /  6-31 1  G(d,  p) 

-154.32804 

1.484 

1.072 

G1  //MP2/6-311G(d,p) 

-154.33482 

1.484 

1.072 

G2//MP2/6-311G(d,p) 

-154.33858 

1.484 

1.072 
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and  Lee,  Yang,  and  Parr  (LYP)  [33]  correlation 
functionals.  All  this  work  was  carried  out  with  the 
Gaussian  92/DFT  and  Gaussian  94  pro^ams  [34, 
35],  A  variety  of  basis  sets,  some  mcVudirvg  a  con¬ 
siderable  number  of  polarization  terms,  were  used; 
these  are  enumerated  in  Tables  I  and  II. 

We  also  used  the  code  deMon  [36,  37]  for  a  DP 
calculation  with  the  Perdew-Wang  (PW  86)  ex- 
change~correlation  functional  [32,  38,  39]  and,  for 
completeness,  a  local  density  approximation  (LD A) 
treatment  with  the  Vosko-Wilk-Nusair  (VWN) 
correlation  functional  [40].  The  basis  set  for  these 
deMon  studies  was  the  DZVP2  [41],  which  is  simi¬ 
lar  to  the  Gaussian  6-31G((i,  p).  In  addition,  auxil¬ 
iary  basis  sets  containing  s-,  p-,  and  d-type  func¬ 
tions  were  used  to  fit  the  density  and  the  ex¬ 
change/correlation  potential  and  energies  [41]. 

Since  no  experimental  data  are  available  for 
tetrahedrane,  we  compared  our  results  to  values 
obtained  by  the  ab  initio  Gaussian-1  (Gl)  and 
Gaussian-2  (G2)  procedures,  which  were  designed 
to  achieve  a  very  high  level  of  accuracy  [42~44]. 
For  the  latter,  we  used  MP2/6-311G(d,  p)-opti- 


mized  geometries  instead  of  the  prescribed  MP2/ 
6-31G(d)  and  improved  the  energies  through  sub¬ 
sequent  MP4/6-311  +  G(3d/,2p)  and  QCKIXT)/ 
+  calculations  plus  empirical  cor¬ 

rections.  These  results  are  generally  expected  to  be 
accurate  to  within  <  2  kcal/ mol  [42—44],  al¬ 
though  larger  errors  have  been  reported  in  certain 
instances  [45,  46]. 


Results  and  Discussion 

TETRAHEDRANE  ENERGIES 

AND  GEOMETRIES 

The  total  energies  and  optimized  bond  lengths 
obtained  for  tetrahedrane  by  the  various  ap¬ 
proaches  are  given  in  Tables  1  and  II.  The  ab  initio 
results  include  a  series  of  single-point  calculations 
at  various  levels  using  a  fixed  geometry,  the  MP2/ 
6-311G(d,  p). 

A  striking  feature  of  the  computed  bond  lengths 
is  their  uniformity.  If  the  HF  (uncorrelated)  values 


TABLE  II  _ _ _ ^ 

Density  functional  energies  and  geometries 


for  tetrahedrane. 


Method 


VWN  /  DZVP2(LDA) 

BLYP/6-31G(d) 

BLYP/6-31G(d,p) 

BLYP/6-311G(d,p) 

BLYP/6-311+G(d,p) 

BLYP/6-311G(2df,  p) 

BLYP/ 6-311  +G{2df,p) 

BLYP/ 6-311  +G{2df,2pd) 
BLYP  /  6-3 1 1  -t- G(3  df ,  2  p) 

BLYP  /  6-31 1  -t-G(3df,2pd) 
BLYP/6-311  -t-l-G(3df,3pd) 
B3LYP/6-31G(d) 
B3LYP/6-31G(d,p) 
B3LYP/6-311G(d,p) 

B3LYP/ 6-311  +G{d,p) 
B3LYP/6-31lG{2df,p) 
B3LYP/ 6-311  -t-G(2df,  p) 
B3LYP/6-311  -l-G{2df,  2pd) 
B3LYP/6-311  -1-G(3df,  2p) 
B3LYP  /  6-31 1  -l- G(3df ,  2pd) 
B3LYP  /  6-31 1  -I-  H-  G(3df,  3pd) 
BP86  /  6-31 1  -I- -l-G(3df,  3pd) 
PW86/DZVP2 

B3P86  /  6-31  ++G{3df,  3pd) 


Bond  lengths  (A) 


Energy 

(Hartrees) 

c— c 

C  — H 

-153.24990 

1.485 

1.082 

- 1 54.55969 

1.493 

1.080 

- 1 54.56745 

1.493 

1.078 

-154.60545 

1.492 

1.076 

-154.60825 

1.492 

1.076 

-154.61283 

1.486 

1.075 

-154.61575 

1.487 

1.076 

-154.61881 

1.487 

1.074 

-154.61989 

1.487 

1.074 

-154.62061 

1.487 

1.074 

-154.62107 

1.487 

1.074 

-154.63669 

1.479 

1.073 

-154.64466 

1.479 

1.072 

- 1 54.67888 

1.479 

1.070 

-154.68109 

1.475 

1.068 

-154.68697 

1.474 

1.069 

-154.68929 

1.474 

1.069 

-154.69224 

1.474 

1.068 

-154.69350 

1.474 

1.068 

-154.69411 

1.474 

1.068 

-154.69447 

1.475 

1.068 

-154.69672 

1.484 

1.077 

-154.87135 

1.494 

1.079 

-155.20317 

1.470 

1.068 
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are  excluded,  the  range  in  the  C — C  distances  is 
only  1.470-1.494  A,  and  for  the  C — H,  1.068-1.079 
A.  The  nature  of  the  theoretical  approach  is  more 
significant  than  is  the  particular  basis  set  being 
used;  e.g.,  the  C— C  bond  lengths  obtained  at  the 
BLYP  level  are  all  within  a  range  of  only  0.007  A, 
whereas  replacing  the  B  exchange  functional  by 
the  B3  typically  decreases  the  C — C  distance  by 
about  0.013  A. 

The  B3  exchange  functional  also  has  a  very 
consistent  effect  upon  the  energies  when  coupled 
with  LYP  correlation.  For  any  particular  basis  set, 
the  B3LYP  total  energy  is  about  0.074  Hartree 


more  negative  than  is  the  BLYP.  However,  with 
the  P86  correlation  functional,  replacing  the  B  ex¬ 
change  by  B3  lowers  the  energy  by  fully  0.5 
Hartree. 

The  effect  of  the  basis  set  is  brought  out  particu¬ 
larly  clearly  by  the  various  ab  initio  single-point 
calculations  with  the  same  MP2/6-311G(d,  p)  ge¬ 
ometry  (Table  I).  For  example,  in  two  instances, 
MP2  energies  are  lower  than  those  of  MP4,  be¬ 
cause  the  former  basis  sets  have  more  polarization 
functions.  The  significance  of  these  in  describing 
this  highly  strained  molecule  with  its  markedly 
bent  bonds  [29]  is  demonstrated  by  the  improve- 


TABLE  IV _ ^ - — 

Calculated  atomization  energies  (kcal  /  mol). 


Method 


H2  CH 


CH4  C2H2  C2H4  C2H6  Average  error 


Ab  initio 


MP4  /  6-311  G(d,p)® 

99.6 

MP4  /  6-311  +G(d,p)^ 

99.6 

MP4  /  6-31 1G{2df,  p)® 

99.6 

QCISD(T)/6-311G(d,p)® 

99.9 

G1'" 

103.5 

G2'= 

104.4 

Density  functional 

VWN  /  DZVP2{LDA)‘‘ 

109.3 

PW86  /  DZVP2'^ 

108.7 

BLYP/ 6-31  G(d) 

103.1 

BLYP/ 6-31  G(d,p) 

104.8 

BLYP/ 6-31 1G{d,  p) 

102.9 

BLYP  /  6-31 1  -t-G(d,p) 

102.9 

BLYP/ 6-31 1G(2d/,p) 

102.9 

BLYP  /  6-31  H-G{2df,  p) 

102.9 

BLYP  /  6-31 1  -t-G(2df,  2pd) 

103.2 

BLYP  /  6-31 1  -l-G(3d/,2p) 

103.2 

BLYP/ 6-311  +G{3df,2pd) 

103.2 

BLYP  /  6-31 1  -1- -l-G(3df,  3pd) 

103.0 

B3LYP/6-31G(d) 

103.5 

B3LYP/6-31G(d,p) 

105.4 

B3LYP/6-311G(d,p) 

103.7 

B3LYP/ 6-311  +G{d,p) 

103.7 

B3LYP/6-311G(2df.p) 

103.7 

B3LYP/6-311  -l-G(2df,p) 

103.7 

B3LYP/6-311  -l-G(2df,  2pd) 

103.9 

B3LYP  /  6-31 H- G(3df,  2p) 

103.9 

B3LYP/6-311  +G(3d/,  2pd) 

103.9 

B3LYP  /  6-31 1  -1- G(3d/,  3pd) 

103.8 

Experiment® 

103.3 

^Calculated  using  energies  from  [42]. 
'[42]. 

^[44]. 

'[11]. 

®  Calculated  from  data  in  Table  VIII. 


72.6 

73.1 

75.2 

73.3 
80.0 

80.5 

88.4 

84.6 

79.6 

80.6 
80.1 
80.6 
80.8 
81.2 
81.6 
81.6 
81.7 
81.7 

79.2 

80.2 
79.9 

80.4 
80.6 
81.0 

81.4 

81.4 
81.6 

81.5 
80.0 


375.8 

375.4 

381.5 

374.8 
391.0 

393.2 

437.7 

402.8 

389.5 
393.0 

388.3 

387.4 

388.9 
388.0 

389.5 
389.0 

389.5 
389.0 

393.1 
396.7 

391.9 

391.2 

392.6 

391.9 

393.4 

392.9 

393.4 
393.0 

392.5 


367.4 

367.4 

379.2 

364.2 
387.0 

387.2 

437.2 

394.6 

382.4 

384.9 

386.1 

385.4 

388.2 

387.3 

388.4 

388.4 

388.6 

388.4 
380.0 

382.5 

383.7 

383.3 
386.0 

385.5 

386.5 

386.4 

385.6 

386.5 

388.9 


505.5 

505.6 

517.5 

503.6 

530.1 

531.7 

600.5 

545.9 

527.3 

531.3 

526.9 

525.7 

528.9 

527.8 

529.4 

528.9 

529.6 

529.2 

528.9 

532.9 

528.7 
528.0 

530.8 

530.2 

531.7 

531.3 

531.9 

531.6 

531.9 


637.5 

636.7 

648.3 

635.3 

663.5 

666.6 

752.5 

684.3 
660.2 

665.4 
657.0 
655.0 

658.4 

656.5 

658.7 

657.8 

658.7 

658.1 

666.3 

671.6 

663.3 

661.8 

664.7 

663.4 

665.5 

664.7 

665.5 

665.1 

666.2 


17.4 

17.5 
10.3 

18.6 

1.4 
0.8 

43.8 

9.7 

3.4 

1.4 

3.6 

4.5 

2.7 

3.6 

2.6 
2.9 
2.5 

2.8 
2.3 

3.2 
2.1 

2.7 
1.1 

1.7 
1.1 

1.2 
1.0 
1.1 
0.0 
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ment  observed  in  going  from  MP2/6-311G(rf,  p)  to 
MP2/6-31ia2rf/,  p)//MP2/6-311G(d,  p).  The  ad¬ 
ditional  d  and  /  functions  on  the  carbons  lower 
the  energy  by  0.08207  Hartree  (51.5  kcal/mol),  a 
quantity  of  chemical  significance. 

REACTION  ENERGETICS 

Some  hypothetical  reactions  involving  tetrahe- 
drane  are  shown  as  Eqs.  (l)-(5): 

C4H4  4C  5-  4H  (1) 

4CH  ^  C4H4  (2) 


2C2H2  C4H4  (3) 

6C2H6  C4H4  +  8CH4  (4) 

2C2H4  ^  C4H4  +  2H2.  (5) 

In  Table  III  are  listed  computed  energies  for  the 
reactants  and  products  in  Eqs.  (l)-(5).  The  ab  initio 
values  are  taken  from  elsewhere  [42,  44],  except 
for  tetrahedrane,  and  are  based  on  MP2  =  FULL/ 
6-31G(d)  geometries;  the  tetrahedrane  energies  are 
from  Table  I.  The  density  functional  values  are 
from  the  present  work,  except  as  indicated.  The 
zero-point  energies  [ZPE(G)]  were  obtained  in  the 
course  of  the  G1  and  G2  procedures  [42,  44]  and 


TABLE  V _ _ _ 

Calculated  A£  values  (kcal/mol)  for  reactions  shown  in  Eqs.  (6)-(10). 


Method 

Eq.  (6) 

Eq.  (7) 

Ab  initio 

MP4  /  6-31 1G(d.p)^ 

222.1 

-41,9 

MP4  /  6-31 1  -hG(d,  p)^ 

221.2 

-41.2 

MP4/6-311G(2df,  p)® 

228.8 

-39.8 

QCISD(T)/6-311G(d,p)® 

217.6 

-42.3 

G1® 

226.9 

-39.4 

G2‘’ 

226.3 

-39.8 

Density  functional 

VWN  /  DZVP2(LDA)'’ 

260.5 

-69.6 

PW86/DZVP2'’ 

225.5 

-47,4 

BLYP/6-31G(d) 

223.2 

-40.1 

BLYP/6-31G{d,p) 

223.7 

-39.3 

BLYP/6-311G(d.p) 

225.9 

-31.6 

BLYP/6-311  -FG(d,  p) 

224.2 

-30.2 

BLYP/6-311G(2d/,  p) 

226.6 

-31.2 

BLYP/6-311  +G(2dL  p) 

224.9 

-29.9 

BLYP/ 6-31 1  +G{2df,  2pd) 

225.1 

-29.5 

BLYP/6-311-t-G(3d/‘.2p) 

225.2 

-29.1 

BLYP/ 6-31 1-i-G{3df,  2pd) 

225.1 

-29.4 

BLYP/6-311-H-l-G(3d/’,3pd) 

225.0 

-29.7 

B3LYP/6-31G(d) 

221.7 

-46.5 

B3LYP/6-31G(d,p) 

222.1 

-45.6 

B3LYP/6-311G(d,p) 

223.9 

-38.4 

B3LYP/6-311-(-G{d,p) 

222.5 

-37.2 

B3LYP/6-311G(2df,p) 

224.7 

-37.8 

B3LYP/6-31H-G(2d/,p) 

223.4 

-36.8 

B3LYP/6-311-t-G(2df,  2pd) 

223.6 

-36.4 

B3LYP/6-31H-G(3d/,  2p) 

223.6 

-36.1 

B3LYP/6-311  -fG(3dL  2pd) 

223.5 

-36.4 

B3LYP  /  6-31 1-1- -1- G(3df,  3pd) 

223.4 

-36.7 

Experiment'’ 

228.9 

-40.0 

^Calculated  using  energies  from  [42]. 
‘’Calculated  using  energies  from  [44], 
‘’Calculated  using  energies  from  [11], 
“  Calculated  from  data  in  Table  VIII. 


Eq.  (8) 

Eq.  (9) 

Eq.  (10) 

Average 

Error 

-6.3 

-38.6 

-32.4 

2.7 

-7.0 

-38.6 

-31.6 

2.5 

-7.4 

-38.7 

-31.3 

0.6 

-7.8 

-39.5 

-31.7 

3.1 

-9.7 

-39.5 

-29.8 

1.0 

-9.6 

-40.1 

-30.5 

0.9 

-11.3 

-54.0 

-42.7 

17.9 

-12.8 

-42.5 

-29,7 

3.8 

-12.1 

-41.8 

-29.8 

2.5 

-12.3 

-41.6 

-29,3 

2.6 

-10.6 

-37.8 

-27.2 

3.8 

-11.1 

-37.4 

-26.3 

4.8 

-11.1 

-37.7 

-26.6 

3.9 

-11.7 

-37.5 

-25.8 

4.9 

-11.7 

-37.8 

-26.1 

4.8 

-11.7 

-37.4 

-25.6 

5.1 

-11.9 

-37.8 

-25.9 

4.9 

-11.9 

-37.8 

-25.9 

4.9 

-11.5 

-45.4 

-33.9 

5.0 

-11.8 

-45.0 

-33.2 

4.6 

-10.3 

-41.3 

-30.9 

2.1 

-10.9 

-41,0 

-30.1 

2.7 

-10.8 

-41.1 

-30.3 

2.1 

-11.5 

-41.0 

-29.5 

2.8 

-11.5 

-41.3 

-29.8 

2.9 

-11.5 

-41.0 

-29.4 

2.9 

-11.6 

-41.3 

-29.7 

2.9 

-11.6 

-41.3 

-29.6 

2.9 

-8.7 

-39.8 

-31,0 

0.0 
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were  used  in  computing  the  MP4  and  QCI  reaction 
energetics;  the  ZPE(DF)  were  calculated  at  the 
B3LYP/6-311  +  +G(3df,3pd)  level  and  were  used 
for  all  DF  reaction  energetics. 

As  a  test  of  the  data  in  Table  III,  Table  IV 
presents  the  computed  atomization  energies  of  the 
molecules  in  Eqs.  (l)-(5).  Experimentally  deter¬ 
mined  values  are  also  included.  As  anticipated,  the 
largest  average  error  is  for  the  VWN/DZVP2  re¬ 
sults;  the  local  density  approximation  is  known  to 
be  very  poor  for  calculating  bond-breaking  ener¬ 
gies  [47-49].  However,  Table  IV  shows  a  clear 
superiority  of  the  nonlocal  DF  procedures,  espe¬ 
cially  the  BLYP  and  B3LYP,  over  the  MP4  and 
QCI,  which  are  consistently  too  low.  Indeed,  the 
B3LYP  approach  is  able  to  achieve  the  same  level 
of  accuracy  as  do  the  G1  and  G2,  provided  that  / 
polarization  functions  are  included  in  the  basis  set. 
It  is  interesting  to  note  that  diffuse  functions  have 


a  detrimental  effect  upon  the  BLYP  and  B3LYP 
results,  possibly  reflecting  a  basis-set  superposi¬ 
tion  error. 

As  further  tests,  we  used  the  data  in  Table  III  to 
compute  A  E  for  each  of  the  reactions  shown  as 
Eqs.  (6)-(10).  The  results  are  in  Table  V; 


C2H2  ^  2CH  (6) 

C2H2  +  4CH4  ^  3C2H6  (7) 

C2H2  +  C2H6  2C2H4  (8) 

C2H2  +  H2  C2H4  (9) 

C2H4  +  H2^C2H,.  (10) 


The  disparity  between  the  accuracies  of  the  ab 
initio  and  DF  procedures  that  was  observed  for  the 
atomization  energies  (Table  IV)  is  not  in  evidence 
in  reactions  (6)-(10).  Instead,  the  poorest  overall 


TABLE  VI 


Calculated  A£  values  (kcal/mol)  of  reactions  involving  tetrahedrane,  Eqs.  (1)  (5). 


Method 


Eq.  (1) 


Eq.  (2) 


Eq.  (3) 


Eq.  (4) 


Eq.  (5) 


Ab  initio 

MP4  /  6-311  G(d,p) 

MP4  /  6-31 1-HG(d,p) 
MP4/6-311G(2df,p) 
QCISD(T)/6-311G(d,p) 

G1 

G2 

Density  functional 
VWN  /  DZVP2(SDA) 

PW86  /  DZVP2 
BLYP/ 6-31  G(d) 

BLYP/ 6-31  G(d,p) 

BLYP/ 6-31 1G(d,p) 

BLYP/ 6-311  4- G(d,p) 
BLYP/6-311G(2dYp) 

BLYP  /  6-31 1  -HG(2df,p) 

BLYP/ 6-311  4-G(2df,2pd) 
BLYP  /  6-31 1  -HG(3df,2p) 

BLYP  /  6-31 1  -^G(3df,2pd) 
BLYP  /  6-311  4-  + G(3df,3pd) 
B3LYP/6-31G(d) 
B3LYP/6-31G(d,p) 
B3LYP/6-311G(d,p) 
B3LYP/6-311  -FG(d,p) 
B3LYP/6-311G(2df,p) 
B3LYP/6-311  4-G(2dYp) 
B3LYP/ 6-311  -^G(2df,2pd) 
B3LYP/6-311  -HG(3df,2p) 
B3LYP/6-311  -HG(3df,2pd) 
B3LYP/6-311  +  +  G(3d/,3pd) 


708.7 

-418.2 

708.7 

-416.2 

735.2 

-434.3 

702.4 

-409.1 

750.7 

-430.5 

753.8 

-431.9 

896.7 

-543.2 

782.3 

-444.1 

746.9 

-428.4 

751.8 

-429.4 

743.7 

-423.3 

740.9 

-418.6 

747.8 

-424.6 

745.2 

-420.3 

747.1 

-420.6 

747.7 

-421.3 

748.1 

-421.1 

748.0 

-421.1 

747.4 

-430.8 

752.4 

-431.6 

744.7 

-425.1 

742.9 

-421.3 

749.4 

-426.8 

747.6 

-423.5 

749.5 

-423.7 

750.2 

-424.5 

750.6 

-424.3 

750.5 

-424.4 

26.1 

26.2 

23.2 

26.1 

23.4 
20.6 

-22.3 

6.9 

17.9 

17.9 
28.6 

29.9 
28.6 

29.5 

29.7 
29.1 
29.1 

28.8 

12.7 

12.6 

22.7 

23.7 

22.7 

23.4 

23.5 
22.7 
22.7 

22.5 


109.9 
108.6 

102.7 
110.6 
102.1 
100.3 

116.9 

101.7 

98.1 

96.6 

91.9 

90.3 

90.9 

89.3 

88.6 

87.2 

87.8 

88.2 

105.7 

103.8 

99.4 
98.2 

98.2 
97.0 

96.3 

94.9 

95.4 

95.9 


103.3 

103.4 
100.6 
105.0 

102.4 
100.8 

85.7 

91.9 

101.5 
101.2 

104.2 

104.7 
104.0 

104.5 

105.3 

103.8 

104.6 

104.4 

103.6 

102.7 

105.2 

105.8 

104.9 

105.4 
106.1 
104.6 

105.3 
105.0 
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results  are  now  shown  by  the  BLYP  method  for 
reaction  (7)  and,  to  a  lesser  extent,  reaction  (10).  It 
is  interesting  that  the  BLYP  values  actually  be¬ 
come  worse  as  the  basis  set  is  expanded.  The 
B3LYP  calculations,  on  the  other  hand,  are  compa¬ 
rable  in  accuracy  to  the  MP4  and  QCISDfT),  except 
for  the  smallest  basis  sets.  Again,  as  for  the  atom¬ 
ization  energies,  the  inclusion  of  diffuse  functions 
gives  poorer  energies  by  both  the  BLYP  and  B3LYP 
approaches. 

Proceeding  now  to  the  reactions  actually  involv¬ 
ing  tetrahedrane  [Eqs.  (l)-(5)].  Table  VI  lists  the 
A£  values  obtained  using  the  data  in  Table  III.  For 
the  reaction  in  Eq.  (1),  the  atomization  of  tetrahe¬ 
drane,  the  MP4  and  QCI  results  appear  to  be  too 
low,  as  would  be  expected  on  the  basis  of  Table 
IV.  The  BLYP  and  B3LYP  values  approach  much 
more  closely  the  G1  and  G2.  For  the  other  reac¬ 
tions,  the  B3LYP  results  tend  to  be  superior  to  the 


BLYP  and  are  overall  comparable  in  accuracy  to 
the  ab  initio  (if  we  take  the  G2  as  our  standard); 
however,  the  MP4/6-311G(2rf/,  p)  are  clearly  the 
best,  as  they  were  also  for  the  reactions  in  Eqs. 
(6)-(10).  Except  for  the  VWN/DZVP2(LDA),  the 
largest  errors  were  obtained  with  the  QCISD(T) 
and  the  PW86/DZVP2  procedures. 

A  point  of  interest  concerning  Eq.  (4)  is  that  it 
depicts  an  “isodesmic"  reaction,  i.e.,  one  in  which 
the  number  of  bonds  of  each  formal  type  remains 
the  same.  A£  for  such  a  reaction  is  sometimes 
interpreted  as  reflecting  any  deviations  from  addi¬ 
tivity  of  bond  energies  and,  hence,  is  used  as  a 
measure  of  anomalous  energetic  effects,  such  as 
strain  [50].  By  this  reasoning,  the  AE  values  listed 
in  Table  VI  for  Eq.  (4)  indicate  that  tetrahedrane 
has  a  strain  energy  of  approximately  100  kcal/mol. 
On  the  other  hand,  other  approaches  to  estimating 
strain  energy,  also  involving  G2  calculations,  have 


TABLE  VII  _ _ 

Tetrahedrane  enthalpies  of  formation  at  0  K  (kcal  /  mol)  computed  from  Eqs.  (1)-(5). 

Method _  Eq.  (1)  Eq.  (2)  Eq.  (3) 


Ab  initio 

MP4  /  6-311  G(d,p) 

MP4  /  6-31 1+G{d,p) 

MP4  /  6-31 1G{2d^p) 
QCISD(T)/ 6-311  G(d,p) 

G1 

G2 

Density  functional 
VWN  /  DZVP2(SDA) 
PW86/DZVP2 
BLYP/ 6-31  G(d) 

BLYP  /  6-31  G(d,p) 

BLYP/ 6-31 1G(d,p) 
BLYP/6-31H-G(c/.p) 

BLYP/ 6-31 1G(2d/,  p) 

BLYP  /  6-31 1+G(2dAp) 

BLYP/6-311-hG(2d/,2pd) 

BLYP  /  6-31 1-hG(3df,2p) 

BLYP  /  6-31 1-HG(3c/^2pd) 

BLYP  /  6-31 1-h-hG(3cy/,3pd) 

B3LYP/6-31G(d) 

B3LYP/6-31G(cy,p) 

B3LYP/6-311G(d,p) 

B3LYP/6-311-FG(d,p) 

B3LYP/6-311G(2df,  p) 

B3LYP/6-31H-G(2d/,p) 

B3LYP/6-311-fG(2df,2pc/) 

B3LYP/6-31H-G(3c/^2p) 

B3LYP/6-311-^G(3d^2pc/) 

B3LYP  /  6-3 1 1  +  +  G  (3  d/,  3pd) 


177.7 

148.2 

177.7 

150.2 

151.2 

132.1 

184.0 

157.3 

135.7 

135.9 

132.6 

134.5 

-10.3 

23.2 

104.1 

122.3 

139.5 

138.0 

134.6 

137.0 

142.7 

143.1 

145.5 

147.8 

138.6 

141.8 

141.2 

146.1 

139.3 

145.8 

138.7 

145.1 

138.3 

145.3 

138.4 

145.3 

139.0 

135.6 

134.0 

134.8 

141.7 

141.3 

143.5 

145.1 

137.0 

139.6 

138.8 

142.9 

136.9 

142.7 

136.2 

141.9 

135.8 

142.1 

135.9 

142.0 

134.7 

138.5 

134.8 

137.2 

131.8 

131.3 

134.7 

139.2 

132.0 

130.7 

129.2 

128.9 

86.3 

145.5 

115.5 

130.3 

126.5 

126.7 

126.5 

125.2 

137.2 

120,5 

138.5 

118.9 

137.2 

119.5 

138.1 

117.9 

138.3 

117.2 

137.7 

115.8 

137.7 

116.4 

137.4 

116.8 

121.3 

134.3 

121.2 

132.4 

131.3 

128.0 

132.3 

126.8 

131.3 

126.8 

132.0 

125.6 

132.1 

124.9 

131.3 

123.5 

131.3 

124.0 

131.1 

124.5 

Eq.  (5) 


132.3 

132.4 

129.6 
134.0 

131.4 

129.7 

114.7 
120.9 

130.5 

130.2 

133.2 

133.7 
133.0 

133.5 

134.3 

132.8 

133.6 

133.4 

132.5 

131.7 
134.2 

134.7 

133.8 

134.4 

135.1 

133.5 

134.2 
134.0 
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TABLE  VIII 


VIII  - - - 

Experimental  enthalpies  of  formation  at 

0  K  (kcal/mol).® 

System 

K 

H  (g) 

51.626 

Haig) 

0 

C  (g) 

169.98 

CH  (g) 

141.6 

CH4  (g) 

-15.970 

C2H2  (g) 

54.3 

C2H4  (g) 

14.515 

CaHe  (g) 

-16.523 

®  Except  for  C2H2,  these  data  are  from  [52].  The  value  for 
C2H2  was  calculated  using  its  experimental  atomization 
energy  as  given  in  [42]. 


recently  yielded  values  greater  than  130  kcal/ mol 
[51].  Indeed,  Maier  noted  that  strain  energies  from 
97  to  150  kcal /mol  have  been  attributed  to  tetrahe- 
drane  [27].  This  simply  illustrates  the  difficulty  in 
trying  to  describe  quantitatively  a  property  for 
which  there  is  no  rigorous  definition. 

ENTHALPY  OF  FORMATION 

In  principle,  the  enthalpy  of  formation  at  0  K  of 
tetrahedrane  can  be  determined  from  the  AE  for 
any  of  the  reactions  in  Eqs.  (l)-(5)  if  AHj  q  is 
known  for  each  of  the  other  reactants  and  prod¬ 
ucts.  The  results  obtained  in  this  manner,  using 
the  experimental  data  in  Table  VIII,  are  given  in 
Table  VII.  The  G1  and  G2  procedures  yield  aver¬ 
age  values  of  133.1  and  131.0  kcal/mol,  respec¬ 
tively,  over  ranges  of  less  than  6  kcal/ mol.  (These 
are  in  reasonable  agreement  with  other  calculated 
values  [51,  53,  54].)  For  all  our  other  computational 
approaches,  the  heats  of  formation  predicted  from 
Eqs.  (l)-(5)  cover  much  larger  ranges,  although 
the  MP4/6-311G(2d/,  p)  values  are  very  consistent 
if  that  for  Eq.  (1)  is  excluded.  (The  latter  is  based 
upon  the  atomization  process,  and  we  have  al¬ 
ready  pointed  out  that  the  MP4  and  QCI  methods 
give  relatively  poor  atomization  energies.)  Thus, 
even  though  the  B3LYP/6-311  +  +  G(3df ,3pd)  av¬ 
erage  is  133.5  kcal/mol,  quite  close  to  the  G2,  the 
range  is  17.5  kcal/mol. 


Conclusions 

In  the  comparisons  that  have  been  made  in  this 


work,  the  B3LYP  procedure  has  been  foimd  to  be 
the  most  reliable  of  the  density  functional  methods 
that  were  investigated  and  generally  competitive 
with  ab  initio  MP4  and  QCISEXT)  techniques. 

A  notable  feature  of  both  the  BLYP  and  B3LYP 
results  is  the  absence  of  any  consistent  correlation 
between  their  accuracy  and  the  size  of  the  basis 
set.  Evidence  of  this  can  be  found  throughout 
Tables  IV-VII.  Thus,  the  BLYP/6-31G(d)  AE  val¬ 
ues  for  Eqs.  (2)-(5)  are  distinctly  better  than  are 
the  BLYP/6-311-t-G(3d/,2pd);  as  another  exam¬ 
ple,  the  average  of  the  B3LYP/6-31G(d,  p)  en¬ 
thalpies  of  formation  is  130.8  kcal/mol,  nearly 
matching  the  G2  131.0,  and  the  range  is  only  13.6 
kcal/mol,  while  the  corresponding  B3LYP/6-311 
+  G(3df,  2  pd)  figures  are  133.5  and  18.1  kcal/mol, 
respectively.  It  is  clear  that  a  larger  basis  set, 
which,  of  course,  increases  the  demands  upon 
computational  resources,  is  not  a  reliable  means  of 
achieving  greater  accuracy.  A  similar  conclusion 
was  reached  by  Hertwig  and  Koch  in  a  DFT  study 
of  homonuclear  diatomic  molecules  [22]. 
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ABSTRACT _ 

A  generalized  density  functional  theory  (DFT)  is  proposed  based  on  a  generalized 
Hohenberg-Kohn  theorem  with  the  pair  density  as  the  key  quantity  and  the  kinetic 
energy  as  a  universal  functional  of  the  pair  density.  It  is  assumed  that  there  exists  an 
effective  interaction  potenrial  which,  via  the  corresponding  two-particle  (2P)  Schrodinger 
equation,  generates  2P  orbitals  (geminals)  from  which  follows  the  pair  density,  just  as  in 
the  conventional  DFT  the  density  follows  from  IP  orbitals  (as  the  solutions  of  an  effective 
IP  Schrodinger  equation).  According  to  three  different  representations  (natural  spectral 
resolutions)  of  the  pair  density  or  the  cumulant  pair  densities  in  terms  of  geminals,  three 
versions  of  a  pair  DFT  (PDFT)  are  formally  sketched.  Also  considered  are  the  relation 
between  electron  correlation  and  particle-number  fluctuations  in  fragments  of  the  system 
and  the  use  of  the  pair  density  for  an  estimation  of  such  fluctuations.  ©  1996  John  Wiley 
&  Sons,  Inc. 


1.  Introduction 

Density  functional  theory  (DFT)  provides  us 
with  the  ground  state  (GS)  energy  E  of  an 
interacting  many-electron  system  (finite  or  ex¬ 
tended)  and  the  GS  density  p(l),  the  diagonal  of 
the  one-particle  (IP)  reduced  density  matrix  (RDM) 
7(1,10,  referred  to  below  as  the  1-matrix.  [The 
shorthands  1  =  (r^,  a^)  and  f  dl  =  E^.  /  d^r^  are 
used.]  The  electrons  are  bound  by  an  external 
potential  interact  via  ^he 

electron-electron  repulsion  €^/r-^2‘  Th^  inputs  of 


the  effective  IP  scheme  after  Kohn  and  Sham  (KS) 
are  the  external  potential  an  approxi¬ 

mation  for  the  exchange  (X)  and  correlation  (C) 
energy  E^c  as  a  fimctional  of  the  density  p(l). 
Several  review  papers  and  monographs  show  the 
'"triumphal  march"  of  DFT  in  solid-state  theory 
and  quantum  chemistry  [1-21],  recent  papers  deal 
with  improvements,  refinements,  extensions,  gen¬ 
eralizations,  benchmark  checking,  connections  with 
related  theories  [22-25,  70].  Although  "the  density 
has  its  charm  and  power"  [26],  it  gives  "only"  the 
probability  of  finding  an  electron  at  "1"  and  be¬ 
sides  this  the  average  number  of  electrons  in  an 
arbitrarily  chosen  fragment  fl  of  the  whole  space 
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(e.g.,  the  region  around  an  atom  or  of  a  bond  in  a 
molecule)  via 


N^  =  fjlp(l),  (1.1) 


which,  of  course,  approaches  the  total  number  of 
electrons  N  for  fl  approaching  the  total  space. 

Obviously,  further  information  about  the  quan¬ 
tum  kinematics  of  the  GS  of  a  correlated  electron 
system  is  contained  in  the  1-matrix  yd,  T)  and  in 
the  two-particle  (2P)  or  pair  density  n(l;2),  which 
is  the  diagonal  of  the  2-matrix  and  which  is  nor¬ 
malized  according  to 


/ 


did! 

21 


n(l;2) 


the  total  number  of  electron  pairs,  as  well  as  in 
higher-order  RDMs  [27].  The  l-matrix  and  pair 
density  yield  the  density  via  p(l)  =  y(l,  1)  and 

.  d2 

p(l)  =  /  (1.2) 


respectively.  In  addition  to  this  from  y(l,lO  fol¬ 
lows  the  momentum  distribution,  which  contains 
— in  the  case  of  crystalline  metals — the  Fermi  sur¬ 
face,  and  generally  follow  the  q-order  nonidempo- 
tency  (per  particle) 

1 

c(q)  =  1  -  c(l)  =  0, 

and  from  c'(l)  the  correlation  "entropy"  (per  par¬ 
ticle) 


number  of  electron  pairs  in  a  fragment  fl  via 
.  dld2  I 

which  of  course  approaches  the  total  numbers 
of  electron  pairs,  for  H  ^  oo. 

This  allows  the  calculation  of  the  particle- 
number  fluctuations  in  a  fragment  ft,  because  of 


and  the  definition  of  the  variance  as 


=  (N^)n  - 


=  (  d\d2n{l;l) 
•'n 


Nn(N,,  -  1).  (1.4) 


Then  P^(M),  the  probability  of  finding  M  (  = 
0, 1,  ...,N)  electrons  in  ft,  can  be  obtained  ap¬ 
proximately  from  Nfi  and  maximizing  its 

information  entropy 

5a  =  ~EPa(M)lnP,,(M). 

M 


The  result  is  Pn(A4)  =  Cexp(-a(M  —  pY)  with 
a,  p,C  to  be  determined  from  and  the 

normalization.  For  the  exact  particle  number  prob¬ 
ability  in  ft  higher-order  distribution  functions 
would  be  needed  [37-39]: 


Pn(M) 


M! 


E 

M'  =  0 


M'! 


(1.5) 


Here  the  notation 


1 

s  =  -  — Tr(y)ln(y) 

as  recently  discussed  quantitative  measures  (on 
the  IP  level)  of  the  qualitative  term  correlation 
strength  [28-31,  36].  Originally  Lowdin  [32]  sug¬ 
gested  studying  the  meaning  of  Tr(y  —  y^)  which 
is  just  Nc(2)  with  the  second-order  nonidempo- 
tency  c(2),  in  the  following  denoted  by  c.  [A  side 
remark:  from  c(q)  follows  the  Tsallis  entropy  s(q) 
^  c(q)/(q  —  1)  with  s(l)  =  s  [33].]  For  other  mea¬ 
sures  of  the  correlation  strength  based  on  particle- 
number  fluctuations  cf.  [34]  or  on  configurational 
interaction  (Cl)  cf.  [35,  36]. 

The  pair  density  gives  the  probability  of  finding 
an  electron  pair  with  one  electron  at  "1"  and  one 
electron  at  "2,"  and  it  again  yields  the  average 


dl'-  f  dMn(l,.,.,M) 

is  used,  Hq  =  1.  In  terms  of  the  cumulant  expan¬ 
sion 

nil)  =  p(l), 

n(l;2)  =  p(l)p(2)  -  w(l)2), 
n(l;2;3)  =  p(l)p(2)p(3)  -  p(l)i4;(2;3) 

-  pi2)zv(3;l) 

-  pi3)wil;2)  +  ii;(l;2;3), 

etc.,  the  variance  is 

(AN,,f  =  Nn  -  W®,  W®  =  f  dld2  w(l;2), 

•'n 

(1.6) 
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and 


Pa(M)  =  exp 


+ 


X 


(Nn) 

M! 


M 


■expN-M(-^n)  (1-7) 


describes  the  deviation  from  the  (classical)  Poisson 
distribution  due  to  X  and  C  hidden  in  the  hier¬ 
archical  series  •  •  •  •  The  index  N  —  M  at 

"'exp"  means  that  the  power  series  stops  with 
-  Ml 

The  relations  between  correlation  and  fluctua¬ 
tions  are  discussed  by  Fulde  [34].  Therein  the 
molecule  C2H2  is  considered  as  an  example,  and 
the  probability  of  finding  in  the  ground  state 
0, 1, . . .  or  8  valence  electrons  at  a  C-atom  is  con¬ 
sidered.  The  average  value  is  4,  but  the  variance 
depends  on  whether  or  not  correlations  are  taken 
into  account.  A  correlation-induced  narrowing  is 
found  accompanied  by  a  suppression  of  fluctua¬ 
tions:  The  probability  for  configurations  with  large 
deviations  from  the  average  value,  i.e.,  0  or  1, 
respectively,  7  or  8  valence  electrons  essentially 
goes  down  (to  almost  zero).  Similarly  in  [71]  a 
detailed  fluctuation  analysis  for  H2  and  N2  is 
performed.  So  the  pair  density  gives  a  deeper 
insight  into  the  internal  dynamical  structure  of  an 
interacting  electron  system,  especially  into  the  mu¬ 
tual  relation  between  correlation  and  fluctuations. 
A  striking  example  in  favor  of  the  pair  density  is 
the  uniform  electron  gas,  where  the  density  p(l)  = 
const  has  no  structure  whereas  the  spin  parallel 
and  antiparallel  pair  densities  with  the  Fermi 
and  Coulomb  holes,  respectively,  have  rich  struc¬ 
tures  with  curvature  and  cusp  properties  for  small 
electron-electron  separations  and  Friedel  oscilla¬ 
tions  for  large  separations.  For  a  recent  discussion 
of  pair  densities  in  the  H2  molecule  cf.  Gritsenko 
et  ai.  [23].  For  strongly  correlated  electron  systems 
with  dominating  on-site  electron-electron  repul¬ 
sion,  the  pair  density  approach  is  possibly  an  ap¬ 
propriate  access  or  starting  point.  Of  course,  the 
price  for  more  information  is  the  need  to  deal  with 
a  more  complicated  function.  With  all  this  in  mind 
one  may  perhaps  say:  if  available,  n(l;2)  would 
have  even  more  charm  and  power  than  p(l). 

For  finite  systems  quantum  chemical  methods 
(e.g..  Cl  ==  configurational  interaction,  CC  = 
coupled  cluster,  MP  =  Moller-Plesset)  allow  in 


principle  the  calculation  of  the  pair  density.  But  for 
extended  systems  (crystals)  the  Hartree-Fock  (FiF) 
approximation  (which  falsely  yields  for  metals  a 
vanishing  density  of  states  at  the  Fermi  energy) 
does  not  take  into  account  correlation,  and  DFT 
gives  ''only"  p(l)  and  E  (for  the  GS).  The  meth¬ 
ods  developed  by  Fulde  et  al.  (use  of  cumulants 
for  size  consistency,  to  avoid  for  extended  systems 
ghosts/spurious  terms  -  N^,  N^, . . . ;  use  of 
projection  and  partitioning  techniques  as  pro¬ 
moted  long  ago  by  Lowdin;  use  of  local  operators 
to  describe  the  correlation  hole;  use  of  incremental 
methods  to  calculate  GS  and  binding  energies) 
have  until  now  been  successfully  applied  "only" 
to  nonmetallic  systems  [34].  So  the  additional 
availability  of  calculational/computational 
schemes  which  provide  the  pair  density  especially 
for  extended  systems  (both  nonmetallic  and  metal¬ 
lic)  seems  to  be  highly  desirable.  For  bosons  a  pair 
density  functional  theory  (PDFT)  has  been  pro¬ 
posed  [40].  For  fermions  the  ideas  of  a  PDFT  [41] 
as  well  as  a  (generalized)  HF  method  for  electron 
pairs  [42]  have  been  presented  recently.  Here  we 
try  to  push  the  PDFT  idea  a  step  further. 

The  advantage  of  DFT  is  its  applicability  to 
finite  as  well  as  extended  (metallic  as  well  as 
insulating/semiconducting)  systems:  The  effective 
IP  or  KS  equation  survives  the  thermodynamic 
limit  (TDL).  In  case  it  is  really  possible  to  develop 
a  practicable  PDFT,  its  advantage  should  be  again 
its  applicability  not  only  to  finite  but  also  to  ex¬ 
tended  systems:  Also  the  effective  2P  equation 
should  survive  the  TDL. 

The  present  work — a  formal  sketch  of  ideas 
toward  a  PDFT  of  the  GS — has  the  following 
structure:  Section  2  clarifies  some  basic  notation. 
Section  3  considers  the  different  possible  spectral 
resolutions  of  the  2-matrix  and  its  cumulant  com¬ 
ponents.  The  derivation  of  DFT  (Sec.  4)  is  formally 
transferred  to  obtain  similarly  (and  bearing  in  mind 
the  spectral  resolutions  of  Sec.  3)  three  versions  of 
PDFT  (Sec.  5).  Section  6  presents  some  discussion, 
open  questions,  and  an  outlook. 


2.  The  System  and  Its  Quantities 

For  simplicity  relativistic  effects  are  neglected 
(spin-independent  Hamiltonian)  and  the  nuclei  are 
resting  at  given  positions  (adiabatic  approxima¬ 
tion).  The  Hamiltonian  of  this  Schrodinger- 
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Born-Oppenheimer  world 

H  =  f  +  Kxt  +  (2-1) 

describes  the  electronic  structure  of  atoms,  Hooke 
law  models,  molecules,  dusters,  solids  as  well  as 
finite  and  extended  jellium  models.  The  IP  part  of 
H  is  Hq  =  Li  HqU)  with  HqU)  =  t(i)  + 
t{i)  =  The  interaction  term  = 

Li<  j  j)  includes  a  coupling  constant  A  vary¬ 
ing  between  0  and  1. 

The  correlated  antisymmetric  wave  function  (of 
the  GS)  is  normalized  as 

/  =  1,  i  =  ir.,a,), 

f  di=  Lf 

Cr, 

and  yields  the  (nonidempotent)  1 -matrix: 

.  d2---  dN 

y(l,l')  =  /  - - N) 

J  (N  -1)1 

X^%V,2,...,N),  (2.2) 

and  the  2-matrix  (with  Id  =  2): 

«(1,1';2) 

.  d3---  dN 

. 

X  ^*(r,2,3,...,N),  (2.3) 

Density  and  pair  density  are  then  p(l)  =  y(l,  1) 
and  /t(l;2)  =  n(l,l;2),  respectively.  The  "'compo¬ 
nents''  of  the  GS  energy  E  =  T  +  in 

terms  of  RDMs  are 


T[y]  = /dlf(l)T(l,l')lr  =  i,  (2.4) 

Kxt[  p]=  f  dl  piDv.Jl),  (2.5) 

,  dld2 

Vin.ln]  =  j  -^n(l;2)vah2),  (2.6) 

or  with  «(1;2)  =  p(l)p(2)  —  zed;  2)  and  = 
Vh  +  V^c- 

^h[  =  dl  p(l)  /  dl  pi2)v,^,(l,2),  (2.7) 

,  rild2 

=  -  j  -^w{V,2)v,,,{l,2),  (2.8) 


or  with  zed;  2)  =  17(1,2)1^  -I-  c-  m(1;2)  and  dxc  — 
^x  + 

,  dld2 

VAy]  =  -f  —  7(l,2)y(2,l)dnt(1.2),  (2.9) 

,  dld2 

Vc[u]  =  -  j  -^u(l,2)c-v,Jl,2).  (2.10) 

H  means  "Hartree"  and  and  Vq  are  referred  to 
as  cumulant  X  energy  and  cumulant  C  energy, 
respectively.  Note  that  y  in  Vx  is  not  the  idempo- 
tent  HF  1-matrix.  In  [43]  the  difference  between 
lyl^  and  lylnp  is  referred  to  as  "correlation  in¬ 
duced  X-hole  narrowing";  therein  this  narrowing 
and  also  1/^  functions  of  are  illus¬ 

trated  for  the  uniform  electron  gas  using  data  for 
the  momentum  distribution  (which  agrees  for  this 
simple  model  case  with  the  natural  occupation 
numbers). 

The  cumulant  pair  densities  ze(l;2)  and  w(l;2) 
are  the  diagonals  of  the  cumulant  matrices  (CMs) 
and  z/(l,l';2),  respectively.  The  IP  and 
2P  eigenfunctions  of  the  ROMs  and  CMs  are  deco¬ 
rated  with  tilde  to  distinguish  them  from  the  IP 
orbitals  and  geminals  of  DTF  and  PDTF,  respec¬ 
tively. 


3.  Reduced  Density  Matrices  and  Their 
Spectral  Resoiutions 

The  1-matrix  yd,!')  can  be  written  in  terms  of 
its  orthonormal  eigenfunctions  (NSOs  =  natural 
spin  orbitals)  and  eigenvalues  (NONs  = 

natural  occupation  numbers)  according  to 

y(l,l')  =  E'Ak(1)«kC<1')  (3.1) 


with  =  N  and  0  <  <  1. 

The  ^j-order  nonidempotency  c(^)  =  1  —  (1/N) 
L^(n^)^  and  the  correlation  entropy  s=  —(1/N) 
L^  rl^ln  measure  the  correlation  strength  (on 
the  IP  level).  In  the  following  the  notation  c  will 
be  used  for  the  second-order  nonidempotency  c(2) 
=  1  -  Tr(y^)/N. 

For  the  spectral  resolution  of  the  pair  density 
n(l;2)  in  terms  of  geminals,  there  are  three  possi¬ 
bilities. 

a.  Version  I:  Here  the  pair  density  is  given  di¬ 
rectly  in  terms  of  orthonormal  natural  geminals 
i^^d,  2)  and  the  corresponding  geminal  occupation 
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numbers  (GON)  via 

«(1;2)  =  EnKl'^K(1.2)|'  (3.2) 

K 

with  =  (2)  0  <  nK(<  [N/2]).  Whereas 

k:  is  a  IP  quantum  number,  here  K  is  a  2P  quan¬ 
tum  number.  The  geminals  !^^(1,2)  are  anti¬ 
symmetric  and  normalized  according  to  / did! 
|i^f;(l/2)iV2!=  1-  Each  geminal  has  a  1-matrix; 

7^,(1,  r)  =  /  d2  4>^(l,2)4>*(r,2), 

I  dl  y^(l,l)  =  2, 
again  with  the  spectral  resolution 
rK(i4')  =  Ln^  =  2, 

K  ^ 

and  with  the  correlation  entropy 

%  =  -xE«Klnn^^  >0. 

^  K 

From  the  "partial"  1-matrix  y^^Cl,  1')  follows  the 
"total"  1-matrix: 

,  d2 

yil,r)  =  f  j^nil,r;2)  (3.3) 

via 

ra,r)  =  -j;^E».Va.r). 

X 

[From  the  contraction  (3.3)  follows  for  1'  =  1  the 
more  simple  contraction  (1.2).] 

The  GS  energy  takes  the  form  E  =  Fq  + 
with 

Eo^LnX,  (3-4) 

K  K 

and  with  the  (diagonal)  matrix  elements 

h°  =  j  dl  ip*(l)hoil)4>^il)  =  (kIHoIk), 

,  dld2  „  _  ,  , 

vr  =  /  -^^K*(l,2)i.i„.(l,2)./.K(1.2) 

=  <K|nJK>. 

b.  Version  II:  Here  the  pair  density  is  decom¬ 
posed  into  a  Hartree  term  and  a  remainder  (which 


describes  XC  beyond  "Hartree"): 

n(l;2)  =  p(l)p(2)  -  w;(l;2).  (3.5) 

The  cumulant  pair  density  ie(l;  2)  including  X  has 
the  contraction 

p(l)  =  |d2ze(l;2).  (3.6) 

Note  the  difference  compared  to  Eq.  (1.2).  And 
w(l;  2)  is  the  diagonal  of  a  corresponding  2-matrix 
zed,  1';  2)  with  the  contraction 

7(1,1')  = /d2ze(l,l';2).  (3.7) 

If  one  would  like  to  express  this  (unsymmetric) 
matrix  ie(l,  l';2)  in  terms  of  symmetric  and  anti¬ 
symmetric  geminals,  then  one  needs  to  decompose 
this  matrix  into  a  symmetric  and  an  antisymmetric 
part.  The  result  for  zed;  2)  is 

M1;2)  =  lE«Kj'Ai(d2)|' 

—  —  E  1 ’AK_d, 2) I  .  (3.8) 

^  K_ 

The  first  term  contains  the  symmetric  geminals 
<Ak,(1.2) 

'Ak/1)'A./2)  -f  4^(2)  1^,/!)  for  Kj  ^  K2 
)^<Ak,(1)'Ak/2)  for /Cl  =  K2 

(3.9) 

and  occupation  numbers 

«K,=  ^  1  with  E  =  L 

K+  Ki<K2 

and 

1 

-M(N  +  1)  -  Nc,  (3.10) 

[  c  denotes  the  second-order  nonidempotency 
c(2)  =  1  -  Tr(7^)/N.]  These  symmetric  geminals 
are  imcorrelated  because  they  are  single  perma¬ 
nents  [built  up  of  the  NSOs  ij/^d)  like  determi¬ 
nants  with  all  plus  signs].  Their  1-matrices  are 
explicitly  given  by 

7i,^d,l')  =  .Ak, (!)<(!')  +  -A./D^d'), 
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containing  (spin  dependent!)  occupation  numbers 
1  (for  ^  K^)  and  2  (for  =  k^).  The  second 
term  of  Eq.  (3.8)  contains  antisymmetric  geminals 
if/ ^  (1,2)  and  occupation  numbers  with 

^  1 

LnK_=  -N(N-  1)  -Nc,  (3.11) 

K_  ^ 

These  antisymmetric  geminals  are  correlated,  i.e., 
they  are  sums  of  2P  Slater  determinants.  Their 
1 -matrices 

4  (1,2)[«A^  (l',2)]* 
have  spectral  resolutions 
jK.ihr)  = 

K 

j:n^-=2 


and  correlation  entropies 
1 

=  --  h^-  >  0. 

K 

The  GS  energy  E  =  Eg  +  -t-  contains  now 


E  «K, 


^XC  =  -:zL>~^KyK.!+  T  L«k  4"'  . 

^  ^  K_ 


(3.12) 


where 

K'i,  ;  K2,  K2 

=  fdl  f  d2  >p*{2)l,fi)v,Jh2). 

The  are  similarly  defined  as  in  Eq.  (3.4). 

c.  Version  Ill:  Here  the  "'correlation  only"  (X  not 
including)  cumulant  pair  density  ii(l;  2)  is  consid¬ 
ered.  It  is  defined  via 

7!(1;2)  =  p(l)p(2)  -|y(l,2)P  -  c  ■  iiil;2)-, 

(3.13) 

because  of  /  d2  «(1;2)  =  (N  -  l)p(l)  it  has  the 
contraction  property 

cf  d2u(l;2)  =  p(l)  -  J  d2\y{l,2}t  (3.14) 
which  follows  from  the  more  general  contraction 


of  the  matrix  u(l,  V;  2): 


cf  d2  m(1,1';2)  =  r(l,l')  -  f  d2yil,2)y(2,V). 

(3.15) 


Furthermore  «(1;  2)  has  the  normalization 
cfdld2  m(1;2)  =  N  -  fdld2  |y(l,2)|^  or 


j  dld2u(l;2)  =  N  (3.16) 


and  the  spectral  resolution 


m(1;2)  =  I:«J4(1,2)|'.  (3.17) 

K 

The  antisymmetric  geminals  and  the  occupation 
numbers  are,  of  course,  different  from  those  of  Eq. 
(3.2);  for  simplicity  an  additional  index  to  distin¬ 
guish  them  will  not  be  introduced.  Here  the  occu¬ 
pation  numbers  n,,  are  normalized  to 

K  ^ 

Again  similar  as  in  version  I  each  of  these  corre¬ 
lated  geminals  i^^(l,2)  has  a  1-matrix  y^(l,  1') 
with  a  corresponding  spectral  resolution  and  a 
correlation  entropy  s^.  Here  the  relation  between 
the  "total"  1-matrix  yd,!')  and  the  "partial"  1- 
matrix  y^(l,  1')  is  more  complicated: 

r(l,l')  -  f  d2y{l,2)yi2,r)  =  c  E  "KTxd.  1')- 

K 

The  GS  energy  E  =  Eq  -h  con¬ 

tains  now 


Vv 


- h  h 

ry  La  '^Ky''K2^K^,K2)K2,  Kx  ' 


Vc  =  -  E^kC-^"'- 


(3.19) 


Note  the  role  of  the  second-order  nonidempotency 
renormalizing  the  coupling  constant  A  of  the  bare 
electron-electron  repulsion  v-^^^.(l,2)  and  note  the 
cancellation  of  the  self-interaction  terms  (arising 
from  K|  =  /C2)  in  the  terms,  similarly 

as  this  is  known  from  the  HE  approximation. 


4.  Density  Functional  Theory 

The  DFT  starts  with  the  1:1  correspondence  be¬ 
tween  the  external  potential  and  the  density 
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(Hohenberg-Kohn  theorem).  So  the  energy  and  its 
"components"  T  and  V  are  for  fixed  func¬ 

tionals  of  p(l).  The  external  potential  expectation 
value  is  an  explicitly  given  linear  functional: 

p]  =  /dl  piDv^Jl).  (4.1) 

The  kinetic  energy  functionally  depends  on  p(l) 
because  the  1-matrix  y(l,  1')  is  a  functional  of  p(l). 
The  interaction  energy  depends  functionally  on 
p(l)  because  the  pair  density  n(l;2)  is  a  functional 
of  p(l). 

Now  in  view  of  the  RDM  analysis  of  Section  3 
these  unknown  functionals  are  made  partially  ex¬ 
plicit  under  the  assumption  that  there  exists  an 
effective  (local)  IP  potential  which  is  a  func¬ 

tional  of  p(l)  and  which  causes  orthonormal  IP 
orbitals  and  IP  energies  in  such  a  way 

that  the  GS  density  of  the  interacting  system  is 
given  by 

occ 

p(l)  =  EI'/'k(1)I'  (4-2) 

K 

as  a  variational  ansatz  (v  representability)  and  the 
GS  energy  follows  as  E[  p]. 

To  derive  this  effective  IP  Schrodinger  equation 
for  the  orbitals  the  1-matrix  y(l,l')  is  de¬ 

composed  according  to 

occ 

7(1,1')  =  E'AK(l)'Af(l')  +  Ayd,!')  (4.3) 

K 

into  an  idempotent  part  (first  term)  and  an  un¬ 
known  remainder  (second  term)  with  the  property 
Ayd,  1)  0.  This  yields  the  ansatz  (4.2)  and  makes 

the  "total"  1-matrix  y(l,l0  nonidempotent.  Thus 
the  kinetic  energy 


occ 

T[7[p]]  =  +  AT[p] 


K 


(4.4) 


consists  of  an  explicitly  given  orbital  part  TJ  p]] 
[note  that  the  IP  orbitals  are  functionals  of 

p(l)]  and  an  unknown  remainder.  The  pair  density 
nil;  2)  is  decomposed  as 

n(l;2)  =  p(l)p(2)  +  An(l;2)  (4.5) 

[comparison  with  Eq.  (3.5)  shows  An(l;2)  = 
-  ivil;  2)],  so  the  interaction  energy 

p]]  ""  P]  +  p]  <^4.6) 


consists  of  the  explicitly  given  Hartree  term  and  an 
unknown  remainder.  The  sum  of  the  unknown 
remainders  is  denoted  with  Exct  pl'  nonidem- 
potency  and  fluctuations  caused  by  XC  are  hidden. 
So  the  energy  density  functional  is  given  by 

£[p]  =T,[^Ajp]]  +Kxt[p] 

+  Vj^[  p]  +  Exct  P^*  (4.7) 


Minimizing  this  functional  under  the  constraint 
/  dl  p(l)  =  N  yields  the  KS  equation 


HqH)  +  v^il)  + 


6£xc[  P^ 

dpiiV 


^k(1)  =  ^k^k(1)^ 


(4.8) 


where  %(1)  =  j  d2  pi2)v^^^(h2)  is  the  Hartree  po¬ 
tential.  Note  the  difference  between  the  KS  orbitals 
ip^il)  and  the  NSOs  4/^(1).  Their  dissimilarity  to¬ 
gether  with  the  difference  between  the  NONs 
and  their  KS  parties  ( =  0  and  1)  are  contained 
in  Ay(l,l')  and  thus  in  Ar[  p]  which  is  part  of 
Exct  p]- 

In  the  same  way  that  KS-DFT  is  obtained,  three 
types  of  PDFTs  are  derived,  thereby  bearing  in 
mind  the  spectral  resolutions  of  the  pair  density 
and  its  cumulant  pair  densities. 


5.  Pair  Density  Functional  Theory 

The  PDFT  starts  with  the  correspondence  be¬ 
tween  the  external  and  interaction  potentials  on 
the  one  hand  and  the  pair  density  on  the  other 
hand  (generalized  Hohenberg-Kohn  theorem,  cf. 
Appendix).  So  the  energy  and  its  "components"  T 
and  y,  also  y(l,l0  and  p(l)  are  for  fixed  v^^^il) 
and  2)  functionals  of  nil;  2),  Their  detailed 

representation  depends  on  which  PDFT  version 
(I,  II,  III)  is  considered.  In  each  of  these  versions  it 
is  assumed  that  there  exists  an  effective  2P  Schro¬ 
dinger  equation  generating  orthonormal  geminals 
in  such  a  way  that  the  true  pair  density  and  the 
true  cumulant  pair  densities  are  given  by  geminals 
just  as  in  the  conventional  DFT  the  density  is 
given  by  IP  orbitals  iv  representability).  Because 
"scientists  have  not  yet  learned  to  think  in  terms 
of  ( 2 )  geminals,  rather  than  in  N  orbitals"  (David¬ 
son  [27],  p.  97)  there  is  on  the  one  hand  a  "mental 
barrier,"  on  the  other  hand  it  seems  worthwhile  to 
overcome  it. 
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PDFT  I:  GEMINAL  REPRESENTATION  OF 
THE  PAIR  DENSITY  11(1;  2) 

Here  the  interaction  energy  is  given  by 

Eq.  (2.6).  With  the  contraction  (1.2)  the  external 
potential  expectation  is 

^  dldl  1  The  2P  Hamiltonian  of  Eq.  (5.5)  corresponds 

Pext[  p[^]]  =  j  — nil-,2)  ^  ^  ^  (without  the  correlation  correction — last  term)  to 

the  reduced  2P  Hamiltonian  of  Coleman  [44]. 

X[r;,,,a)  +  i;^^j(2)].  (5.1) 

PDFT  11:  GEMINAL  REPRESENTATION  OF 
THE  EXCHANGE  INCLUDING  CUMULANT 
PAIR  DENSITY  »v(l;2) 

Here  the  starting  points  are  Eq.  (3.5)  and  the 
decomposition  of  the  cumulant  2-matrix  (with 
2'  =  2); 


is  decomposed  into  a  geminal  part  and  an  un¬ 
known  remainder,  which  describes  XC  beyond  the 
first  term  and  has  the  property  A«(l,  1;2)  =  0  so 
that  1'  =  1  leaves 

occ 

J2(1;2)  =  X;i«/<j,(l,2)l^  (5.3) 

K 

as  a  variational  ansatz  for  the  pair  density  in  terms 
of  orthonormal  antisymmetric  geminals.  With 
the  contraction  (3,3)  the  kinetic  energy 

occ  ][ 

T[r[«]]  =  E-r— r<X|t(l)  +  f(2)|X> 

K  N  i 

+  Txc[«]  (5.4) 

consists  then  of  a  part,  explicitly  given  in  terms  of 
geminals  [which  are  functionals  of  «(1;2)]  and  an 
unknown  remainder. 

Minimizing  the  sum  of  T,  1/ under  the 
constraint  / dld2  w(l;2)  =  NiN  -  1)  yields  the  ef¬ 
fective  2P  Schrodinger  equation 

jy  _  2  +  ^2o(2)] 

ST.An]  \ 

-<e,*,(1,2),  (5.5) 

where  XC  makes  the  bare  interaction  ''effective,'' 
Its  self-consistent  (sc)  solution  gives  via  Eq.  (5.3) 
the  GS  pair  density,  from  which  follows  the  GS 
density  by  the  contraction  (1.2),  as  well  as  the  GS 


1 

ie(l,l';2)  =  -E'/'k,(1,2)[«A^(1',2)]* 

2  OCC 

-^j:rK_ih2)[rK  (ih2)]=^ 

+  Az(;(l,r;2)  (5,7) 

into  two  geminal  parts  and  an  unknown  remain¬ 
der  which  describes  XC  beyond  the  first  terms  and 
which  has  the  property  Air'd,  1;  2)  =  0  so  that 

wd;  2)  =  -E\^l  (1,2)  1^  -  -  ^  I  (1, 2)f 

(5.8) 

is  a  variational  ansatz  for  the  cumulant  pair  den¬ 
sity  zo(l)2)  in  terms  of  N(N  +  l)/2  symmetric 
geminals  and  N(N  —  l)/2  antisymmetric  gemi¬ 
nals.  With  the  contraction  (3.7)  the  kinetic  energy 

1 

T[r[tc]]  =  -E<^^J?(l)  +  f(2)|K^> 

2  X.,. 

1 

--ll(KJta)  +  f(2)|x^> 

^  K_ 

+  T^^qIzv]  (5.9) 

consists  then  of  a  part,  explicitly  given  in  terms  of 
geminals  [which  are  functionals  of  ze(l;2)]  and  an 
unknown  remainder.  With  the  contraction  (3.6)  the 
external  potential  expectation  is 

,.  dld2 

Pextt  =  /  -^^V(1;2)[V^A'^)  -I-  yext(2)], 

(5.10) 
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and  the  interaction  energy  Vint  =  +  ^xc  i® 

given  by  Eqs.  (2.7)  and  (2.8). 

Minimizing  the  sum  of  T,  and  under 
the  constraint  /  did!  ie(l;2)  =  N  yields  the  effec¬ 
tive  2P  Schrodinger  equation; 

/  ^  5Txr[lt>l  ) 

|/7f^(l)  4-  /ih(2)  -  nint(l,2)  -I-  gj^Q.2) 

=  ej^i//^(l,2),  (5.11) 


where  h„{l)  =  hoH)  +  Again  the  bare  inter¬ 

action  potential  [here  with  a  minus  sign  aris¬ 
ing  from  the  minus  sign  in  Eq.  (3.5)]  is  made 
effective  by  an  additional  XC  term. 

The  sc  solution  of  Eq.  (5.11)  gives  via  Eqs.  (5.8) 
and  (3.6)  the  GS  density  and  via  Eq.  (3.5)  the  GS 
pair  density  as  well  as  the  GS  energy; 


E  = 


^  occ  \  occ 

-  X  ^h[  p[w]]  +  'J’xct^l 

^  K^.  ^  K_ 


,  dld2  dTxclte] 


(5.12) 


Solving  Eq.  (5.11)  the  use  of  Hartree  orbitals  should 
be  appropriate. 


PDFT  III:  GEMIMAL  REPRESENTATION  OF 
THE  CORRELATION  ONLY  (EXCHANGE  NOT 
INCLLDING)  CUMULANT  PAIR  DENSITY 
fid,  2) 

PDFT  III  goes  a  step  further  by  using  Eq.  (3.12) 
and  representing  part  of  the  cumulant  pair  density 
w(l;2)  in  terms  of  antisymmetric  geminals.  Note 
that  this  scheme  needs  also  the  nonidempotent 
1-matrix  (at  least  approximately),  which  enters  here 
not  only  the  kinetic  energy  T[y]  but  also  the 
cumulant  X  energy  Vxly],  cf.  Eqs.  (2.4)  and  (2.8), 
and  which  yields  the  second-order  nonidempo- 
tency  c  =  1  -  Trly^l/N.  Here  the  decomposition 

w(l,l';2)  =  Ho(1/1';2)  +  Am(1,1';2), 

(5.13) 

Mo(id';2)  =  1:.Ak(i.2).A*(1',2) 

K 

into  a  geminal  part  and  an  unknown  remainder 
with  the  property  Aw(l,  1;  2)  =  0  provides 

OCC 

h(1;2)  =  i:i^x(l/2)P  (5.14) 

K 

as  a  variational  ansatz  for  the  cumulant  pair  den- 


PAIR  DENSITY  FUNCTIONAL  THEORY 

sity  nil)  2)  in  terms  of  N/2  antisymmetric  gemi¬ 
nals. 

Now  the  1-matrix  y  is  related  to  the  ansatz 
(5.13)  via  the  contraction  (3.15).  Consequently  the 
(explicitly  from  geminals  constructable)  part 
follows  from  Ugd,  r;2)  via 

yo(l,l')  -  /  d2yo(l,2)ro(2,l') 


=  Cof  d2ugil,V-,2),  (5.15) 

while  the  unknown  remainder  Aw  causes  the  un¬ 
known  remainder  Ay  via 

Ar(l,l')-/rf2[Ar(l,2)7o(2,l') 

+  yo(l,2)  Ay(2,l')  +  Ay(l,2)  Ay(2,l')] 

=  CqJ  d2  Am(1,1';2)  +  Ac  J  d2  Wo(l,l';2). 

(5.16) 

The  use  of  y  =  yg  +  Ay  in  T[y]  and  Vxiy]  leads 
to  the  complicated  task  of  performing  the  variation 
of  T  with  via  Eq,  (5.15).  In  addition  a 

consequence  of  y  =  yo  +  Ay  is  p  =  po  +  Ap 
which  causes  also  unknown  remainders  not  only 
of  T  and  Vx/  t>ut  also  of  and 

With  the  simplifying  assumption  (n^y  - 
in^Y/N  the  nonlinear  contraction  (3.15) 
emerges  the  more  simple  linear  form 


y(l,r)  «/d2w(l,l';2),  (5.17) 

which  leads  together  with  the  ansatz  (5.13)  to  the 
kinetic  energy 


occ 

T[y[M]]  =  L<Kif(l)  +  f(2)|X>  +  AT[m]. 


K 

(5.18) 


The  external  potential  expectation  is 

Vext[  P[W]]  =  /  —  M(l;2)[z;ext(l)  +  ^ex.(2)]. 

(5.19) 

The  interaction  energy  +  ^c  i® 

taken  from  Eqs.  (2.7),  (2.9),  and  (2.10)  with  V'xItI 
=  V"x[yo]  +  AVx{u].  The  sum  of  AT[m]  and 
AVxlu]  is  denoted  Exc[w]- 
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Minimizing  the  sum  of  T,  and  V■,^^  under 
the  constraint  f  dld2  u(l;2)  =  N  yields  now  the 
effective  2P  Schrodinger  equation 


+  /S|_[p(2)  CQUjj^j(l,2) 


^£xc  [ 
duil;2) 


'/'k(1,2) 


2), 


(5.20) 


where  =  h„il)  +  v^iD  with  v^iDfiD  = 

-  j  d2  y„(l,2)i;j„((2)/(2).  As  in  version  II  the  inter¬ 
action  has  a  minus  sign,  following  from  the  minus 
sign  of  the  last  term  in  Eq.  (3.13).  Besides  this,  the 
second-order  nonidempotency  Cq  =  1  —  Tr(yQ)^/N 
appears  as  a  factor  renormalizing  the  bare  interac¬ 
tion  potential  self-consistently.  Again  as  in  ver¬ 
sions  I  and  II  an  additional  XC  term  makes  the 
interaction  effective.  Deriving  Eq.  (5.20)  terms  aris¬ 
ing  from  Ac  =  c  —  Cq  and  from  the  functional 
derivative  of  the  second-order  nonidempotency 
have  been  deleted. 

The  sc  solution  of  Eq.  (5.20)  gives  via  Eqs.  (5.14) 
and  (5.17)  the  approximate  1-matrix  ygd,  1')  from 
which  follows  Cq  and  p(l),  via  Eq.  (3.13)  the  GS 
pair  density  n(l;2),  and  the  GS  energy 

occ 

K 


/ 


dld2 

21 


h(1;2) 


5Exc[m] 
Suil;2)  ■ 


(5.21) 


Solving  Eq.  (5.20)  the  geminals  are  suitably  ex¬ 
panded  in  HE  orbitals. 

Summarizing  and  comparing  the  presented 
PDFT  versions  one  may  say: 

Version  I:  Here  the  effective  2P  Hamiltonian 
contains  N,  the  total  number  of  electrons  explic¬ 
itly.  So  it  seems  not  appropriate  for  the  TDL. 

Version  11:  Here  not  only  antisymmetric  but  also 
symmetric  geminals  appear  and  the  density  [via 
contracting  Eq.  (5.8)  by  integration  over  2]  proves 
to  be  the  difference  between  two  (large?)  terms. 

Version  HI:  Here  pair  density  and  (nonidempo- 
tent)  1 -matrix  have  to  be  calculated  simultane¬ 
ously  and  the  XC  term  is  not  purely  "kinetic”  (as 
this  is  the  case  for  versions  I  and  II);  it  has  also  a 
"component"  resulting  from  part  of  the  potential 
(interaction)  energy. 

Versions  II  and  III:  The  geminals  of  the  cumulant 
pair  densities  h;(1;2)  and  nil;2)  result  from  effec¬ 
tive  2P  Hamiltonians  which  contain  the  electron- 


electron  interaction  with  a  minus  sign,  as  if  elec¬ 
trons  would  attract  each  other. 

From  a  size  consistency  point  of  view  version  III 
seems  perhaps  to  be  the  most  appropriate  one.  But 
there  are  still  flaws  which  one  should  try  to  over¬ 
come  [e.g.,  replace  the  condition  Am(1,1;2)  =  0  (or 
add  to  it)  the  requirement  / d2  Ah(1,  1';  2)  =  0  ?]. 


6.  Some  Discussions,  Open  Questions, 
and  Outlook 

1.  Even  if  the  proposed  schemes  for  the  calcula¬ 
tion  of  the  pair  density  survive  a  critical  analysis 
(possibly  after  corrections  or  modifications,  e.g., 
with  GONs  #  1?),  then  the  task  remains  at  first  to 
find  reasonable  approximations  for  the  effective 
interaction. 

a.  One  way  could  be  to  find  scaling  properties 
and  to  exploit  them.  To  find  the  minimum  of  E[n] 
one  can  first  define  the  kinetic  energy  as  a  func¬ 
tional  of  the  pair  density  with  the  constrained- 
search  idea  of  Levy  [45]: 


T[n] 


lim 

'P  n(l ;  2) 


(qrlqr)  ' 


(6.1) 


and  then  minimize  Tin]  -I-  V[n]  under  the  con¬ 
straint  /  d\d2  nil- 2)  =  NiN  —  1).  Perhaps  the 
scaling  properties  of  T[n]  and  consequently  of 
Txc[«]  and  Txc[rr’]  can  be  used  to  find  explicit 
expressions  (functionals).  Based  on  the  local  scal¬ 
ing  method  an  energy  functional  of  the  pair  den¬ 
sity  has  been  derived  [46]. 

b.  Another  way  is  to  say  that  with  each  effec¬ 
tive  interaction  a  (static)  dielectric  function  (DF)  is 
defined  which  contains  (beyond  random-phase  ap¬ 
proximation)  local-field  corrections  being  related 
to  the  pair  density  via  the  static  structure  factor,  cf. 
e.g.  [47].  Possibly  the  "correct"  DF  includes 
higher-order  response  (beyond  the  linear  one)  via 
corresponding  functional  dependences.  For  differ¬ 
ent  kinds  of  DFs  (describing  screening  between 
two  test  charges  or  between  a  test  charge  and  an 
electron  or  between  two  electrons),  cf.  [48].  For 
DFs  of  inhomogeneous  systems,  cf.  [49].  For  a 
relation  between  DF  and  pair  correlation  via  the 
fluctuation-dissipation  theorem  within  DFT,  cf 
[68]. 

2.  Another  task  is  to  make  the  spin  dependence 
more  explicit  by  distinguishing  between  parallel 
and  antiparallel  pair  densities. 
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a.  A  question  in  this  connection  is:  Must  the 
geminals  be  (restricted)  eigenfunctions  of  the  total 
spin  §1  +  §2  of  the  electron  pair? 

b.  Another  question:  How  to  treat  the  total  spin 
S  of  the  N-electron  system  on  the  way  to  an 
effective  2P  equation? 

3.  The  task  to  solve  a  2P  Schrodinger  equation 
becomes  easy  for  two  model  cases:  the  uniform 
electron  gas  and  the  Hooke's  law  model.  Separa¬ 
tion  of  the  (free,  respectively,  harmonic)  center-of- 
mass  motion  leaves  behind  the  IP  problem  for  the 
relative  motion. 

4.  How  one  can  make  use  of  the  equation  of 
motion: 

[hod)  -  hi{r)]ya,r) 

=  -  J  d2[v,Jl,2)  -  vUV,l)]nd,V-,2)  (6.2) 

and,  following  from  it,  of  the  continuity  equation 
and  the  local  stress  theorem  (or  momentum  bal¬ 
ance)  [50]? 

5.  What  is  known  from  detailed  quantum 
chemical  studies  of  atoms  and  molecules  about 
the  structure  (the  eigenvalues)  of  the  2-matrix  and 
its  cumulant  matrices  and — possibly  following 
thereof — would  it  be  more  appropriate  to  modify 
the  decompositions  (5.2),  (5.7),  and  (5.13)?  This 
may  be  especially  important  for  cases  where  "su- 
pergeminals"  appear  with  large  GONs  [51]. 

a.  Do  cases  where  ODLRO  {=  off-diagonal 
long-range  order)  appears  need  a  special  treat¬ 
ment? 

b.  A  (at  that  time)  controversial  paper  on  super¬ 
conductivity  from  a  pair  density  is  [52]. 

c.  Do  the  Lagrange  parameters  in  Eqs.  (5.5), 
(5.11),  and  (5.20)  have  any  physical  meaning? 

d.  How  does  strong  electron  correlation  show 
its  peculiarities  in  the  1-  and  2-matrices? 

6.  What  are  the  relations  between  DPT  and 
PDFT?  In  this  connection  one  may  ask:  If  a  good 
DPT  calculation  is  done,  is  it  then  reasonable/pos¬ 
sible  to  think  about  an  additional  succeeding 
PDPT-like  treatment  of  the  1-matrix  remainder 
Ay(l,l0  and  the  pair  density  remainder  An(l;2) 
hidden  in  the  XC  functional  Exct  p]  DPT.  Such 
a  version  IV  should  be  derived  under  the  con¬ 
straint  of  an  already  known  (from  DPT)  density. 

7.  Recently  the  N  representability  of  the  pair 
density  was  studied  [53].  To  what  extent  the  re¬ 
sults  obtained  therein  have  an  impact  on  the  con¬ 
siderations  of  Section  5  remains  to  be  checked.  For 


the  purpose  of  deriving  effective  2P  Schrodinger 
equations  it  seems  to  be  sufficient  that  orJy  the 
variational  ansatze  (5.3),  (5.8),  and  (5.14)  result 
from  N  particle  wave  fimctions. 

In  addition  to  further  questions  and  references 
mentioned  in  [41]  (cf.  references  [18-22]  therein)  in 
the  following  other  hints  to  papers  are  listed,  which 
might  have  relationship  with  the  proposed  PDFT: 

a.  There  exists  a  rich  literature  on  antisym¬ 
metrized  geminal  power  (AGP)  wave  functions,  cf. 
[8]  and  [54];  in  [55]  the  2-matrix  of  the  AGP  wave 
function  has  been  studied. 

b.  Functionals  of  the  1-matrix  and  of  the  pair 
density  are  discussed,  e.g.,  in  [56-58]  and  [46], 
respectively. 

c.  What,  if  any,  is  the  relationship  of  Section  5 
to  the  density  matrix  hierarchy  decoupling  schemes 
of  [59]  and  [60]?  Therein  the  2-matrix  is  approxi¬ 
mated  in  terms  of  the  1-matrix,  respectively,  a 
"density  equation"  is  derived,  which  yields  the 
2-matrix  without  any  use  of  the  many-electron 
wave  function! 

d.  Another  way  of  exploiting  density  matrices 
is  to  obtain  the  XC  potential  (of  the  Kohn-Sham 
scheme)  from  low-order  density  matrices  [61]. 

d.  What,  if  any,  is  the  relationship  to  the 
Euler  “Lagrange  equation  of  [62]  resulting  from  a 
trial  wave  function  with  a  local  correlation  factor? 

e.  What,  if  any,  is  the  relationship  to  quantum 
chemical  methods  (especially  CC)  [34,  72]? 

f.  In  [69]  an  effective  interaction  (instead  of  the 
bare  Coulomb  interaction)  is  introduced  so  that 
already  a  single  determinant  of  IP  orbitals 
(Coulomb-hole  HP)  or  the  linear  combination  of  a 
few  determinants  give  reasonable  results  for  the 
density  and  the  energy.  Is  there  a  relationship 
between  this  effective  interaction  and  that  of  PDFT? 

Finally  the  hope  is  expressed  that  the  different 
approaches  of  quantum  chemistry  and  solid-state 
theory  to  tackle  the  problems  of  weak  and  strong 
electron  correlation  in  small  and  extended  sys¬ 
tems,  crystalline  (metallic  and  nonmetallic,  perfect 
and  with  imperfections)  as  well  as  disordered  (al¬ 
loyed  and  amorphous)  materials  without  and  with 
pairing  phenomena,  with  hard  and  soft  reactivity, 
etc.  can  be  brought  to  a  fruitful/prolific  interaction 
with  the  result  of  a  unified  theory  of  electron 
correlation,  knowing  that  electron  correlation  is,  of 
course,  not  only  matter  of  the  GS  but  of  all  the 
states  of  a  system  and  their  more  or  less  hard  or 
soft  response  when  perturbing  the  system  with 
external  fields  or  when  changing  the  number  of 
electrons  ("particle  bath"  as  a  sort  of  an  external 
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'"field").  For  a  historical  review  of  the  develop¬ 
ment  of  the  electron  correlation  problem  cf.  [63].  In 
the  future  theory  of  electron  correlation  the  GS 
pair  density  surely  will  have  its  firm  place;  al¬ 
though  it  is  a  more  complicated  quantum  kine¬ 
matic  quantity  (compared  with  the  density),  it 
may  give  finally  a  more  clear /lucid  picture  of  the 
internal  dynamical  structure  of  a  many-electron 
system  in  its  GS, 


Appendix:  Generalized 
Hohenberg-Kohn  Theorem 

0.  In  [64]  a  Hohenberg-Kohn  theorem  for  the 
nondegenerate  GS  of  an  interacting  system  with  a 
nonlocal  external  potential  has  been  proven.  Simi¬ 
lar  work  is  in  [65,  66].  Here  the  variation  of  the 
pair  density  (and  following  from  it  also  of  the 
density)  with  local  external  and  interaction  poten¬ 
tials  are  considered. 

1.  For  given  and  the  Hamilto¬ 

nian  H  may  have  a  (nondegenerate)  GS  character¬ 
ized  by  £,  ^  and,  following  from  by  nil;  2)  and 
p(l)  =  fd2n(l;2)/(N  —  1).  Other  potentials 

and  v[^^(l,2)  yield  and  n'(l; 2),  p'(l).  The 

assumption  n'(l;2)  =  n(l;2)  leads  to 

E  <  E'  +  j  dl  p(l)[jy^^^(l)  - 
dld2 

and  a  similar  inequality  with  primed  and  un¬ 
primed  quantities  exchanged.  So  E  +  E'  <  E'  +  E 
would  result  showing  the  above  assumption  to  be 
wrong.  Conclusion  from  this  contradiction;  n'{l;2) 
^  «(1;2)  when  u'^td)  ^  ^’oxtd)  and/or  ¥= 

2.  For  given  and  Uin,(l,  2)  the  GS  energy 

is  a  functional  of  the  pair  density:  E[  n]  =  T[n]  + 
V'ext[  p[«]]  +  Vjj„[n].  T^e  pair  density  of  the  GS 
’Fq  minimizes  E[n]  and  this  minimum  Eing]  is 
the  GS  energy  Eq.  Proof;  Assume  a  state  'F/ A  %) 
which  yields  a  pair  density  n,(l;2)  and  minimizes 
the  kinetic  energy  according  to  Eq.  (6.1),  then 

Eo  =  =  E[n,]. 

3.  In  [67]  (the  author  thanks  N.  Ashcroft  for 
giving  a  hint  to  this  reference),  it  is  similarly  shown 
that  under  given  conditions  of  (temperature  and) 
density,  the  pair  potential  Uj„((l,2),  which  gives 


rise  to  a  given  radial  distribution  function  n(l;2), 
is  imique  up  to  a  constant. 

Note  added  in  proof:  The  DFT  literature  [1-21]  is 
continued  by  H.  Eschrig,  The  Fundamentals  of 
Density  Functional  Theory  (Teubner,  Stuttgart, 
1996). 
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ABSTRACT _ _ _ _ _ 

The  optimal  mixing  coefficient  C  of  the  exchange  energy  and  the  electron-electron 
interaction  part  of  the  exchange-correlation  energy  in  the  formula  for  the  total 
exchange-correlation  energy  was  expressed  through  the  ratio  of  the  kinetic  T  and 
potential  W,  contributions  to  the  correlation  energy  E,.  This  expression  can  be  derived 
from  a  Heavyside  step  function  model  of  the  dependence  of  on  the  coupling 
parameter  of  the  electron  interaction  A.  Values  of  T,  and  obtained  from  ab  initio  wave 
functions  were  used  to  estimate  C  for  a  number  of  atoms  and  molecules.  A  strong 
dependence  of  T„  W,.,  and  C  on  the  bond  distance  was  demonstrated  for  the  case  of 
the  Hj  molecule.  T,  and  C  approach  zero  in  the  bond-dissociation  limit;  so  for  an 
electron-pair  bond,  the  admixing  of  exact  exchange  to  obtain  an  accurate  E^,.  is  strongly 
dependent  on  the  bond  length  and  has  to  disappear  for  weak  interaction/large  bond 
distances.  The  potential  of  the  exchange-correlation  hole  constructed  for  H2  from  an  ab 
initio  second-order  density  matrix  was  compared  with  its  generalized  gradient 
approximation  (GGA).  ©  1996  John  Wiley  &  Sons,  Inc. 


Introduction 


Exchange  and  correlation  energies  as  defined 
in  density  functional  theory  (DFT)  [1,  2]  are 

International  Journal  of  Quantum  Chemistry:  Quantum  Chemistry 
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different  quantities  than  the  conventional  ones 
used  in  ab  initio  quantum  chemistry.  They  are 
defined  with  respect  to  the  determinantal  wave 
function  of  Kohn-Sham  orbitals  rather  than  with 
respect  to  the  Hartree-Fock  determinantal  wave 
function.  We  briefly  recall  the  relevant  definitions. 

Tiposium  30,  1375-1384  (1996) 
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The  exact  total  energy  of  an  interacting  electron 
system  is  written  as 


E  =  T  +  J pir)v(r)dr  H - j 


1  fpir^)p{r2) 


+  W. 


O'!  -  r. 


(1) 


where  is  the  exchange-correlation  part  of  the 
electron-electron  interaction  energy,  which  can  be 
written  employing  either  the  two-electron  density 
matrix  FfrjSj,  r2S2)  or  the  pair-correlation  factor 


W, 


'J  f  1  -  r  E  / 


1  _  ,r(r,s,,rjS,) 


ir,  -  r. 


dr^  dr2 


1  r  p(r^)p(r2) 


ii 


ir,  -  r. 


dr^  dr 2 


p(>'])[g(ri,  r^)  -  1]  pffj) 


\r^  -  r. 


(2) 

dr^  dr 2. 

(3) 


The  energy  of  the  Kohn-Sham  determinantal  wave 
function  is 


=  Tj  +  j  p(r)v(r)dr 


1  |■p(r^)p(r2) 

ki  -  ^2! 


dr^  dr  2  +  E^., 


(4) 


which  defines  the  exchange  energy  in  terms  of  the 
Kohn-Sham  orbitals  [see  Eq.  (17)  below].  The  ex¬ 
change-correlation  energy  E^^  of  DFT  is  now  de¬ 
fined  by  writing  the  exact  total  energy  in  the 
following  form: 


E  =  +  j  p(r)v(r)dr 


1  .  p(r-^)p(r2) 
iJ  \r^  -  r2l 


dj-j  dr 2  +  (5) 


Note  that  E^^  incorporates,  apart  from  the  elec¬ 
tron-electron  interaction  term  W^,,,  also  the  differ¬ 
ence  between  the  exact  and  Kohn-Sham  kinetic 
energies,  T  —  T^.  The  Kohn-Sham  kinetic  energy 
Is  is  just  the  kinetic  energy  of  independent  parti¬ 


cles  occupying  the  Kohn-Sham  orbitals; 

Ts=  f4>r(r)VWr)dr.  (6) 

^  1  =  1 


The  following  definitions  are  now  self-explanatory: 
E,  =  E  -  -  E, 

y  _  J’  _  y 

=  £*  +  Ec 
Kc  =  E.  +  Wc 

E.c  =  Kc  +  T,  =  E,  +  W,  +  T,  (7) 

Ec  =  W,  +  T,.  (8) 


The  contributions  and  Wj.  will  be  referred  to 
as  the  electron-electron  potential  energy  parts  (or 
just  potential  parts)  of  the  exchange-correlation 
and  correlation  energies,  respectively.  It  is  interest¬ 
ing  to  observe  that  the  DFT  definition  of  the  corre¬ 
lation  energy  E^  does  not  involve,  as  does  the 
conventional  definition,  correlation  corrections  to 
electron-nuclear  and  Hartree  electron-electron  in¬ 
teraction  energies,  since  the  exact  and  Kohn-Sham 
one-electron  densities  do  not  differ,  while  the  exact 
and  Hartree-Fock  one-electron  densities  do.  The 
DFT  definitions  will  not  differ  much  from  the 
conventional  ones  in  systems  where  there  is  little 
near-degeneracy  correlation  and  the  Hartree-Fock 
approximation  is  a  good  zero-order  starting  point. 
In  systems  with  strong  near-degeneracy  correla¬ 
tion,  however,  the  Hartree-Fock  density  may  dif¬ 
fer  strongly  from  the  exact  one  and  the  DFT  quan¬ 
tities  are  very  different  from  the  conventional  ones 
[3,  4]. 

A  key  formula  of  DFT  to  obtain  the 
exchange-correlation  energy  E,,^.[  p]  of  a  many- 
electron  system  is  an  integral  [5-7]  over  the  cou¬ 
pling  parameter  A  of  the  electron  interaction  A/ri2- 

E,vc[  P] 


p(ri)[g^(ri,  >-2)  -  1]  p(r2) 

ki  -  »'2l 


dAdj-j  dr  2, 


(9) 


where  p(r)  is  the  electron  density  and  g^(rj,  j'2)  is 
the  pair-correlation  function  of  a  system  in  which 
the  electron-electron  interaction  is  A/rj2  and  in 
which  an  “external"  potential  v\r)  is  present  of 
such  a  form  that  the  one-electron  density  p\r) 
corresponding  to  the  exact  wavefunction  is 
equal  to  the  exact  density  p{r)  of  the  fully  interact- 
ing  system  at  each  A.  Usually,  the  integration  over 
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A  is  effectively  carried  out  first  and  p]  is 
expressed  as  the  spatial  integral 


p]  =  / 


p(ri)[g(r^,r2)  -  1]  p(»‘2) 


r,  -  r. 


dr^  dr2 
(10) 


of  the  coupling  constant  integrated  pair-correla¬ 
tion  function  g(r^,  r2): 

giri,r2)  =  Cg\r^,r2)  dX.  (11) 

•^0 

Numerous  approximations  of  gitj,  r2)  were  used 
in  DFT  to  calculate  p]  via  Eq.  (10). 

One  can,  alternatively,  carry  out  first  the  spatial 
integrations  in  Eq.  (10)  and  represent  p]  as  an 
integral  of  the  electron-electron  interaction  energy 
part  W/J  p]  of  the  exchange-correlation  energy  of 
electrons  in  the  model  system  with  the  pair-corre¬ 
lation  function  gKr^,  r2): 

£xc[p]  -  f\Mp]d\.  (12) 

*'0 

In  its  turn,  p]  may  be  expressed  as  an  integral 
of  the  potential  of  the  exchange-correlation  hole 


W,:',[p]  =  -fpirW;f^'Kr)dr, 


=  f 


p(r2)[g^(ri>r2)  -  l] 


dr  2- 


(13) 

(14) 


Expression  (12)  can  also  be  used  as  a  basis  for 
the  DFT  approximations.  In  particular,  Becke  pro¬ 
posed  in  [8]  to  approximate  the  dependence  of 
on  A  with  a  linear  function,  so  that  integral  (12)  is 
calculated  in  this  approximation  via  the  simple 
trapezoidal  rule  C'half-and-half '  formula): 

Ejpr^^-\w,\[p]  +  ^E,[p]^  (15) 

where  E^[  p]  =  p]  is  the  exchange  energy  of 
the  Kohn-Sham  (A  =  0)  system,  the  pair-correla¬ 
tion  function  of  which,  [g^ir^,  r2)  -  1],  of  course, 
only  describes  the  exchange  hole.  With  ap¬ 
proximated  with  the  functional  of  the  local 

spin-density  approximation  (LSD A)  [9]  and  with 
the  rigorous  exchange  functional  E^  of  the 
Kohn-Sham  (KS)  orbitals  (the  latter  were  obtained 
from  the  self-consistent  LSDA  calculations)  for¬ 


mula  (15)  provides  reasonable  accuracy  for  the 
calculated  atomization  energies  E^  of  a  representa¬ 
tive  set  of  molecules.  A  linear  combination  of 
and  with  the  coefficients  fitted  to  ex¬ 
perimental  data  yields  substantially  better  E^  val¬ 
ues  [8]  and  an  empirical  combination  of  E^  with 
four  approximate  exchange  and  correlation  func¬ 
tionals  [10]  reproduces  also  total  atomic  energies, 
atomic  and  molecular  ionization  potentials,  and 
molecular  proton  affinities. 

The  question  arises.  What  is  the  optimal  mixing 
of  and  E^  to  provide  an  accurate  E^,  value 
for  a  given  system?  In  this  article,  an  exact  mixing 
coefficient  C  is  expressed  through  the  ratio  of  the 
kinetic  and  potential  contributions  to  the 
correlation  energy  E^.  This  mixing  can  be  derived 
from  a  Heavyside  step  function  model  of  the  de¬ 
pendence  of  on  A,  which  is  an  alternative  to 
the  linear  interpolation  model  of  Becke.  Values  of 
and  W^  obtained  using  accurate  quantities 
( p,  E,  T,  E^)  from  ab  initio  wave  functions  are 
used  to  estimate  C  for  a  number  of  atoms  and 
molecules.  The  dependence  of  W^,  and  C  on  the 
bond  distance  is  analyzed  in  the  case  of  the  H2 
molecule  and  an  accurate  asymptotic  formula  for 
is  obtained  for  the  bond  dissociation  limit.  The 
shape  of  the  potential  as  function  of  the 

electron  coordinate  r  is  obtained  from  an  accurate 
ab  initio  second-order  density  matrix  for  FI  2  [11, 
12]  and  is  compared  to  its  generalized  gradient 
approximation  (GGA)  [13-15]. 


A  Mixing  Formula  and  a  Model  of  the 
\  Dependence 

In  this  article,  a  mixing  formula  of  the  type 

Ejp]’"‘’'  =  a-C)W,\  +  CE,  (16) 

is  considered,  which  mixes  the  upper  and  the 
lower  limits  of  the  integral  (12).  In  the  case  of 
A  =  0,  the  general  formula  (13)  for  W/^  can  be 
rewritten  in  terms  of  the  occupied  KS  orbitals 

I  N  N 

W,°[p]-Ejp]=--E  I 

^  ,  =  1  ;  =  1 

^  <^,(j'i)(/)f  (»-2)<^)*(ri)(/)^(j-2)  ^  ^ 

X  /  ^  j  dVi  ar2- 

^  ki  -  ^2! 

(17) 
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The  difference  between  p]  and  E^[  p]  is  the 
potential  energy  part  W|.[  p]  of  the  correlation  en¬ 
ergy  [cf.  Eqs.  (8)]: 

p]  ~  Et[  p]  =  P]  =  2 

(18) 

where  is  the  potential  of  the  Coulomb  hole, 

the  difference  between  the  complete  hole  de¬ 
scribed  by  J-j)  -  1]  and  the  exchange  hole 

described  by  [g°(ri,  r2)  -  1].  The  integrand  of  Eq. 
(12)  is  the  exchange-correlation  part  of  the  elec¬ 
tron-electron  potential  energy,  but  the  integration 
over  A  produces  the  total  E,.^[  p]  including  its 
kinetic  contribution  T^: 

E.vc-[  p]  =  pKl  P]  rfA  =  E,[  p]  +  W,[  p]  +  TJ  p], 
•^0 

(19) 

For  a  particular  many-electron  system,  the  exact 
value  of  the  mixing  coefficient  C  in  formula  (16) 
can  be  defined  from  the  requirement  that  (16) 
should  reproduce  the  accurate  value  of  p]: 

E,,.[  p]  =  E,[  p]  +  W,[  p]  -6  T,.[  p],  (20) 

p]  =  (1  -  +  CE, 

=  EJ  p]  +  WJ  p]  +  TJ  p],  (21) 

From  Eqs.  (8)  [cf.  also  (18)],  the  expression  for  C 
follows: 

02) 

The  ratio  (22)  provides  a  theoretical  estimate  for 
the  optimal  mixing  coefficient  of  the  exchange- 
correlation  and  exchange  functionals  in  the  for¬ 
mula  for 

The  mixing  formula  (16)  can  be  derived  from  a 
model,  in  which  the  A  dependence  of  is  ap¬ 
proximated  with  the  Heavyside  step  function  0(x): 

_  c)(Wl  ~~  E^)  +  (23) 


The  integration  of  (23)  over  A  yields  the  mixing 
formula  (16).  The  step-function  model  (23)  can  be 
considered  as  an  alternative  to  Becke's  linear  inter¬ 


polation  m.odcii: 

=  AW,1,  +  (1  -  A)£^,.,  (24) 

the  latter  yielding  after  the  A  integration  the  trape¬ 
zoidal  rule  formula  (15).  Contrary  to  model  (24) 
with  its  relatively  smooth  dependence  on  A,  Eq. 
(23)  models  a  sharp  dependence  with  a  region 
A  <  C  of  weak  Coulomb  correlation,  where 
is  assumed  to  be  equal  to  its  exchange-only  limit 
E^  and  a  region  A  >  C  of  strong  Coulomb  correla¬ 
tion,  where  is  assumed  to  be  equal  to  its 

full  strength  limit  In  the  weakly  correlated 
mode,  the  system  is  assumed  to  have  the  ex¬ 
change-only  pair-correlation  function  g^ir^r-y), 
while  the  full-strength  pair-correlation  function 
Yj)  is  attributed  to  the  strongly  correlated 
mode.  The  mixing  coefficient  C  in  this  model  has 
the  meaning  of  a  critical  coupling  strength,  at 
which  the  transition  occurs  from  the  weakly  corre¬ 
lated  system  to  the  strongly  correlated  system.  In 
the  next  sections,  the  optimal  mixing  coefficient  C 
will  be  estimated  from  the  available  data  on 
and  the  Coulomb  correlation  energy  E^  and  the 
dependence  of  C  on  the  bond  distance  will  be 
studied  in  the  case  of  the  H2  molecule. 


The  Ratio  -  for  Atoms  and 

Molecules 

Table  I  represents  values  of  obtained  for  a 
number  of  atoms  [16]  and  molecules  [17,  18]  and 
the  corresponding  values  of  PV^,  and  C  = 

The  values  for  were  obtained  as  the  difference 

jci  _  jci  _  (25) 

of  the  kinetic  energy  calculated  for  the  system 
with  the  configuration  interaction  (Cl)  method  and 
the  latter  being  obtained  by  available  methods 
[19,  20]  to  construct  the  KS  orbitals  and  potential 
from  the  Cl  electron  density.  is  evaluated  as 

W,  =  E^'^,  -  (26) 

where  the  (Cl  approximation  to  the)  conventional 
correlation  energy  is  the  difference  between 

the  total  energies  of  the  Cl  and  Hartree-Fock 
methods: 

=  E^'  -  E»^  (27) 

as  it  is  traditionally  defined  in  quantum  chemistry. 
Note  that  we  are  approximating  here  the  DFT 
correlation  energy  or,  rather,  the  best  Cl  approxi- 
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TABLE  I _ - _ 

Correlation  energy  components  (au)  and  their 
ratios. 


System 

To 

Wc 

c 

He 

0.037 

-0.079 

0.47 

Li 

0.038 

-0.083 

0.46 

Be 

0.074 

-0.168 

0.44 

B 

0.095 

-0.216 

0.44 

C 

0.12 

-0.272 

0.44 

N 

0.15 

-0.330 

0.45 

0 

0.19 

-0.435 

0.44 

F 

0.24 

-0.546 

0.44 

Ne 

0.30 

-0.676 

0.44 

Na 

0.31 

-0.696 

0.44 

Mg 

0.34 

-0.752 

0.45 

Al 

0.35 

-0.765 

0.46 

Si 

0.36 

-0.781 

0.46 

P 

0.41 

-0.862 

0.47 

H2 

0.032 

-0.071 

0.45 

LiH 

0.057 

-0.134 

0.43 

BH 

0.099 

-0.236 

0.42 

HF 

0.237 

-0.541 

0.44 

mation  to  it, 

Ec;  =  £a_EKS[p]^  (28) 

by  the  conventional  correlation  energy  E^hf-  This 
replacement  is  acceptable,  for  the  present  purpose 
of  interpretation,  for  atoms  and  for  the  chosen 
molecules  at  their  equilibrium  geometry,  because 
in  this  case,  the  Hartree-Fock  electron  density  p"'" 
is  close  to  the  correlated  density  p,  so  that  p] 
and  values  should  also  be  close  to  each  other. 
The  accuracy  of  the  Cl  wave  function  varies  from 
system  to  system,  so  that  a  comparison  of  absolute 
values  of  and  is  not  always  meaningful.  On 
the  other  hand,  their  ratio  C  is  expected  to  be  less 
influenced  by  the  errors  of  the  limited  Cl. 

For  eight  of  18  systems  considered,  C  has  the 
same  value  C  =  0.44,  while  for  the  other  systems, 
deviations  from  this  value  are  not  large  and  C 
varies  within  the  range  0.43-0.47.  It  follows  from 
these  results  that  for  the  considered  atoms  and  for 
molecules  at  the  equilibrium  geometry  (at  least  if 
there  are  not  strong  correlation  effects  at  the  equi¬ 
librium  geometry)  the  optimal  mixing  is  not  very 
sensitive  to  the  type  of  the  system  and  E^^  can  be 
reproduced  with  the  following  combination  of  ac¬ 


curate  functionals: 

E^,[  p]  »  0.56W^\  +  0.44E,,  (29) 

which  is  not  far  from  the  “half-and-half"  formula 
(15)  of  Becke.  In  terms  of  the  Heavyside  step 
function  model  (23),  it  means  that  the  region  of 
strong  Coulomb  correlation  spans  a  slightly  larger 
portion  of  the  interval  0  <  A  <  1  and  at  the  cou¬ 
pling  strength  value  A  =  0.44  the  system  under¬ 
goes  a  transition  from  the  weakly  correlated  mode 
to  the  strongly  correlated  mode. 

Note  that  if,  instead  of  the  functional  of  DFT 
and  V\(.  of  Eq.  (26),  their  conventional  quantum 


chemical  analogs  hf 

T,  hf  =  ^ 

(30) 

^c,HF  ^  ^c,HF  ~  HF' 

(31) 

F  —  F  —  F^^ 

HF  ^  ^  ' 

(32) 

are  inserted  in  (22),  C  would  be  always  equal  to 
0.5  because  of  the  virial  theorem,  which  holds  true 
for  the  exact  solution,  as  well  as  for  the 
Hartree-Fock  solution  with  the  total  and  kinetic 
energies  E^^,  and  electron  density  Levy 
et  al.  [21]  recently  made  the  analogous  observation 
that  eq.  (15)  would  be  exact  in  case  the  virial 
relation  E^  hf  =  “T,  would  also  hold  for  E, 
and  T^.  This  is  in  agreement  with  our  exact  relation 
C  =  -  T,).  The  reason  for  the  lower  val¬ 

ues  of  C  obtained  (see  Table  I)  is  the  difference 
between  and  By  its  definition,  the  KS 

kinetic  functional  p]  minimizes  the  kinetic  en¬ 
ergy  (6)  for  the  correlated  density  p.  This  density 
is  usually  somewhat  more  contracted  around  the 
nuclei  (the  exact  kinetic  energy  is  higher  than  the 
HF  kinetic  energy)  and  the  mirumal  kinetic  energy 
for  is  still  higher  than  is  Tfjp.  As  a  result,  C 

of  Eq.  (22)  is  consistently  lower  than  C  = 

From  the  results  presented,  one  can  conclude 
that  for  atoms  and  for  molecules  at  the  equilib¬ 
rium  geometry  the  model  with  the  step-function  A 
dependence  provides  only  a  slight  alteration  in 
comparison  with  the  linear  interpolation  model. 
However,  the  step-function  model  has  a  definite 
advantage  over  the  linear  model  for  the  process  of 
molecular  dissociation,  because  of  the  well-known 
deficiency  of  the  exchange-only  functionals  in  the 
bond  dissociation  limit.  In  the  next  section,  the 
dependence  of  and  C  on  the  bond  distance  will 
be  considered  for  the  simplest  case  of  the  H2 
molecule. 
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Hr-  The  Dependence  of  and  C  on 
the  Bond  Distance 

T-  and  C  exhibit  a  strong  dependence  on  the 
bond  distance  and  in  the  bond  dissociation  limit  of 
the  Ho  molecule  both  T^.  and  C  approach  zero.  To 
establish  this  trend,  we  shall  derive  first  the 
asymptotic  expression  for  T,.  In  the  bond  disso¬ 
ciation  limit,  the  H2  molecule  is  described  prop¬ 
erly  with  the  Heitler-London  wave  function 

a'^'{r^)bir2)  +  a'^{r2)b(r^) 

2(1  + 

X[a{s^)  13(52)  ~  a(s2)/3(S|)], 

(33) 


where  and  are  the  kinetic  integrals: 


1  c 

=  -  -ja*(r)Vh(r)dr, 

(37) 

1  , 

'^ah  =  ~  2  dr. 

(38) 

The  difference  between  the  total  energies  obtained 
with  ^2)  and  %(x^,  X2)y  the  correlation 

energy,  E^,  approaches  with  increasing  H — H  dis¬ 
tance  its  exact  finite  limiting  value  —0.3125  au 
[22].  On  the  other  hand,  T^.,  the  difference  (36) 
between  the  corresponding  kinetic  energies  de¬ 
creases  with  decreasing  and  at  longer  bond 
distances  and  it  becomes  zero  in  the  infinite  sepa¬ 
ration  limit: 

T,  ^0,  R(H-H)^c^.  (39) 


where  a(r)  and  b(r)  are  the  Is-type  atomic  orbitals 
located  on  the  H  atoms  A  and  B,  a(/)  and  /3(i) 
are  the  one-electron  spin  functions,  and  5^,^,  is  the 
overlap  integral: 

S,,/,  =  f  (J'‘'(r)b(r)  dr,  (34) 

A  convenient  feature  of  the  KS  theory  is  that  in  the 
dissociation  limit  the  KS  system  of  is  properly 
described  with  the  one-determinantal  wave  func¬ 
tion  T^,(V|,  X2)  formed  from  the  cr,  combinations 
of  the  Is  orbitals  a(r)  and  b(r)  (except  possibly  at 
the  bond  midpoint): 


Note  a  striking  difference  between  and  the 
conventional  kinetic  contribution  The  latter 

function  approaches  in  this  limit  the  finite  value 
0.2565  au  [4]. 

As  a  consequence  of  the  asymptotics  (36),  the 
optimal  mixing  coefficient  C  of  Eq.  (22)  decreases 
with  decreasing  and  becomes  zero  in  the  infi¬ 
nite  separation  limit: 

Tc 

C=  -  —^0,R(H~H)^^,  (40) 

so  that  the  mixing  formula  (16)  correctly  reduces 
to 


Xo) 


(a'^(r^)  -h  lA(r^))(a(r2)  +  ^(^2)) 
2v/2(l  +  S„„) 

X[a(si))3(s2)  -  a(s2)/3(si)], 

(35) 


since  just  'I'/xj,  X2)  generates  the  ground-state 
density  p(r)  in  this  limit.  Subtracting  the  kinetic 
energy  obtained  with  ^2)  from  that  ob¬ 
tained  with  X2),  one  can  derive  an  asymp¬ 

totic  expression  for  T^.; 


^  2(r„s„„  -  r,„)(i  -  s,„) 
(1  +  sja  +  s„\)  ' 


=  (1  -  -f  CE,  ^  R(H  -  H)  ->  00. 

(41) 

In  terms  of  the  Heavyside  step-function  model 
(23),  Eq.  (40)  means  that  in  the  infinite  separation 
limit  for  all  A  but  A  =  0  the  system  is  in  the 
strongly  correlated  mode.  This  is  physically  rea¬ 
sonable  in  view  of  the  negligible  interaction  be¬ 
tween  the  H  atoms  at  infinite  distance,  so  that  the 
model  offers  a  physically  correct  picture  [23,  24]. 
For  all  A  >  0,  the  interaction  term  A/r^2  favor 
the  Heitler-London  wave  function  X2) 

[Eq.  (33)]  in  which  two  electrons  are  residing  on 
different  atoms.  Only  for  A  =  0,  the  exact  wave 
function  for  the  noninteracting  electrons  is  just  the 
KS  determinant  %(xi,  X2)  of  Eq.  (35).  The  A  =  0 
point,  however,  becomes  unimportant  in  the  cou- 
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pling  constant  integration  for  the  pair-correlation 
function  and  we  have 

lim  g(r,,  rj) 

=  lim  f^g\r^,r2)dX=  f^g\r^,r2)d\ 

R^ccJq  *^0 

=  g’(ri,r2).  (42) 

In  this  case,  the  step-function  model  has  a  definite 
advantage  over  the  linear  interpolation  model  of 
Eq.  (24),  which  for  each  A  appears  to  mix 

Xj)  and  %(x^,  X2X  leading  to  an  unwar¬ 
ranted  contribution  to  As  a  result,  the 
mixing  formula  (16)  provides  in  this  case  a  consid¬ 
erable  improvement  over  Becke's  ''half-and-half" 
formula. 

To  study  the  dependence  of  V\(,,  and  C  for  a 
wide  range  of  bond  distances  R(H — H),  we  use 
and  E,  values  calculated  in  [25]  at  several  dis¬ 
tances  from  a  full-CI  wave  function.  In  this  case, 
the  rigorous  quantity  Ep  of  Eq.  (28)  was  used  to 
estimate  =  E^  -  since  the  difference  be¬ 

tween  Ef^  and  E^/^p  becomes  appreciable  at 
longer  distances.  The  Hartree-Fock  one-electron 
density  deviates  quite  strongly  from  the  exact  den¬ 
sity  at  longer  bond  distances,  to  the  extent  that  the 
conventional  correlation  corrections  to  the  one- 
electron  energies,  the  Hartree-Fock  electron- 
nuclear  potential  energy  and  the  Hartree-Fock  ki¬ 
netic  energy,  become  larger  in  magnitude  than  the 
correlation  correction  to  the  two-electron  part  of 
the  energy  [4].  The  assumptions  made  in  compil¬ 
ing  Table  I  would  not  be  justified  for  dissociating 
H2.  In  Figure  1,  the  plots  of  T^(R),  W^(R),  and 
C(R)  as  functions  of  E(H — H)  are  presented.  One 
can  see  from  Figure  1  that  in  the  interval  of  K  = 
1-4  au  shows  a  slight  dependence  on  R,  reach¬ 
ing  its  maximum  =  0.042  Hartrees  at  R  =  3  au. 
For  longer  distances,  gradually  decreases,  ap¬ 
proaching  zero  asymptotically  (39).  On  the  other 
hand,  the  absolute  value  of  W,  increases  monoton- 
ically  with  increasing  R.  As  a  result,  their  ratio  C 
gradually  decreases  with  increasing  R;  the  only 
exception  is  a  local  plateau  of  C(R)  in  the  interval 
R  =  2.0-2,5  au.  From  these  results,  one  can  con¬ 
clude  that,  at  least  in  the  considered  case  of  the  H2 
molecule,  the  optimal  mixing  coefficient  C  de¬ 
pends  strongly  on  the  bond  distance. 

The  conclusions  reached  here  on  .the  asymp¬ 
totics  of  can  be  generalized  for  the  case  of  a 
many-electron  homonuclear  molecule.  In  this  case, 
the  value  of  in  the  dissociation  limit  becomes 


R(H-H),  a.u. 


FIGURE  1.  T^,  (au),  and  C  as  functions  of  the  bond 

distance  for  the  H2  molecule. 


equal  to  the  sum  of  the  contributions  of  the 
atomic  fragments.  The  A-integrated  exchange-cor¬ 
relation  hole  for  a  reference  electron  on  one  of  the 
atomic  fragments  will  then  be  equal  to  the  A-in¬ 
tegrated  hole  of  the  atom  itself.  In  the  particular 
case  of  the  H2  molecule,  this  leads  to  the  estab¬ 
lished  asymptotics  (39),  because  for  the  one-elec¬ 
tron  H  atom,  =  0. 


Conclusions 

In  this  article,  the  exact  coefficient  C  of 
mixing  of  the  exchange  E^  and  the  exchange- 
correlation  potential  energy  in  the  formula  for 
the  total  exchange-correlation  energy  E^^  was  ex¬ 
pressed  through  the  ratio  of  the  kinetic  and 
potential  contributions  to  the  correlation  en¬ 
ergy  E^.  This  mixing  corresponds  to  a  Heavyside 
step-function  model  of  the  dependence  of  on 
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A,  where  C  has  the  meaning  of  a  critical  coupling 
strength,  at  which  the  transition  occurs  from  the 
weakly  correlated  system  to  the  strongly  corre¬ 
lated  system.  Values  of  and  W,  obtained  from 
Cl  wave  functions  were  used  to  estimate  C  for  a 
number  of  atoms  and  molecules.  For  the  H2 
molecule,  the  strong  dependence  of  T^,  W^.,  and  C 
on  the  bond  distance  was  demonstrated  and  an 
accurate  asymptotic  formula  for  was  obtained, 
according  to  which  and  C  approach  zero  in  the 
bond-dissociation  limit.  This  dependence  of  the 
admixing  of  "exact  exchange"  on  the  bond  dis¬ 
tance  should  be  taken  into  account  in  the  actual 
applications  of  mixing  formulas. 

To  apply  various  mixing  formulas,  one  has  to 
approximate  with  one  of  the  currently  avail¬ 
able  approximate  functionals.  Thus,  the  problem  of 
the  quality  of  the  approximate  functionals 
arises.  The  potential  of  the  exchange-correlation 
hole  which,  according  to  Eq.  (13),  yields 

the  potential  energy  p],  may  be  used  to  ob- 


7.(a.u.) 

FIGURE  2.  Comparison  of  the  potential  of  the 
exchange-correlation  hole  and  its  GGA  approximation 
(au)  for  the  Hg  molecule  at  /?(H — H)  =  1 .401  au. 


z(a.u.) 

FIGURE  3.  Comparison  of  the  potential  of  the 
exchange-correlation  hole  and  its  GGA  approximation 
(au)  for  the  H2  molecule  at  R{H — H)  -  5.0  au. 

tain  some  insight  in  the  quality  of  the  r-depen- 
dence  of  approximations  to  the  exchange-correla¬ 
tion  energy  density.  In  Figures  2  and  3,  the  essen¬ 
tially  accurate  potential  obtained  for  the  H2 

molecule  from  the  full-CI  second-order  density 
matrix  [11^  12]  at  the  equilibrium  bond  distance 
and  at  the  long  distance  R(H — H)  =  5  au  is  repre¬ 
sented  as  a  function  of  the  distance  Z  from  the 
bond  midpoint.  It  is  compared  with  its  GGA  model 
yhoit’^GGA^  which  is  calculated  via  the  following 
formula: 

^  +  26,"(r) 

+  (43) 

Here,  is  the  exchange  energy  density  (per 

particle)  of  the  local  density  approximation  (LDA), 
2e/  is  the  gradient-dependent  correction  to 
of  Becke  [13],  is  the  Coulomb  correlation 

energy  density  of  Perdew  and  Wang  [9,  14,  15], 
and  is  the  kinetic  correlation  energy  den- 
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Exchange-correlation  energy  (au)  and  its  components  for  the  Hg  molecule. 


fl(H— H)  (au) 

E. 

pGGA 

pLDA 

wr 

M/GGA 

^''xc 

1.401 

5.0 

-0.661 

-0.410 

-0.658 

-0.512 

-0.569 

-0.423 

-0.089 

-0.089 

-0.071 

-0.207 

-0.081 

-0.057 

-0.739 

-0.569 

-0.732 

-0.617 

sity,  which  is  obtained  by  the  partial  differentia¬ 
tion  of  ef'^(r)  [26-28]: 


dr^ 


x;  r) 


0.458 


-7rp(r) 

|Vp(r)l 


X  = 


,  (44) 

(45) 

(46) 

(47) 


Insertion  of  of  Eq.  (43)  into  the  inte¬ 

gral  (13)  yields  the  exchange-correlation  potential 
energy  of  GGA: 

In  Table  II,  values  calculated  for 

j^(H— H)  =  1.401  and  5.0  au  are  presented  to¬ 
gether  with  the  contributions  to  from  the 

LDA  exchange  functional  the  Becke  correc¬ 

tion  E®,  and  the  correlation  contribution 
They  are  compared  with  the  corresponding  values 
of  the  exact  W^,  obtained  by  integration  of 
and  its  contributions,  the  exchange  energies  E^ 
and  correlation  energies  W,,.  One  can  see  from 
Table  II  that  at  the  equilibrium  distance  GGA 
reproduces  W,’,.  very  nicely;  the  corresponding  dif¬ 
ference  is  only  7  mHartrees.  The  LDA  value 
is  lower  than  the  accurate  E^,  but  the  Becke  correc¬ 
tion  compensates  the  difference,  so  that  both  ex¬ 
change  and  correlation  contributions  are  repro¬ 
duced  well  within  the  GGA. 

For  the  long  distance  R  =  5.0  au,  the  E^  value 
becomes  251  mHartrees  smaller  (in  an  absolute 
sense)  than  that  for  the  equilibrium  distance,  since 
in  the  former  case,  the  exchange  hole  is  delocalized 
over  two  well-separated  H  atoms.  An  increase  of 
the  left-right  Coulomb  correlation  W,.  from  —  0.071 
to  -0.207  due  to  the  strong  near-degeneracy  ef¬ 
fect,  leading  to  a  more  pronounced  Coulomb  hole 
[4],  compensates  more  than  half  of  the  above-men¬ 
tioned  difference,  and  the  total  value  at  R  = 
5.0  au  is  only  115  mHartrees  smaller  than  that  for 


R  =  1.401  au.  This  reflects  the  fact  that  delocaliza¬ 
tion  of  the  exchange  hole  at  long  bond  distances  is 
effectively  compensated  by  delocalization  of  the 
Coulomb  hole,  so  that  the  total  exchange-correla¬ 
tion  hole  remains  essentially  localized  around  the 
reference  electron  [4]. 

The  GGA  correlation  functional  fails  to 

reproduce  the  near-degeneracy  effect  at  large 
bond  length;  it  is  only  27.5%  of  the  exact  W,  (see 
Table  II)  at  R  =  5.0  au.  Its  value  is  150  mHartrees 
too  small,  being  even  smaller  than  that  at  R  = 
1.401  au.  However,  the  GGA  exchange  functional 
(EGGZ  ^  E®^'^  -t  E®)  simulates  a  major  part  of  the 
correlation:  Its  value  at  R  =  5.0  au  is  ca.  100 
mHartrees  larger  (more  negative)  than  that  of  the 
accurate  KS  functional  E^.  This  comes  about  by 
the  being  now  slightly  larger  than  E^  (13 

mHartrees  larger,  not  ca.  90  mHartrees  smaller  as 
is  the  case  at  R  =  1.401  au)  and  the  Becke  correc¬ 
tion,  being  equally  large  as  at  R  =  1.401  au,  adding 
an  overcorrection  of  89  mHartrees.  This  yields 
about  a  100  mHartrees  too  negative  which 

partly  compensates  the  150  mHartree's  too  small 
GGA  correlation  contribution  resulting  in  a 

total  that  is  “only"  ca.  50  mHartrees  smaller 

than 

A  comparison  of  the  hole  potentials  and 

yhoie.cGA  ghows  appreciable  local  deviations  (See 
Figs.  2  and  3).  For  both  bond  distances  considered, 
^hoie,cGA  jg  attractive  than  is  in  the 

neighborhood  of  the  H  nuclei  and  it  is  less  attrac¬ 
tive  in  the  outer  region  of  the  H2  molecule.  In  the 
bonding  region,  is  more  attractive  than  is 

yho/e,i  fpj.  i  au  (see  Pig  2),  while  it  is  less 
attractive  in  this  region  for  R  =  5.0  au  (see  Fig.  3). 
The  errors  of  opposite  sign  from  the  different  re¬ 
gions  cancel  each  other  when  is  inte¬ 

grated  via  Eq.  (13),  thus  providing  the  above-men¬ 
tioned  reasonable  values. 

One  can  conclude  from  the  present  results  that 
further  work  is  needed  to  develop  an  approximate 
functional  which  will  yield  better 

values  at  longer  distances  and  will  provide  a  bet¬ 
ter  approximation  locally.  In  this  respect,  a  promis- 
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ing  way  is  not  to  split  the  functional  into  exchange 
and  correlation  parts,  but  try  to  develop  from  the 
very  beginning  a  combined  exchange-correlation 
approximation,  which  allows  to  retain  a  localized 
model  of  the  exchange-correlation  hole.  The  corre¬ 
sponding  work  is  in  progress. 
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ABSTRACT _ _ _ _ _ _ 

The  molecular  properties  characterizing  diammine  platinum  complexes  have  by  way  of 
example  been  analyzed  and  selected  because  of  their  role  in  anticancer  activity.  The 
research  was  done  with  the  use  of  the  nonstandard  cjuantitative  method  to  compare 
molecular  properties  resulting  from  density  functional  theory  (DFT).  Among  different 
molecular  parameters  considered,  the  multipole  moment  components  were  found  mainly 
to  have  clear  correspondence  to  the  trans  (anticancer  neutral)  and  cis  classes  of  platinum 
complexes.  The  results  led  to  the  selection  and  conjecture  of  the  specific  significance  of 
some  molecular  characteristics  contrasted  with  other  assemblies  of  parameters.  ©  1996 
John  Wiley  &  Sons,  Inc. 


Formulation  of  the  Problem 

The  invention  of  Rosenberg  [1]  of  the  filamen¬ 
tous  growth  of  Escherichia  coli  during  elec¬ 
trolysis  with  platinum  electrodes  can  serve  as  the 
example  of  an  unplanned  result  and  as  an  investi- 

*To  whom  correspondence  should  be  addressed. 


gation  to  find  factors  which  could  have  influence 
on  the  cell  growth. 

It  is  well  known  [2-5]  that  the  cis  form  of 
diammine  platinum  complexes  is  the  necessary 
condition  to  observe  anticancer  activity.  Hence,  if 
some  physico-chemical  properties  of  the  two  Pt 
complexes  (active  and  inactive)  are  similar,  then  it 
can  be  supposed  these  properties  have  doubtful 
relevance  to  the  activity  considered.  Otherwise, 
the  same  properties  (under  similar  conditions) 
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would  be  assumed  to  underlie  mutually  excluding 
phenomena,  e.g.: 

Active  Inactive 

1  2 

ds-PtCl2(NH3)2—  < — »  —  frfl«s-PtCl2(NH3)2 

(1) 

matched  by  a  resemblance  under  a\,  and 

do  not  show  these  quantities  involved 
in  the  anticancer  mechanisms.  Oppositely,  if  the 
compounds  of  system  (1)  are  differentiated  (un¬ 
matched)  under  the  families  of  properties 
Pi'  -  Pk  then  these  properties 

can  have  an  anticancer  relevance. 

The  first  task  to  resolve  this  problem  is  how  to 
split  the  matched  system  (1)  and  to  differentiate  as 
far  as  possible  the  considered  compoimds  accord¬ 
ing  to  their  inhibiting  cancer  abilities.  Some  molec¬ 
ular  parameters  are  commonly  considered  for  their 
relevance  to  the  anticancer  activity  [2-5].  The 
properties  to  be  investigated  in  this  study  are 
sequences  0L\,...,a'„  of  parameters  evaluated  for 
particular  platinum  complexes  (indexed  by  i). 
Quantitative  properties  aj,...,a’,  have  been  de¬ 
termined  for  cis  and  trans  conformations  with  the 
same  or  different  nonammine  ligands. 

The  second  step  pursued  was  looking  for  subse¬ 
quences  such  that  if  a'i,...,a'^  was 

matched  by  the  resemblance  with  ct'j, . . . ,  al,^,  then 
i  and  j  should  simultaneously  denote  cis  or  simul¬ 
taneously  trans  conformers.  Thus,  if  for  each  Pt 
complex  i  from  the  set  considered  the  sequence 
. . . ,  (x^f.  fulfills  the  condition  just  mentioned, 
then  Vj, . . . ,  vj,  are  indexing  quantities  distin¬ 
guished  from  aj, . . . ,  aj,  to  parallel  the  cis-trans 
dichotomy. 

What  are  parameter  discriminating  procedures 
and  how  to  prove  the  existence  of  the  subse¬ 
quences  postulated  is  shown  below. 


Computational 


TABLE  I 


Calculated  dipole  moments  (Debye)  for  cis 
and  trans  conformations  of  platinum  compounds. 


Molecule 

Cis 

Trans 

RCIglNHglg 

13.49 

0.00 

PtFglNHglj 

12.18 

0.00 

RlOHlglNHglg 

8.99 

0.00 

PTCIFlNHglj 

12.98 

1.29 

RlOHlFlNHg)^ 

13.30 

1.67 

R(0H)CI(NH)2 

11.41 

2.83 

RlSHlslNHg)^ 

1.68 

0.00 

RBrglNHglg 

11.94 

0.00 

RBrCKNHgla 

12.75 

1.06 

If  X  denotes  a  compound  having  characteristics 
“i/ •••/“„/  then  the  relevant  set  spectrum  [8]  is 
defined  as  follows: 

fx  =  ©{«,}  for  i  =  1,...,/, 

i.e.,  fx  is  the  coproduct  of  one-element  sets  includ¬ 
ing  particular  quantities.  This  allows  one  to  find 
the  general  factors  of  similarity  FS( fx,  fy)  and  dis¬ 
similarity  VD{fx,fy)  for  two  molecules  x  and  y 
considered.  The  factors  with  the  relation  of  toler¬ 
ance  t  have  been  determined  in  [8]: 

FS(fl,  b)  =  |[(fl  Xb)  u  (b  X  fl)]  n  t\, 

FD(fl,  b)  =\a  \  dom((fl  X  b)  n  f)| 

+  |fc\im((fl  X  &)  n  f)|,  (2) 

where  a,  b  are  finite  sets  and  domain  (dom)  and 
image  (im)  of  the  relations  indicated  have  been 
involved. 

Two  quantities  aj  and  are  recognized  to 
fulfill  t  if  their  difference  -  a/|  is  less  than  the 
average  difference  in  the  set  of  quantities  of  the  k 
kind.  The  distance  between  two  compounds  x  and 
y  is  estimated  by  the  following  formula 


The  electronic  structure  of  the  systems  under 
consideration  has  been  calculated  within  the  den¬ 
sity  functional  theory  (DFT).  All  the  calculations 
were  performed  with  the  BLYP  functional  [6]  for 
the  fully  optimized  geometries  at  the  all-valence 
MOPAC  level  [7].  The  final,  first  time  computed 
results  for  platinum-coordinated  complexes  are 
listed  in  Tables  I-IV. 


d(x,  y) 


mfx,fy) 

m(fx,fy)  +  FS( fx,fy)  ' 


(3) 


Such  determined  distance  has  the  following  prop¬ 
erties:  (i)  for  each  molecule  x  from  the  set  estab¬ 
lished: 


d(x,  x)  =  0; 
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TABLE  II 

Calculated  main 
compounds. 

quadrupole  moment  components  (Debye-Ang)  for  cis  and  trans  conformations  of  platinum 

Molecule 

Cis  form 

Transform 

Oxx 

Qyy 

Ozz 

Qxx 

Qyy 

N 

o 

PtCIslNHgla 

-67,1 

-62.4 

-67.4 

-67.1 

-40.9 

-94.3 

PtFjlNHala 

-48.9 

-47.1 

-49.3 

-48.9 

-24.9 

-69.4 

RjOHlalNHala 

-53.5 

-43.6 

-42.9 

-53.4 

-30.4 

—  54.0 

RCIFlNHgjg 

-56.8 

-56.3 

-58.0 

-58.0 

-32.9 

-81 .7 

R(0H)F(NH3)2 

-40,3 

-51.6 

-48.6 

-51.2 

-27.8 

-61 .6 

Pt(OH)Cl(NH)2 

-59.8 

-48.1 

-60.3 

-60.3 

—  35.7 

-73.4 

PtlSHlglNHgja 

-64.7 

-51.9 

-51.0 

-65.0 

-54.3 

-49.8 

RBr2(NH3)2 

-78.1 

-65.7 

-79.0 

-78.0 

-53.9 

-103.1 

PtBrCKNHgla 

-65.7 

-71.0 

-72.7 

-72.6 

-47.4 

-99.4 

(ii)  for  each  molecules  x  and  y  from  the  set 
considered: 

dix,  y)  =  d(y,  x)  (symmetry). 

The  distance  mentioned  is  particularly  a  pseu¬ 
dometric  when  the  tolerance  t  is  an  identity  (cf. 
[8]).  Using  distance  d,  one  can  determine  relations 
between  the  objects  compared  simply  by  imposing 
useful  conditions  on  the  function  d  involved: 

( X,  y)  fulfills  r  if  and  only  if  (  x,  y) 
gXxX  and  ^(x,y,d), 

where  the  required  properties  are  established  by 


the  condition  <£>.  If  the  conditionfs)  <I>(  x,  y,  d)  are 
fulfilled,  then  the  points  x,  y  are  matched: 

^  y, 

otherwise,  they  remain  unmatched.  Hence,  a  graph 
equivalent  to  relation  r  can  be  constructed.  For 
example,  in  [9],  the  recurrent  procedure  to  choose 
the  nearest  neighboi(s)  after  rejecting  all  previ¬ 
ously  chosen  objects  has  been  applied,  i.e.,  is 
such  that  for  each 

y'  e  X\  {Xj,...,  X,}:  d(x,-,  x,  +  i)  <  rf(x,,  y'). 

In  this  work,  the  rejected  set  {xj, . . . ,  x,}  of  ele¬ 
ments  has  been  restricted  to  one-element  set  {x,}, 
i.e.,  it  generally  holds  that  (x,  y)  fulfills  r  if  and 


Calculated  octupole  main-moment  components  (Debye-Ang**3)  for  cis  and  trans  conformations  of  platinum 


compounds. 


Molecule 

Cis  form 

Trans  form 

^XXX 

Oyyy 

Ozzz 

^xxx 

Oyyy 

Ozzz 

PtCIgCNHals 

1.9 

83.8 

0.0 

0.0 

0.0 

0.0 

PtFalNHgls 

0.0 

0.0 

-26.8 

0.0 

0.0 

0.0 

RCOHlgCNHgla 

0.0 

0.0 

-10.9 

0.0 

0.0 

0.0 

PtCIFlNHgla 

-87.7 

40.5 

1.8 

0.0 

0.0 

37.9 

PtlOHlFlNHgjg 

-21.5 

51.5 

0.0 

0.0 

0.0 

-26.7 

Pt(OH)CI(NH)2 

80.8 

13.4 

0.1 

0.0 

0.0 

67.3 

PtiSHlalNHgjg 

0.0 

0.0 

36.4 

0.0 

0.0 

1.4 

PtBrglNHgla 

-2.0 

60.3 

0.0 

0.0 

0.0 

0.0 

RBrCKNHglg 

-56.8 

-85.1 

1.9 

0.0 

0.0 

-55.4 
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Calculated  hexadecapole  main-moment  components  (Debye-Ang**3)  for  cis  and  trans  conformations  of  platinum 
compounds. 

Molecule 

Cis  form 

Trans  form 

u 

^XXXX 

u 

yyyy 

u 

^xxxx 

H 

^yyyy 

Wzzzz 

PtCl2(NH3)2 

-87.3 

-619.6 

-696.8 

-87.4 

-300.2 

-1127.4 

PtF2(NH3)2 

-56.0 

-336.8 

-347.4 

-56.7 

-292.4 

-398.8 

Pt{OH)2(NH3)2 

-65.0 

-360.3 

-342.8 

-65.1 

-317.7 

-254.5 

PtCIF(NH3)2 

-649.5 

-361.8 

-71.6 

-71.9 

-295.9 

-728.0 

Pt(OH)F(NH3)2 

-272.0 

-365.0 

-All 

-60.9 

-306.8 

-326.8 

Pt(OH)CI(NH)2 

-667.6 

-281.2 

-76.2 

-76.2 

-309.4 

-637.9 

Pt(SH)2{NH3)2 

-82.1 

-793.6 

-705.7 

-83.3 

-396.4 

-958.7 

PtBr2(NH3)2 

-101.3 

-660.2 

-899.0 

-101.4 

-335.1 

-1469.9 

PtBrCI(NH3)2 

-731.0 

-668.1 

-94.5 

-94.5 

-316.9 

-1316.4 

only  if  X,  y  e  X  and  for  each  y'  e  X  \ 
{x}:  d(x,  y)  <  d(x,  y').  This  all  enables  one  to  dis¬ 
tinguish  classes  with  growing  relevance,  generally 
regarding  the  fixed  condition  <i>.  If  ( x,  y)  g  r,  then 
(y,  x)  does  not  necessarily  fulfill  relation  r,  i.e.,  it 
takes  place  (  x,  y)-asymmetric  matching.  The  sym¬ 
metric  matching  shows  a  closer  proximity  between 
X  and  y. 

The  classes  of  resemblance  of  the  first  kind 
have,  in  general,  elements  (vertices)  keeping  only  a 
rough  kinship  between  each  other.  They  are  equiv¬ 
alence  classes  of  the  quotient  set  X/r~  where  r~ 
is  the  equivalence  closure  of  r.  The  equivalence 
classes  of  this  kind  can  include  the  asymmetric 
matching. 

The  classes  of  resemblance  of  the  second  kind, 

i.e.,  the  elements  of  the  quotient  set  X/(r  n  r~^)“ 
are  more  closely  related  regarding  their  similarity. 
Only  symmetric  matching  is  possible  between  the 
vertices  of  a  fixed  equivalence  class.  Even  more 
closely  related  are  vertices  in  the  classes  of  resem¬ 
blance  of  the  third  kind,  which  are  determined  as 
maximal  elements  with  the  following  property: 
a  X  a  Q  r.  Any  two  vertices  of  such  a  class  are 
related  by  the  symmetric  matching.  If  the  class 
is  of  the  third  kind,  than  there  exist  class  of  resem¬ 
blance  a2  of  the  second  kind  and  class  of  resem¬ 
blance  of  the  first  kind  such  that  ^  ^ 

The  classes  of  resemblance  of  all  kinds  have  been 
considered  to  reveal  the  possible  properties  re¬ 
sponsible  for  the  anticancer  activity. 

The  experimental  thesis  to  regard  trans  isomers 
considered  as  inactive  leads  to  the  utmost  differen¬ 
tiation  Fq  =  {B,  B'}  (utmost  mixing  =  {X})  such 


that 

X-BUB'  and  B  n  B'  =  0, 

where  X  is  the  set  of  compounds  and  B  and  B' 
are  sets  of  cis  and  trans  forms,  respectively.  If  F  is 
a  family  of  finite  sets  [e.g.,  X/r~  or  X/(r  n 
then  its  set  spectrum  is  the  union  of  sets  of  the 
form 

\  ([dom((fl  n  B)  X  (fl  n  B'))]^ 

U[imi(a  nB)  X(a  n  B'))f),  for  a^F, 

where  ^  a  X  a  for  short.  For  example,  /Fq  =  B^ 
U  B'^  and  fF^  =  (B  X  B')  U  (B'  X  B)  which  im¬ 
plies  including  only  (cis,  cis)  and  (trans,  trans) 
pairs  into  /Fq,  and  (cis,  trans)  and  (trans,  cis)  pairs 
of  compounds  into  fF^  Thus,  distance  [Eq.  (3)] 
(with  the  identity  taken  as  tolerance  t)  between  f 
and  Fq  (or  F|)  can  easily  be  determined.  This 
distance  turned  out  very  useful  in  selecting  proce¬ 
dures. 

To  summarize: 

1.  Any  chosen  sequence  of  parameters 

a^, . . . ,  a„  determined  for  each  compound 
of  X  does  allow  one  to  establish  the  rela¬ 
tion  r  and  using  it  to  distinguish  the  fam¬ 
ily  F  of  resemblance  classes  (of  the  fixed 
kind)  covering  X. 

2.  Pattern  Fq  consists  of  two  classes  cis  and 

trans  forms,  respectively. 

3.  Another  family  is  defined  as  F^  =  {X},  where 

no  discrimination  between  cis  and  trans 
conformations  occurs. 
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4.  d(  F,  Fq)  =  1  if  if  ^ 

includes  the  cis-trans  pair  of  compounds. 

5.  d(F,  Fj)  =  1  if  and  only  if  no  class  of  F 

includes  a  cis-trans  pair  of  compounds, 
i.e.,  the  full  separation  of  cis  and  trans 
forms  takes  place.  If  d(F,  Fq)  =  0,  then  F 

=  h- 

Hence,  if  parameters  aj, . . . ,  aj,  (for  i  indexing 
all  complexes  from  the  fixed  set)  induce  full  sepa¬ 
ration  or  close  to  1  distance  diF,  Fj),  then  they  can 
be  considered  as  accompanying  the  cis-trans  di¬ 
chotomy  and  therefore  having  potential  relevance 
to  the  activity  or  inactivity.  Otherwise,  such  a 
relevance  cannot  be  recognized. 

Additionally,  distance  diF,  Fq)  has  been  taken 
into  account  as  an  auxiliary.  It  estimates  the 
“crumbling"  of  X  if  diF,  Fj)  is  constant. 


Selecting  Anticancer  Properties  for 
Platinnm  Complexes 

The  series  of  quantities  computed  by  DFT  were 
leading  to  distinguishing  classes  of  resemblance  of 
the  second  kind.  They  were  determined  for  the 
family  X  of  platinum  (II)  complexes  with  the  SH", 
OH",  Br",  Cr,  F“  ligands. 

It  is  of  great  interest  to  know  the  behavior  of 
system  (1)  imder  quantities  of  various  kinds  and  to 
know  the  distances  under  the  pattern  partition  Fq. 
Table  V  includes  comparison  results  computed  for 
electron  affinity  (A),  electronegativity  (ELN),  nu¬ 
clear  repulsion  energy  (NUC),  total  energy  (HF), 
and  other  properties  as  stated  for  set  X  of  Ft 
complexes  where  X  =  {20H“,2C1'’,2F  ,(20H  )', 
(2C1  )',  (2F“)'}  [only  the  nonammine  ligands  are 
indicated  and  the  trans  forms  are  primed;  e.g., 
(2Cr)'  stands  for  traHS-PtCl2(NH3)2]. 


In  the  case  of  Table  V,  the  unmatched  status  of 
(1)  appears  only  when  the  dipole  moment  is  in¬ 
volved.  For  one  case  of  constants  of  rotation  the 
full  separation  [d(F,  Fj)  =  1]  has  been  reached. 

In  spite  of  alterations  of  system  of  compoxmds 
using  Mg(II),  Be(II),  and  Ca(II)  complexes  with  the 
same  ligands,  the  results  were  not  better  than 
those  presented  in  Table  V.  Hence,  the  results  of 
Table  V  indicate  the  dipole  moment  alone  as  in¬ 
ducing  cis-trans  separation.  After  turning  to  mul¬ 
tipoles,  the  marked  proximity  under  uttermost  dif¬ 
ferentiation  Fq  has  been  fixed  for  some  multipole 
moments  in  X  analyzed  under  their  activity- 
inactivity  relevance.  For  this  reason,  a  more  de¬ 
tailed  analysis  has  been  made  for  dipole, 
quadrupole,  octupole,  and  hexadecapole  moments. 
The  various  structural  modifications  of  the  Ft  com¬ 
plexes  can  throw  light  on  the  cancer-pertinent 
mechanisms  (cf.,  e.g.,  [10]).  Hence,  mixed  functions 
in  the  nonammine  ligands  have  been  introduced. 
Also,  the  number  of  compounds  in  the  systems 
analyzed  was  gradually  increased.  The  extended 
systems  of  Ft  complexes  (denoted  according  to  the 
convention  mentioned)  are  the  following: 

X*  =  {2SH-,  20H-,  2Br-,  2Cr,  2F-,  (2SH-)', 
(20H-)',  (2Br-)',  (2Cr)',  (2F-)'} 

X**  =  {2SH”,20H“,2Br-,2Cr,2F',0H“Cr, 
OH  F-,  Br-Cr,  cr  FA  (2SH-)', 
(20H')',  (2Br-)',  (2Cr)',  (2F-)', 
(OH-Cr)',  (OH-F-)',  (Br-Cl-)', 
(crF-)'}. 

Establishing  matching  (1)  and  computing  distances 
imder  utmost  differentiation  Fq  (mixing  Fj)  was 
the  task  potentially  revealing  invariant  kinship  in 
the  systems  gradually  extended.  The  searched 
family  of  parameters  have  to  show  minimal  dis- 


Differentiation  of  Pt  complexes  under  physicochemical  properties  (X) 


No. 

Properties 

cumulated 

System  (1) 
matched  (-I-) 
unmatched  (-) 

Distance  of  the 
pattern;  d(F,  Fq) 

Cumulation  with  dipole 
moment 

System 

(1) 

Distance  of  the 
pattern  d(F,  Fq) 

1 

A,  NUC,  HF 

+ 

1 

+ 

0.60000 

2 

ELN,  NUC,  HF 

+ 

1 

+ 

1 

3 

Const,  of  rotation 

+ 

1 

- 

0.44444 

4 

Polarizability 

+ 

1 

— 

0.90909 
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TABLE  VI 


Matching  and  distances  (x  10*)  for  multipole  moments  and  their  components.® 


Cumulation  with  dipole  moment 

X 

X* 

X** 

X 

x=. 

Parameters 

m 

Dist. 

m 

Dist. 

m 

Dist. 

m 

Dist. 

m 

Dist. 

m 

Dist. 

Dipole 

mom. 

- 

0 

100 

- 

83.333 

83.333 

- 

54.762 

97.479 

- 

0 

100 

- 

83.333 

83.333 

- 

54.762 

97.479 

dlF.Fg) 
cl(F,  F,) 

Quadrup. 

mom. 

+ 

100 

0 

100 

56.000 

+ 

97.283 

86.826 

+ 

83.333 

72.727 

+ 

100 

32.000 

+ 

97.727 

65.269 

d(F,Fi) 

d(F,Fi) 

Octupole 

mom. 

33.333 

100 

- 

36.000 

100 

- 

78.824 

95.960 

- 

22.222 

100 

- 

36.000 

100 

80.357 

96.823 

d(F,Fo) 

d(F,F,) 

Hexadecap 

mom. 

- 

86.364 

80.952 

- 

85.185 

93.103 

- 

48.765 

100 

55.556 

100 

- 

89.655 

85.714 

- 

81.482 

100 

d(F,Fo) 

d(F,Fi) 

HEXC 

a»»/i _ 

- 

44.444 

100 

- 

56.000 

100 

- 

61 .728 
100 

44.444 

100 

- 

68.000 

100 

- 

81 .482 
100 

d(F.  Fg) 
d{F,Fi) 

^ Where  m  shows  matching  status  of  system  (1)  as  in  Table  V, 


tance  d(F,  Fg)  and  maximal  value  of  d{F,  F^). 
Moreover,  these  distances  should  exhibit  a  stabil¬ 
ity  under  extensions.  Testing  the  systems  intro¬ 
duced,  the  following  results  have  been  obtained 
(Table  VI).  HEXC  is  a  choice  of  hexadecapole  mo¬ 
ment  components  consisting  of  all  of  them  after 
rejecting  five  components  (i.e.,  XXXX,  7.7.77, 
XXYY,  XXZZ,  YYZZ).  The  distances  for  this  assem¬ 
bly  are  the  best  results  attained  here  in  looking  for 
invariants.  Cumulating  HEXC  with  dipole  mo¬ 
ment  yields  the  similar  status  but  with  the  greater 
standard  deviation  (0.09  and  0.19,  respectively). 

In  an  alternative  computational  way,  other  than 
(2)  factors  of  similarity  and  dissimilarity  can  be 
used.  For  example, 

FD(fl,.,  aj)  =  ixittj  A  Qj), 

FS(fl,-,  Uj)  =  ix(ai  n  a j), 

where  a,,  Uj  are  some  neighborhoods  of  a'  and  a', 
respectively,  and  ju,  is  a  measure  (e.g.,  the  measure 
of  Lebesgue).  Using  these  factors  the  hexadecapole 
distances  have  been  determined  for  X**  (for  the 
resemblance  classes  of  the  first  kind); 

d(F,  fg)  =  0.97521,  d(F,  F,)  =  0.49438. 

For  HEXC  assembly  the  results  are  as  follows: 
d(F',  Fg)  =  0.78652,  d(F',  F,)  =  0.92. 


Thus,  distance  d(F',  Fj)  is  close  to  1,  showing  the 
small  fraction  of  mixed  classes  in  F'  (under  HEXC). 
The  existence  of  invariants  and  alternative,  result¬ 
ing  in  principle  in  the  same  algorithms,  does 
prompt  a  good  reason  for  further  testing  assem¬ 
blies  of  parameters  pertinent  to  the  activity  consid¬ 
ered. 


Discussion  and  Conclusions 

From  the  results  of  our  calculations,  we  con¬ 
clude  that  only  some  molecular  properties  enable 
one  to  discriminate  an  active  and  nonactive  sys¬ 
tem.  These  conditions  are  fulfilled  solely  by  higher 
than  dipole,  multipole  moments.  It  seems  that  the 
best  are  components  of  the  hexadecapole  mo¬ 
ments.  However,  it  seems  strange  that  typical 
molecular  parameters  like  ionization  energy,  elec¬ 
tron  affinity,  total  energy,  and  correlation  correc¬ 
tions  are  unable  to  distinguish  between  those  two 
groups  of  molecular  systems. 

It  should  be  stressed  that  parameters  are  dis¬ 
criminated  on  the  grounds  of  the  distances  be¬ 
tween  the  Induced  collections  of  compounds 
(resemblance  classes)  and  Fg,  Fj  families.  These 
estimations  depend  of  the  fixed  pattern.  In  this 
work,  the  trans  and  cis  conformations  have  been 
taken  as  such  a  pattern  in  view  of  their  well-known 
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experimentally  established  activity-inactivity  rela¬ 
tionship. 

Since  dipole  moment  induces  the  ideal  partition 
F  =  Fq  (cf..  Table  VI)  for  system  X,  a  pitfall  arises 
when  making  conclusions  on  the  pertinence  of  this 
moment  to  the  cis-trans  discrimination.  To  im¬ 
prove  and  test  the  conclusion,  Pt(SH)2(NH3)2  com¬ 
plex  has  been  included  (X*),  and  thereafter  the 
partition  F  induced  contained  mixed  classes  [i.e., 
rf(F,  F^)  <  1].  Thus,  the  negative  result  of  the  test 
has  been  obtained.  It  is  so  since  the  complex  just 
considered  has  relatively  much  smaller  value  of 
the  dipole  moment  than  cis  conformers  (of  X*), 
and  therefore  this  dipole  moment  shows  a  better 
kinship  to  the  trans  forms.  Generally,  the  parame¬ 
ters,  being  out  of  the  pertinence  desired,  disturb 
the  cis-trans  dichotomy  affecting  induced  parti¬ 
tions  when  the  database  is  enriched.  On  the  con¬ 
trary,  the  properties  looked  for  should  not  induce 
cis-trans  mixing  in  the  resemblance  classes  after 
substitutions  of  new  ligands. 

This  is  a  hard  postulate  since  even  the  existence 
of  such  properties  within  a  finite  family  of  the 
considered  parameters  is  difficult  to  be  a  priori 
assumed.  Therefore,  the  discriminated  family  of 
multipole  components  HEXC,  inducing  the  parti¬ 
tion  desired  in  the  gradually  enriched  systems, 
shows  the  method  of  the  resemblance  analysis, 
enabling  one  to  find  quite  new  criterion  for  the 
separation  of  cis  and  trans  forms. 

In  view  of  the  example  considered,  it  can  be 
noted  that  the  cancer  diseases  are  one  of  the  most 
serious  challenges  for  modern  science,  particularly, 
the  "reading"  of  cancer  cells  by  drugs  (cf.,  e.g.,  the 
concepts  of  lectins  [11]  or  long-range  carcinogenic 
effects  [12])  illustrate  the  complexity  of  this  prob¬ 
lem.  The  analysis  performed  here  allows  one  to 
frame  the  hypothesis  on  the  specific  significance  of 
some  molecular  parameters  in  the  anticancer  ac¬ 
tion.  It  is  easily  seen  that  the  electric  molecular 


parameters  and  their  symmetry  in  platinum  com¬ 
plexes  should  play  an  essential  role  in  the  mecha¬ 
nisms  of  their  activity.  However,  their  character, 
which  is  necessary  for  the  desired  effect,  remains 
unknown  or  is  difficult  to  determine.  On  the  other 
hand,  the  general  idea  to  pursue  some  multipoles 
does  not  seem  groundless  and  can  probably  be 
helpful  in  predicting  ways  to  stimulate  anticancer 
inhibition. 

The  extensions  or  modifications  of  the  method 
illustrated  can  be  of  great  interest  in  looking  for 
efficient  ways  in  drug  design  and  selecting  useful 
properties. 

Note  added  in  proof:  Annotation  performed  within 
C.E.C.  COST  D3  Action. 
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ABSTRACT _ 

Starting  from  a  formally  exact  density-functional  representation  of  the  frequency- 
dependent  linear  density  response  and  exploiting  the  fact  that  the  latter  has  poles  at  the 
true  excitation  energies,  we  develop  a  density-functional  method  for  the  calculation  of 
excitation  energies.  Simple  additive  corrections  to  the  Kohn-Sham  single-particle 
transition  energies  are  derived  whose  actual  computation  only  requires  the  ordinary 
static  Kohn-Sham  orbitals  and  the  corresponding  eigenvalues.  Numerical  results  are 
presented  for  spin-singlet  and  triplet  energies.  ©  1996  John  Wiley  &  Sons,  Inc. 


Introduction 

In  recent  years,  density-fimctional  theory  (DFT) 
[1-5]  has  enjoyed  increasing  popularity  in  the 
field  of  quantum  chemistry.  The  computational 
simplicity  of  the  Kohn-Sham  scheme  and  the 
availability  of  reliable  exchange-correlation  (xc) 
functionals  allow  accurate  calculations  even  for 
very  large  systems  [6,7].  The  traditional  density- 
functional  formalism  of  Hohenberg,  Kohn,  and 
Sham  [1,2]  is  mainly  a  ground-state  theory.  In 
view  of  the  advantages  of  DFT,  a  generalization 
allowing  the  calculation  of  excitation  spectra  is 
highly  desirable,  especially  since  most  experimen- 

*To  whom  correspondence  should  be  addressed. 


tal  data  are  obtained  by  spectroscopical  tech¬ 
niques,  One  might  be  tempted  to  interpret  the 
Kohn-Sham  single-particle  energy  differences  as 
excitation  energies.  This  interpretation,  however, 
has  no  rigorous  basis  and  in  practice  the 
Kohn-Sham  orbital  energy  differences  deviate  by 
10-50%  from  the  true  excitation  energies.  Another 
deficiency  of  the  Kohn-Sham  orbital  energy  spec¬ 
trum  is  the  fact  that  it  shows  no  multiplet  split¬ 
tings. 

To  deal  with  excited  states,  several  extensions 
of  ground-state  DFT  have  been  proposed  which 
rely  either  on  the  Rayleigh-Ritz  principle  for  the 
lowest  eigenstate  of  each  symmetry  class  [8-10]  or 
on  a  variational  principle  for  ensembles  [11-16]. 
A  practical  difficulty  of  these  approaches  is  that 
the  respective  xc  energy  functionals  depend  on  the 
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symmetry  labels  of  the  state  considered  or  on 
the  particular  ensemble.  Until  today,  little  is  known 
about  the  explicit  form  of  these  excited-state  xc 
functionals  although  significant  progress  has  re¬ 
cently  been  made  in  the  ensemble  formalism 
[17-22]. 

In  this  article  we  are  going  to  elaborate  on  a 
different  approach  [23-25]  to  the  calculation  of 
excitation  energies  which  is  based  on  time-depen¬ 
dent  density-functional  theory  (TDDFT)  [26].  In 
the  linear  response  regime  TDDFT  has  been  ap¬ 
plied  rather  successfully  to  the  photo-response  of 
atoms  and  molecules  [27-31].  Dynamical  hyperpo¬ 
larizabilities  have  also  been  calculated  [32]  and 
first  applications  beyond  the  perturbative  regime 
have  emerged  [33-35].  For  a  recent  review  of 
TDDFT  and  its  applications,  see  [36]. 

To  calculate  excitation  energies  from  TDDFT  we 
use  the  fact  that  the  frequency-dependent  linear 
density  response  of  a  finite  system  exhibits  dis¬ 
crete  poles  at  the  true  (correlated)  excitation  ener¬ 
gies  of  the  unperturbed  system.  It  can  be  shown 
[23]  that  the  exact  linear  density  response  of  an 
interacting  system  can  be  written  as  the  linear 
density  response  of  a  noninteracting  (Kohn-Sham) 
system  to  an  effective  perturbation.  The  basic  idea 
is  to  use  this  formally  exact  representation  of  the 
frequency-dependent  linear  density  response  to 
calculate  the  shifts  of  the  Kohn-Sham  orbital  en¬ 
ergy  differences  (which  are  the  poles  of  the 
Kohn-Sham  response  function)  toward  the  true 
excitation  energies  in  a  systematic  way.  This  pro¬ 
gram  has  recently  been  carried  out  [23]  rather 
successfully  to  the  calculation  of  singlet  excitation 
energies.  The  purpose  of  the  present  article  is  to 
extend  the  scheme  to  incorporate  spin  degrees  of 
freedom.  This  will  allow  us  to  calculate  the  excita¬ 
tion  energies  of  spin  multiplets. 


Method 

The  frequency-dependent  linear  density  re¬ 
sponse  w)  of  interacting  electrons  with  spin 

cr  to  a  frequency-dependent  perturbation  co) 

is  usually  written  in  terms  of  the  full  response 
function  Xav 

n^„ir,  w)  =  Y,  j  d’^yXaM^y^  «)•  (D 

V 

Alternatively,  the  exact  frequency-dependent 
spin-density  response  can  be  calculated  (self-con¬ 


sistently)  from 

ni^(r,  w)  =  E/ d^yXsav^^>y'  ")  (2) 

V 


as  the  response  of  noninteracting  (Kohn-Sham) 
particles  subject  to  the  effective  potential 

=  (O) 

+  Lj  _  y,|  +/xc..'(y^y';  «)J 

X  (o).  (3) 


Here  the  spin-dependent  exchange-correlation  ker¬ 
nel  is  given  by  the  Fourier  transform  of 


Sfy.(r',  t') 


(4) 


with  respect  to  (f  —  t').  The  proof  that  Eqs.  (2)-(4) 
represent  the  exact  linear  spin-density  response 
of  the  interacting  system  is  based  on  the  spin- 
dependent  generalization  [37]  of  TDDFT.  The  argu¬ 
ment  follows  step  by  step  the  proof  for  the 
spin-independent  case  given  in  Ref.  [23]  and  will 
therefore  not  be  repeated  here.  The  response  func¬ 
tion  Xs  of  fho  Kohn-Sham  system  can  be  ex¬ 
pressed  in  terms  of  the  unperturbed  static 
Kohn-Sham  spin  orbitals  as 

ca) 

i.k 


where  are  the  Fermi  occupation  factors  (1 

or  0).  The  summations  in  (5)  run  over  all  unper¬ 
turbed  Kohn-Sham  orbitals,  including  the  contin¬ 
uum  states.  Note  that  the  Kohn-Sham  response 
function  (5)  is  diagonal  in  the  spin  variable  and 
exhibits  poles  at  frequencies  cor¬ 

responding  to  Kohn-Sham  single-particle  excita¬ 
tions  within  the  same  spin  space.  In  order  to  calcu¬ 
late  the  shifts  toward  the  true  excitation  energies 
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Q,  of  the  interacting  system,  we  rewrite  Eq.  (2)  as  and  set 


£  f  d^y'\^8„,,d{.x  -  y')  -  E/ d^yxs..M>y> 
1 


X 


TT  +/xc..'(y^y';") 


«i^,(y',  (a) 


.ly-y' 

’^1  y;  ")^w(y/  ")•  (6) 


In  general,  the  true  excitation  energies  O  are  not 
identical  with  the  Kohn-Sham  excitation  energies 
0}-^^.  Therefore,  the  right-hand  side  of  Eq.  (6)  re¬ 
mains  finite  for  o)  ^  fl.  Since,  on  the  other  hand, 
the  exact  spin-density  response  has  poles  at 
the  true  excitation  energies  11,  the  integral  opera¬ 
tor  acting  on  on  the  left-hand  side  of  Eq.  (6) 
cannot  be  invertible  for  w  ^  II.  This  is  easily  seen 
by  reductio  ad  absurdum:  Assuming  the  inverse 
operator  existed,  its  action  on  both  sides  of  Eq.  (6) 
would  result  in  a  finite  right-hand  side  for  o)  — >  11 
in  contradiction  to  the  fact  that  remaining  on 
the  left-hand  side,  has  a  pole  at  =  H. 

Consequently,  the  true  excitation  energies  H  are 
characterized  as  those  frequencies  where  the  inte¬ 
gral  operator  acting  on  the  spin-density  vector  in 
Eq.  (6)  is  not  invertible  or,  equivalently,  where  the 
eigenvalues  of  this  operator  vanish.  If  the  delta 
function  in  Eq.  (6)  is  integrated  out,  the  true  excita¬ 
tion  energies  H  are  those  frequencies  where  the 
eigenvalues  A(co)  of 


X 


ly-y 

A(&>)-y^(r,  co) 


77  +/xc..'(y/y'; 


y^’iy',  w) 


(7) 


satisfy 

A(I1)  =  1.  (8) 

This  condition  rigorously  determines  the  true  exci¬ 
tation  spectrum  of  the  interacting  system  consid¬ 
ered. 

For  notational  convenience,  we  introduce  dou¬ 
ble  indices  q  ^  k)  so  that  de¬ 

notes  the  excitation  energy  of  the  single-particle 
transition  (;cr  ka).  Moreover,  we  define 


■■=Lf  d^y’J  d^y^^.^y^* 

v' 


X 


ly 


TT  +/xc<.x(y/y'; 


X.(y',  o)). 

(11) 


With  these  definitions,  Eq.  (7)  takes  the  form 

(^)  =  A(o,)r,,(r,  o,).  (12) 

^  (0~  +11] 

Solving  this  equation  for  y^(r,  w)  and  reinserting 
the  result  on  the  right-hand  side  of  Eq.  (11)  leads 
to 


^  o- 


-h  il] 


(13) 


Here  we  have  introduced  the  matrix  elements 

^qfrq'cr'^  ^) 

=  01, V'/ d^r’<!)*^(r) 


X 


r  —  r 


^  +/xc<.<r'(r^r';a)) 


‘J>,v(r')- 


(14) 


Note  that  the  summation  in  Eq.  (13)  extends 
over  all  single-particle  transitions  q'a'  between 
occupied  and  unoccupied  Kohn-Sham  (KS)  or¬ 
bitals,  including  the  continuum  states.  Up  to  this 
point,  no  approximations  have  been  made.  In  or¬ 
der  to  actually  calculate  A(w),  the  eigenvalue  prob¬ 
lem  (13)  has  to  be  truncated  in  one  way  or  another. 
One  possibility  is  to  expand  all  quantities  in  Eq. 
(13)  about  one  particular  KS  orbital  energy  differ¬ 
ence  o)^^: 


^,,7(«)  =  ^,cr(«pT)  + 


d(o 


(co  -  Wp^)  ••• 
(15) 


^,<x(r)  ==  (9) 

(10) 


A(w)  = 


0)  - 


(16) 
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The  matrix  elements  with  can  be  ex¬ 

panded  into  a  Taylor  series: 


",>,r  -  %V'  +  iV 


d 


+  — 
do) 


^  +  iv 


Wp^)  +  ••• 

(17) 


whereas,  for  the  matrix  elements  have 

to  be  represented  by  a  Laurent  series: 


M 


qaq'iT 


(w) 


w  -  +  iV 


^^qcriftr  '(  ^pr'^ 

ft)  —  ftjp^  +  irj 


+ 


(18) 


Inserting  Eqs.  (15)-(18)  into  Eq.  (13)  the  coeffi¬ 
cients  A  and  B  are  readily  identified.  If  the  pole 
co^,.  is  nondegenerate,  one  finds 

^(ft)p^)  =  Mp^p^(ft)p^)  (19) 

and 


1 

^  ^  ^pTq'tr'^^pr^  ^q'a'pT^^pT^ 

q'(T'*pT  ^pr  ^q'tr’ 

(20) 


do) 


The  corresponding  eigenvector  (in  lowest  order)  is 
given  by 

1 

ipr  =  ^M,j,rpr((^prUpr  (21) 

with  ( pr)  fixed.  The  number  is  free  and  can  be 
chosen  to  properly  normalize  the  vector 
If  the  pole  0)^^  is  K-fold  degenerate, 

=  ^P.r,  =  (22) 

the  lowest-order  coefficient  A  in  Eq.  (16)  is  deter¬ 
mined  by  the  following  matrix  equation: 

i:  Mp,,,,,.,/ft)„)^;;;)  =  7i„(ft)o)^<;;^ 
k  =  l 

i  =  1,...,K.  (23) 


In  general,  one  obtains  K  different  eigenvalues 
A|, . . . ,  A^.  Then  the  remaining  components  of  the 
corresponding  eigenvectors  can  be  calculated 
from 


tin)  _ 
'=>qa 


— 1 —  E 
71„(«o)  ti 


M. 


‘i<^Pk'^k 


(24) 


once  the  eigenvalue  problem  (23)  has  been  solved. 
Assuming  that  the  true  excitation  energy  fl  is  not 
too  far  away  from  Wq,  it  will  be  sufficient  to 
consider  only  the  lowest-order  terms  of  the  above 
Laurent  expansions.  In  particular,  we  set 


A„(<^)  ~ 


7l„(t<)o) 

(O  —  (Oq 


(25) 


The  condition  (8)  and  its  complex  conjugate,  A*(n) 
=  1,  then  lead,  in  this  order,  to  a  compact  expres¬ 
sion  for  the  excitation  energies: 


—  (Oq  +c^A„(a)o).  (26) 


This  is  the  central  result  of  our  analysis.  Equation 
(26)  shows  that  a  single  KS  pole  can  lead  to  several 
many-body  excitation  energies.  The  corresponding 
oscillator  strengths  can  be  obtained  [25]  from  the 
eigenvectors  and  the  KS  oscillator  strengths. 


Application  to  Closed-Shell  Systems 

In  this  section,  we  will  apply  the  above  formal¬ 
ism  to  systems  with  spin-unpolarized  ground 
states.  For  these  systems,  the  Kohn-Sham  orbital 
eigenvalues  are  degenerate  with  respect  to  the 
spin  variable,  which  implies  a  lack  of  spin-multi- 
plet  structure  in  the  Kohn-Sham  spectrum.  In  the 
following,  we  demonstrate  how  this  is  restored  by 
the  lowest-order  corrections  (26).  Assuming  that 
there  are  no  further  degeneracies  besides  the  spin 
degeneracy,  Eq.  (23)  reduces  to  the  following  (2  X 
2)  eigenvalue  problem: 

E  ^p^;„x'("o)^po-'("o)  =  ^^,,^("o)-  (27) 

cr'=  U  i 


For  spin-saturated  systems,  one  finds  t  r  t  ^ 

I  ^  and  ^  ^  ^  ^  ^ ,  so  that  the  eigen- 
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values  of  Eq.  (27)  are  given  by 

^1,2  ^  T  p  T  i  ^  pi' 

By  Eq.  (26),  the  resulting  excitation  energies  are 

H]  =  (Oq  +  c^{  Mp  T  p  T  ^p  T  p  i  ^  (29) 

H2  ~  ^0  ^p  t  p  T  ~  ^p  T  p  i  ^  *  (30) 

Inserting  the  explicit  form  of  the  matrix  elements 
(14),  one  finds 


=  Wq  +  d^rf 
■  1 

X  1 - n +/xc(i',r';  Wo) 


r  —  r 


$p(r')  (31) 


fij  =  Wq  +  2^  j  d^rj  d^r  '<t>*  (r) 
X  /x^G^,(r,r';  Wo)<I>  (r') 


(32) 


where 


/,,(r,  r';  w) 


j  E  /xc..'(r,r';w)  (33) 

a,  a'  =  i  ,  i 


and 

G^^(r,r';  (o) 

=  A  L  (34) 

4/^0  cr,  o-'=  T,  i 

/Xq  denotes  the  Bohr  magneton  and  =  1  for 
cr  =  (t'  and  =  -1  for  o-  ^  cr^  For  simplicity, 
we  have  dropped  the  spin  index  of  in  Eqs. 
(31)  and  (32).  [This  is  possible  only  if  the  unper¬ 
turbed  KS  ground-state  determinant  is  spin- 
saturated  since,  in  this  case,  ^  (r)  =  cfyj  ^  (r)  for  all 

;■] 

The  quantity  gives  rise  to  exchange  and 
correlation  effects  in  the  Kohn-Sham  equation  for 
the  linear  response  of  the  frequency-dependent 
magnetization  density  m(r,  w)  =  w)  — 

n^(r,  (o)]  [37]. 

The  fact  that  the  magnetization  density  re¬ 
sponse  naturally  involves  spin-flip  processes  sug¬ 
gests  that  02  represents  the  spin-triplet  excitation 
energies  of  many-electron  systems  with  spin- 
saturated  ground  states.  The  corresponding  spin- 
singlet  excitation  energies,  on  the  other  hand,  are 
given  by  This  assignment  will  be  given  further 
support  by  the  numerical  results  presented  in  the 
next  section. 


Results  and  Discussion 

Apart  from  the  trimcation  of  the  expansions 
(15)-(18),  two  further  approximations  are  neces¬ 
sary  in  the  described  TDDFT  calculation  of  excita¬ 
tion  energies: 

1.  The  frequency-dependent  xc  kernels  and 
G^c  have  to  be  approximated. 

2.  The  static  Kohn-Sham  orbitals  entering  Eqs. 
(31)  and  (32)  through  Eq.  (9)  have  to  be 
calculated  with  an  approximate  (static)  po¬ 
tential 

As  an  application  of  the  method,  we  consider 
the  lowest  excitation  energies  of  the  alkaline  earths 
and  the  elements  of  the  zinc  series.  Here,  in  addi¬ 
tion  to  the  degeneracy  with  respect  to  the  spin 
index,  the  s  ^  p  transitions  under  consideration 
are  threefold  degenerate  in  the  magnetic  quantum 
number  m  of  the  'Tinal"  state.  Flence,  we  have  a 
six-fold  degenerate  pole  and  Eq.  (23)  is  a  (6  X  6) 
eigenvalue  problem.  In  our  case,  however,  the 
matrix  in  Eq.  (23)  consists  of  three  identi¬ 

cal  (2  X  2j  blocks,  leading  only  to  two  distinct 
corrections,  independent  of  m,  as  it  should  be.  The 
resulting  triplet  states  are  still  degenerate  with 
respect  to  the  total  angular  momentum  quantum 
number  because  the  spin-orbit  coupling  is  not  ac¬ 
counted  for  in  the  nonrelativistic  Kohn-Sham 
equations. 

In  Tables  I-IV  we  show  excitation  energies  cal¬ 
culated  from  Eqs.  (31)  and  (32).  The  calculation  of 
Table  I  employs  the  ordinary  local  density  approx¬ 
imation  (LDA)  for  (using  the  parametrization 
of  Vosko,  Wilk,  and  Nusair  [38])  and  the  so-called 
adiabatic  LDA  (ALDA)  for  the  xc  kernels: 

yALDA[^J(r,r';  cd) 

=  S(r  -  r')^[  pexc°‘"(P/^)]|(p»«oW,^=o) 

(35) 


and  [39] 


](r,r';w) 


8(r-  r')- 


p,ow(r) 

x[ex^r(p.O]L=„„(rU=o)- 


(36) 
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TABLE  I _ 

Lowest  S  P  excitation  energies  of 
various  atoms.® 


TABLE  II _ 

Lowest  S  ^  P  excitation  energies  of 
various  atoms.® 


Atom 

state 

f^exp 

h(Ascf) 

< 

□ 

_IO 

3 

Be 

3p 

0.388 

0.200 

0.399 

0.331 

0.257 

3^ 

3p 

0.200 

0.200 

0.192 

0.181 

0.257 

Mg 

3p 

o  0 

0.319 

0.199 

0.351 

0.299 

0.249 

3p 

0.199 

0.200 

0.209 

0.206 

0.249 

Ca 

So 

0.216 

0.138 

0.263 

0.211 

0.176 

3^ 

3p 

“2 

0.139 

0.140 

0.145 

0.144 

0.176 

Zn 

3p 

So 

0.426 

0.294 

0.477 

0.403 

0.352 

3^ 

Op 

0.296 

0.300 

0.314 

0.316 

0.352 

Sr 

3P 

So 

0.198 

0.130 

0.241 

0.193 

0.163 

3^^ 

3p 

^2 

0.132 

0.136 

0.136 

0.135 

0.163 

Cd 

3P 

So 

0.398 

0.274 

0.427 

0.346 

0.303 

3p 

^2 

0.279 

0.290 

0.269 

0.272 

0.303 

Atom 

state 

’^exp 

,,OEP 

0)q 

Be 

■<p 

3p 

So 

0.388 

0.200 

0.392 

0.331 

0.259 

Op 

•2 

0.200 

0.200 

0.138 

0.181 

0.259 

Mg 

So 

0.319 

0.199 

0.327 

0.299 

0.234 

Op 

^2 

0.199 

0.200 

0.151 

0.206 

0.234 

Ca 

3p 

So 

0.216 

0.138 

0.234 

0.211 

0.157 

Op 

^2 

0.139 

0.140 

0.090 

0.144 

0,157 

Zn 

3p 

So 

0.426 

0.294 

0.422 

0.403 

0.314 

Op 

^2 

0.296 

0.300 

0.250 

0.316 

0.314 

Sr 

Op 

So 

0.198 

0.130 

0.210 

0.193 

0.141 

"P2 

0.132 

0.136 

0.081 

0.135 

0.141 

Cd 

3p 

So 

0.398 

0.274 

0.376 

0.346 

0.269 

Op 

“2 

0.279 

0.290 

0.211 

0.272 

0.269 

^The  experimental  values  (first  column)  [44]  are  compared 
with  results  calculated  from  Eq.  (31)  for  the  singlet  and  from 
Eq.  (32)  for  the  triplet  (second  column)  and  with  ordinary 
LDA-Ascf  values  (third  column).  The  LDA  was  employed  for 
and  the  ALDA  for  the  xc  kernels.  The  corresponding 
Kohn  -Sham  orbital  energy  differences  coq  are  shown  in  the 
last  column  (all  values  in  rydbergs). 


For  the  xc  energy  per  particle,  of  the 

homogeneous  electron  gas  and  its  second  deriva¬ 
tive  with  respect  to  the  relative  spin  polarization 
^  (n^  -  n  ^)/n,  also  called  the  xc  contribution 

to  the  ''spin-stiffness  coefficient/'  we  use  the 
parametrization  of  Ref.  [38]. 

The  calculation  of  Table  II  uses  the  x~only  opti¬ 
mized  effective  potential  (OEP)  for  in  the 
approximation  of  Krieger,  Li,  and  lafrate  [40]  and 
for  the  kernel 


/™[no](Lr';^) 


|r  -  r'|?7o(r)«|,(r') 

(37) 


which  is  based  on  the  time-dependent  OEP  method 
in  the  x-only  approximation  [23,41]. 


^The  experimental  values  (first  column)  [44]  are  compared 
with  results  calculated  from  Eq.  (31)  for  the  singlet  and  from 
Eq.  (32)  for  the  triplet  (second  column)  and  with  ordinary 
LDA-Aqcf  values  (third  column).  The  x-only  optimized  effec¬ 
tive  potential  was  used  for  and  the  approximate  TDOEP 
kernels  (37)  and  (38).  The  corresponding  Kohn -Sham  or¬ 
bital  energy  differences  coq  are  shown  in  the  last  column  (all 
values  in  rydbergs). 


The  mean  absolute  deviation  from  the  experi¬ 
mental  singlet  spectrum  for  the  excitation  energies 
calculated  from  Eq.  (31)  is  36  mryd  for  the  pure 
LDA  calculation  of  Table  I  and  11  mryd  for  the 
x-only  OEP  calculation  of  Table  11. 

In  Table  III  we  show  the  results  of  a  "hybrid" 
calculation  (OEP  -h  ALDA)  which  employs  the  x- 
only  OEP  for  and  the  ALDA  (including  local 
exchange  and  correlation  terms)  for  the  xc  kernels. 
The  results  of  this  calculation  are  of  similar  quality 
as  the  x-only  OEP  calculation  of  Table  II.  The  mean 
absolute  deviation  from  experiment  is  15  mryd. 
Thus,  for  the  singlet  spectrum,  the  OEP  values  are 
clearly  superior  to  the  LDA  results  and  are  also 
better  than  the  usual  LDA-As^f  values,  which  de¬ 
viate  on  the  average  by  27  mryd  from  the  experi- 
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TABLE  III _ 

Lowest  S  ^  P  excitation  energies  of 
various  atoms.® 


Atom 

State 

f^exp 

qOEP  +  ALDA 

^  OEP 

(Oq 

Be 

0.388 

0.200 

0.398 

0.331 

0.259 

"P2 

0.200 

0.200 

0.196 

0.181 

0.259 

Mg 

Ip 

^Po 

0.319 

0.199 

0.329 

0.299 

0.234 

3p 

0.199 

0.200 

0.196 

0.206 

0.234 

Ca 

^Po 

0.216 

0.138 

0.236 

0.211 

0.157 

3p 

'2 

0.139 

0.140 

0.129 

0.144 

0.157 

Zn 

^P\ 

0.426 

0.294 

0.417 

0.403 

0.314 

3p 

“2 

0.296 

0.300 

0.280 

0.316 

0.314 

Sr 

■'p 

^P\ 

0.198 

0.130 

0.211 

0.193 

0.141 

3p 

^2 

0.132 

0.136 

0.117 

0.135 

0.141 

Cd 

^P 

^Po 

0.398 

0.274 

0.370 

0.346 

0.269 

^P2 

0.279 

0.290 

0.239 

0.272 

0.269 

^The  experimental  values  (first  column)  [44]  are  compared 
with  results  calculated  from  Eq.  (31)  for  the  singlet  and  from 
Eq.  (32)  for  the  triplet  (second  column)  and  with  ordinary 
LDA-Ascf  values  (third  column).  The  x-only  optimized  effec¬ 
tive  potential  was  used  for  and  the  ALDA  (including 
exchange  and  correlation  terms)  for  the  xc  kernels.  The 
corresponding  Kohn-Sham  orbital  energy  differences  wq 
are  shown  in  the  last  column  (all  values  in  rydbergs). 

mental  data.  This  can  be  traced  back  to  the  sensi¬ 
tivity  of  the  unoccupied  orbitals  and  their  energy 
eigenvalues  to  the  asymptotic  behavior  of  the 
Kohn-Sham  potential.  One  major  reason  for  the 
superiority  of  the  optimized  effective  potential  is 
the  fact  that  it  is  self-interaction  free  and  therefore 
has  the  correct  -  1/r  tail  (while  the  LDA  potential 
falls  off  exponentially).  We  note  in  passing  that 
since  the  optimized  effective  potential  decreases 
correctly  for  all  orbitals,  it  is  also  superior  to  the 
Hartree-Fock  (HF)  potential,  which  is  self-interac¬ 
tion  free  only  for  the  occupied  orbitals  but  not  for 
the  unoccupied  ones.  Consequently,  FiF  orbital  en¬ 
ergy  differences  are  typically  too  large. 

The  triplet  spectrum  obtained  from  Eq.  (32) 
within  the  LDA  deviates  on  the  average  from 


TABLE  IV  _ _ 

Singlet-triplet  separations  for  the  lowest  S  ^  P 
transitions  of  various  atoms  calculated  with 
different  approximations  of  the  xc-functionals 
(see  text)  in  comparison  with  experimental  values 
[44]  (EXP).® 


Atom 

J 

EXP 

LDA 

OEP 

OEP  +  ALDA 

^SCF 

Be 

0 

0.188 

1 

0.188 

0.207 

0.253 

0.202 

0.150 

2 

0.188 

Mg 

0 

0.120 

1 

0.120 

0.142 

0.175 

0.133 

0.094 

2 

0.120 

Ca 

0 

0.077 

1 

0.077 

0.118 

0.144 

0.107 

0.067 

2 

0.076 

Zn 

0 

0.132 

1 

0.130 

0.164 

0.172 

0.136 

0.087 

2 

0.126 

Sr 

0 

0.067 

1 

0.066 

0.105 

0.129 

0.094 

0.058 

2 

0.062 

Cd 

0 

0.124 

1 

0.119 

0.158 

0.165 

0.131 

0.074 

2 

0.108 

8 

0.033 

0.057 

0.018 

0.028 

is  the  total  angular  momentum  quantum  number  of  the 
triplet  state;  d  denotes  the  mean  absolute  deviation  of  the 
calculated  values  from  the  experimental  ones  (all  values  in 
rydbergs). 

experiment  by  9  mryd,  which  is  slightly  better 
than  the  mean  absolute  deviation  of  10  mryd  of 
the  LDA-Ascf  spectrum.  However,  in  spite  of  the 
fact  that  the  OEP  provides  self-interaction  free 
orbitals,  it  reproduces  the  triplet  spectrum  less 
accurately:  the  average  deviation  from  experiment 
is  55  mryd,  as  can  be  seen  from  Table  IT  This  is  a 
consequence  of  the  fact  that  we  have  employed  the 
time-dependent  x-only  approximation  for  the  ker- 
nel  /xco-o-'*  This  approximation  neglects  the  correla¬ 
tion  between  electrons  of  antiparallel  spin  and 
leads  to  an  xc  kernel  which  is  diagonal  in  spin 
space.  Accordingly,  from  Eqs.  (33)  and  (34),  we 
have  within  the  x-only  TDOEP 

GTDOEP[^](^^r';  cu)  =  r';  cu).  (38) 

We  expect  that  the  OEP  triplet  energies  can  be 
improved  considerably  by  adding  appropriate  cor¬ 
relation  terms  [42,  43]  to  the  TDOEP  xc  kernels. 
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This  expectation  is  backed  by  the  observation  that 
when  the  x-only  OEP  approximation  of  is 
combined  with  a  local  density  prescription  (ALDA) 
for  exchange  and  correlation  in  the  xc  kernels 
and  the  triplet  spectrum  is  reproduced  fairly 
well  by  Eq.  (32l  This  can  be  seen  from  Table  III, 
where  the  calculated  triplet  spectrum  deviates  on 
the  average  by  15  mryd  from  the  experimentally 
measured  one. 

To  complete  the  discussion  of  the  various  ap¬ 
proximate  xc  functionals  employed,  we  list  the 
singlet-triplet  separations  for  the  lowest  S  ^  P 
transitions  of  the  alkaline  earth  elements  and  the 
zinc  series  in  Table  IV.  Obviously,  by  calculating 
^1  “  ^2  from  Eqs.  (31)  and  (32),  the  Kohn-Sham 
excitation  energies  o)q  and  hence  all  the  errors 
contained  in  the  approximate  Kohn-Sham  eigen¬ 
values  cancel  out.  Again,  the  mean  absolute  devia¬ 
tion  8  from  experiment  clearly  shows  that  singlet- 
triplet  splittings  are  well  reproduced  only  if  corre¬ 
lation  is  included  in  the  xc  kernels.  The  best  results 
are  obtained  when  the  ground-state  potential  is 
self-interaction  free  (OEP  H-  ALDA). 


Summary  and  Conclusion 

We  have  developed  a  formally  exact  scheme  of 
calculating  excitation  energies  from  TDDFT.  Within 
that  scheme,  we  arrived  at  compact  approximate 
expressions  for  the  excitation  energies  which  per¬ 
formed  quite  well  in  practical  calculations  of  sin¬ 
glet  and  triplet  excitations.  The  calculation  in¬ 
volves  only  known  ground-state  quantities,  i.e., 
the  ordinary  static  Kohn-Sham  orbitals  and  the 
corresponding  Kohn-Sham  eigenvalues.  Thus  the 
scheme  described  here  requires  only  one  self- 
consistent  Kohn-Sham  calculation,  whereas  the 
so-called  Ag^^p  procedure  involves  linear  combina¬ 
tions  of  two  or  more  self-consistent  total  energies 
[9]. 

In  spite  of  the  fact  that  we  focused  our  attention 
to  the  lowest  transition  energies  of  closed-shell 
systems,  and  spin-multiplets,  the  method  is  also 
capable  of  dealing  with  spatial  multiplets,  higher 
excitations,  and  open-shell  systems.  Work  along 
these  lines  is  in  progress. 
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ABSTRACT  _ _ _ _ 

In  density  functional  theory  (DFT)  the  exchange-correlation  energy  has  to  be 
approximated.  One  of  the  most  widely  used  approximations  to  the  correlation  component, 
the  Lee- Yang-Parr  (LYP)  functional  does  not  obey  nonuniform  scaling  requirements.  We 
propose  a  modifying  function  that  makes  this  functional  satisfy  nonuniform  scaling 
constraints.  As  a  result,  two  modified  LYP  functionals  are  suggested.  One  of  them, 
optimized  in  a  preliminary  way,  exhibits  some  properties  better  than  the  original  LYP 
functional.  This  implies  that  nonuniform  scaling  requirements  convey  very  important 
information  and  all  approximate  functionals  in  DFT  should  be  made  to  satisfy  these 
requirements.  Our  modifying  factor  could  be  used  to  improve  the  nonuniform  scaling 
properties  not  only  of  but  also  of  most  functionals  which  do  not  obey  these 

conditions.  ©  1996  John  Wiley  &  Sons,  Inc. 


Introduction 

Density  functional  theory  (DFT)  has  provided 
a  machinery  for  effective  quantum  calcu¬ 
lations  replacing  the  traditional  wave  function 
approach  by  the  electron  density  description  of 
the  system  under  investigation.  This  methodology 


is  attractive  for  large  systems  because  the  den¬ 
sity  depends  only  upon  three  spacial  coordinates, 
regardless  of  the  size  of  the  system  [1-4].  For 
computational  purposes  the  exact  exchange- 
correlation  (xc)  functional,  n],  must  be  approx¬ 
imated,  and  conveniently  E^^[n]  is  separated  in 
the  following  way,  =  E^[n]  +  Ef_n]  (Ref. 

[5]),  In  order  to  arrive  at  the  very  best  approxima¬ 
tions  to  E^^[n],  one  needs  knowledge  of  conditions 
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reflecting  the  properties  of  the  exact  exchange-cor¬ 
relation  energy. 

We  will  concentrate  our  attention  on  the  exact 
correlation  component,  Ejn],  only  and  more 
specifically  on  the  nonuniform  scaling  properties 
of  the  well-known  Lee-Yang-Parr  (LYP)  approxi¬ 
mation  to  E^[n].  We  propose  a  modifying  factor 
that  makes  this  approximation  obey  nonuniform 
scaling  constraints.  Our  factor  is  not  sensitive  to 
uniform  scaling  but  furnishes  the  correct  nonuni¬ 
form  scaling  behavior  and  could  be  used  to  im¬ 
prove  most  functionals  that  do  not  obey  the 
nonuniform  scaling  requirements.  Uniform  and 
nonuniform  scaling,  when  used  with  other  con¬ 
straints,  turns  out  to  be  very  powerful  for  con¬ 
structing  new  approximations  or  improving  upon 
the  properties  of  already  existing  ones. 

Define  three  types  of  scaled  densities: 


n^(x,  y,  z)  =  Xy,  \z),  (1) 

n^ix,  y,  z)  =  XniXx,  y,  z),  (2) 

n^^ix,  y,  z)  =  X^niXx,  \y,  z).  (3) 


form: 


f  i-a)n 

^  J  1+ 


X 


1  -f  b 


17|Vnp  ^  21V^n 
72  +  72^5/3 


(9) 


By  integrating  by  parts,  one  eliminates  the  term 
containing  the  Laplacian  of  density  [9],  and  ex¬ 
pression  (9)  simplifies  to 


r  i-a)n 

^  1  -f 


xll  +  b 


17|Vn|^ 

72n®/3 


Q(«) 


Equation  (1)  represents  a  uniformly  scaled  density, 
and  (2)  and  (3)  nonuniformly  scaled  densities.  With 
these  definitions,  the  following  equalities  and  in¬ 
equalities  have  been  established  for  the  exact  cor¬ 
relation  energy  [6, 7]. 


with 


Q(n) 


dn 
1  -f 


(11) 


lim  E,[«J  >  -oo,  (4) 

A->3C 

lim  <  0,  (5) 

A->  ^ 

Ec[^a]  ~  A  ^  0,  (6) 

lim  £,[«*]  =0,  (7) 

A^  oc 

lim  A-i£Jn,7]  =  0.  (8) 

A^  0 


Modirications  for  LYP  Functional 

Next,  we  will  focus  our  attention  on  one  of  the 
most  commonly  used  approximations  to  the  exact 
correlation  energy,  the  functional  of  LYP  [8].  This 
functional  stems  from  the  Colle-Salvetti  formula 
for  the  correlation  energy  [9].  The  LYP  functional 
for  spin-nonpolarized  system  has  the  following 


The  values  of  the  constants  for  both  versions  are 
a  =  0.049,  h  =  0.132,  d  =  0.349,  c  =  0.2533,  and 
~  (^X377^)^/^.  The  spin-dependent  forms  of 
(9)  and  (10)  are  given  in  Refs.  [8]  and  [10],  respec- 
tively. 

The  LYP  approximation  satisfies  conditions  (4) 
through  (6)  but  does  not  obey  equalities  (7)  and 
(8).  Instead, 


lim  £7'’[  ^  oc.,  (12) 

A-^  X 

lim  A^1elyp[„7]  _oc..  (13) 

A  — >  0 

In  order  to  improve  the  scaling  properties  of  the 
LYP  functional,  we  suggest  the  following  modify- 
ing  function  that  simply  multiplies  the  integrands 
in  Eqs.  (9)  and  (10).  Our  factor  reads 


fin)  =  exp 


-K- 


iVnl' 


nir) 


8/3 


\ 

/ 


(14) 
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and  makes  the  LYP  formula  obey  constraints  (7) 
and  (8).  Note  that  the  proposed  function  f(n)  is 
not  sensitive  to  uniform  scaling,  but  it  is  submis¬ 
sive  to  nommiform  scaling  as  required  by  (7)  and 
(8).  In  expression  (14),  K  is  left  as  a  positive 
adjustable  parameter;  K  should  be  a  positive  con¬ 
stant  in  order  to  prevent  any  divergence  of  the 
integrand.  In  the  limiting  case,  K  =  0,  f(n)  be¬ 
comes  1,  and  the  original  LYP  functional  is  recov¬ 
ered.  Corresponding  to  expressions  (9)  and  (10), 
the  following  modifications  of  LYP  result: 


7LYPM1 


[n] 


=  ji-a)n- 


xll  +  b 


-K[|V«|V«(08/3) 

1  -i 

17|Vn|^  21V^m 

“  72n8/3  72 


>  dr 
(15) 


£LYPM2[„] 

=  j(-a)n  — 


xll  +  b 


-K[|V«|Vn(r)8/3] 

17|Vn|^ 


72„8/3 


Q(«) 


'  dr. 

(16) 


To  obtain  Eq.  (16)  from  Eq.  (10),  we  multiplied  the 
integrand  by  f(n)  after  Eq.  (9)  had  been  integrated 
by  parts. 

To  demonstrate  how  f(n)  improves  the  scaling 
properties  of  n],  we  will  apply  condition  (7) 

to  formula  (16)  to  arrive  at 


where 


lim  Q(nO  =  T7 

00 

lim  ")■''''  =  1 

CO 

have  been  used.  [A  corresponding  result  applies 
with  expression  (15).] 

Observe  that  the  exponential  function  in  (17) 
approaches  zero  much  faster  than  the  function  in 
the  curly  brackets  and  the  limit  becomes  zero. 
Following  the  same  pattern,  it  can  be  easily  shown 
that  both  modifications  satisfy  condition  (8)  as 
well. 

Spin-dependent  forms  of  our  suggested  func¬ 
tionals  could  be  obtained  by  multiplying  spin- 
polarized  forms  in  Refs.  [8]  and  [10].  Since  f(n) 
involves  only  the  total  density  of  the  system. 

In  order  to  find  the  values  of  the  adjustable 
parameters,  we  have  varied,  thus  far,  only  one  of 
the  parameters  in  the  original  LYP  functional,  de¬ 
noted  by  a.  We  have  looked  for  a  combination 
between  a  and  K,  for  each  modification,  that  gives 
as  good  results  for  atomic  systems  as  «].  For 
the  first  modification,  Eq.  (15),  we  have  obtained  a 
viable  set  of  <?  =  0.052  and  K  =  0.002.  However, 
we  have  not  yet  been  able  to  find  a  good  combina¬ 
tion  of  a  and  K  for  because  it  is  ex¬ 

tremely  sensitive  to  the  value  of  K.  Consequently, 
one  should  try  to  optimize  the  second  modification 
more  thoroughly  by  varying  all  constants  in  it. 


,  i-a)nix,  y,  z) 

llm  [ „;  1  =  lim  / 


X  exp 


-K 


dn{x,y,z)  dnix,y,z)  dn{x,y,z) 
- +  A -  +  A 


dX 


dy 


dZ 


X 


1  +  b 


Cr 


y,  2) 


dnix,y,z)  dn(.x,y,z)  dn{x,y,z) 
A^ - -  -i  A - +  A - 


dx 


dy 


dZ 


24 


y,  z) 


dxdydz  =  0, 

(17) 
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TABLE  I 


Correlation  energies  /  negative  values^ 

Atom  /  ion 

Exp.^^ 

K  =  0.0l 
a  =  0.049 

K  =  0.002  / 
a  =  0.052 

He 

0.042 

0.0436221 

0.0417904 

Li  + 

0.0435 

0.047370 

0.045582 

0.0443 

0.0488703 

0.0473012 

Be 

0.094 

0.0942098 

0.0903355 

B-" 

0.111 

0.106204 

0.102123 

Ne 

0.387 

0.382075 

0.386307 

Ar 

0.79 

0.748027 

0.763763 

Kr 

1.74216 

1 .80303 

^Densities  are  obtained  from  Clementi-Roetti  orbitals. 

^The  experimental  values  are  taken  from  the  original  paper 
by  Lee,  Yang,  and  Parr  [8]. 


Numerical  Results 

We  present  some  numerical  results  showing 
some  improvement  of  the  properties  of  the 
gLYPMij-^]  y^j-sus  First  we  calculated  the 

correlation  energies  of  some  atomic  systems.  The 
third  column  in  Table  I,  where  K  =  0.0  and  a  = 
0.049,  gives  the  correlation  energies  calculated  with 
the  original  LYP  approximation  using  our  integra¬ 
tion  program.  The  fourth  column  gives  the  correla¬ 
tion  energies  computed  by  means  of  formula  (15) 
with  K  =  0.002/a  =  0.052. 

Next,  we  will  utilize  some  recently  derived  rela¬ 
tionships  from  DFT  perturbation  theory  in  order  to 
compare  the  properties  of  and 

It  has  been  shown  by  Gorling  and  Levy  [11]  that 


with 


N 

AV  =Vee- 

1  =  1 


r  n(r-)  dr 

/  5 - T  +  'Oj,([n];ri) 

J  r.  -  r 


,  (19) 


where  is  that  noninteracting  Kohn-Sham 
ground-state  determinant  which  yields  n,  and 
is  the  sum  of  the  orbital  energies  of  The  deter¬ 
minants  are  the  excited  states  of  H^^[n]  with 
the  respective  energies  E/s,  where  is  the 

Kohn-Sham  noninteracting  Hamiltonian,  Ep^  is 
the  leading  term  in  the  following  expansion  for  the 
correlation  energy  of  the  scaled  density: 

E^irij^]  =  E^ln]  +  A”^EfHn] 

+  A-2e(4)[„]  +  ...  (20) 


This  fact  implies  that  lim^_^  EJ^J  ^  Ej^j,  or 
that  E^[n]  is  relatively  insensitive  to  scaling,  ex¬ 
cept  when  n  is  nearly  uniform  and  integrates  to  a 
large  number  of  electrons. 

Table  II  demonstrates  that  for  most  of  the  con¬ 
sidered  systems  our  modified  functional  exhibits 
smaller  sensitivity  upon  imiform  scaling  than  the 
original  LYP  functional. 

Finally,  we  turn  our  attention  to  two  exact 
bounds  on  the  high-density  limit  of  the  correlation 
energy  and  its  functional  derivative  obtained  by 
Ivanov,  Lopez-Boada,  Gorling,  and  Levy  [12].  The 
conditions  are  held  for  a  two-electron  hydrogen¬ 
like  density  of  the  form 


n(r)  = 


77 


(21) 


=  -  E  K‘I>,iAV|<D,>lV(E,  -  Ej,  (18) 
/=! 


or,  n(r)  =  2|ls|^,  where  Is  =  (Z^^^/ /7r)c  in 
atomic  units.  From  the  recent  work  of  Chakravorty 


TABLE  11 


Correlation  energies  /  negative  values® 

Atom  /  ion 

£<2)LYP[n] 

fLYPM1[„] 

f(2)LYPM1[„] 

He 

0.0436221 

0.0578889 

0.0417904 

0.0519484 

Li  + 

0.0473702 

0.0577638 

0.045582 

0.0513296 

Be^-" 

0.0488703 

0.0570887 

0.0473012 

0.0510797 

Be 

0.0942098 

0.157741 

0.0903355 

0.142842 

B  + 

0.106204 

0.157977 

0.102123 

0.143258 

Ne 

0.382075 

0.537544 

0.386307 

0.525429 

Ar 

0.748027 

1 .03754 

0.763763 

1.03109 

Kr 

1.74216 

2.2082 

1 .80303 

2.25591 

^Densities  are  obtaine(d  from  Clementi-Roetti  orbitals:  is  the  result  of  taking  the  limit  of 

£^2)lypmi|-^j  jg  result  of  taking  the  limit  of  as  A  goes  to  infinity. 

as  A  goes  to  Infinity; 
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and  Davidson  [13]  and  from  our  independent  work, 
we  ascertain  that 

£«[„]  =  -0.046663  (ina.u.).  (22) 

Moreover,  we  [12]  have  obtained 

/ufK[n],r)n(r)  dr  =  0.12868  (ina.u.),  (23) 

where  nfX[n],r)  is  the  ftmctional  derivative  of 
E^^*[  n]  with  respect  to  n(r). 

Condition  (22)  Condition  (23) 


Exact  value  —0.046663  0.128680 

Ec^^[n]  -0.056481  -0.108922 

£Lypm2[„]  -0.050375  -0.098721 

It  is  seen  that  the  modified  functional  gives  results 
closer  to  theoretical  values  than  the  original  LYP 
approximation.  Both  approximations  have  wrong 
signs  when  condition  (23)  is  applied.  This  is  not 
surprising  since  the  form  of  the  original  El'^^[n] 
was  not  designed  with  the  functional  derivative  in 
mind.  (It  can  be  demonstrated  that  the  inclusion  of 
fin)  cannot  change  the  sign  of 

j^(2)LYPMi([  ft],i)niT)dr 

for  any  K  >  0.)  Finally,  since  completion  of  this 
work,  the  refined  GGA  correlation  energy  func¬ 
tional  of  Perdew,  Burke,  and  Emzerhof  [14]  has 
been  tested  with  respect  to  condition  (22)  and 
found  to  give  E^^^[n]  =  —0.0482  a.u. 


Concluding  Comments 

Following  nonuniform  scaling  requirements,  we 
have  proposed  a  modifying  factor  that  makes 
E,''^'’[  n]  obey  constraints  (8)  and  (9).  Even  though 
we  optimized  one  of  our  suggested  functionals, 
£Lypm2[jj]^  in  a  preliminary  way,  the  modified 
version  has  certain  better  properties  than  the  origi¬ 
nal  LYP  formula.  It  is  anticipated,  in  the  future, 
that  both  modified  functionals,  Eqs.  (15)  and  (16), 
will  be  optimized  with  respect  to  all  parameters,  to 


closely  reproduce  the  improved  “exact”  correla¬ 
tion  energies  by  Chakra vorty  et  al.  [15].  This  im¬ 
plies  that  nommiform  scaling  conveys  important 
information  for  approximating  Efn],  Moreover, 
our  modifying  function  could  be  used  to  improve 
nonuniform  scaling  properties  of  any  functional 
which  does  not  obey  constraints  (7)  and  (8),  pro¬ 
vided  upon  nonuniform  scaling  fin)  dominates 
the  violating  term  as  far  as  the  asymptotic  behav¬ 
ior  is  concerned  and  approaches  the  correct  limits 
faster. 
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ABSTRACT _ 

The  TT-allyl  insertion  mechanism  of  the  1,4-cis  polymerization  of  butadiene  by  means  of 
allyl-nickel  catalysts  has  been  studied  theoretically  by  density  functional  theory  (DFT)  for 
the  ligand  free  cationic  butenyl  nickel(II)  complexes  [Ni(C3H5XC4H5)]^,  I, 
[Ni(C3H5)(C4H6)(C2H4)]^,  II,  and  [Ni(C7H4i)(C4H6)]'^,  III.  DFT  energy  profiles  have 
been  determined  for  the  insertion  of  s-c/s-butadiene  into  the  anti- 17  ^-butenyl  nickel(II) 
bond  in  the  supine  and  prone  orientation  of  the  reacting  ligands.  One  of  the  objectives  of 
the  study  was  to  support  the  proposed  7r-allyl  insertion  mechanism.  With  increasing  size 
of  the  model  compound  there  is  better  agreement  between  the  calculated  reaction  and 
activation  energies  in  relation  to  the  known  experimental  behavior.  Among  the  different 
arrangements  (anti-  or  syn-butenyl)  of  the  allylic  end  of  the  growing  polymer  chain  and 
the  relative  orientation  (supine  or  prone)  of  cisoid  butadiene  at  the  catalytic  site,  the 
anti-butenyl /prone  reacts  in  compound  III  with  a  small  activation  barrier  (8.2  kcal/mol) 
to  the  product  ( —  5.7  kcal /mol).  The  stabilizing  effect  of  the  coordination  of  an  additional 
double  bond  of  the  growing  polymer  chain  for  the  correct  description  of  the  geometrical 
and  energetical  aspects  of  the  insertion  reaction  was  demonstrated  by  II.  The  results 
support  the  s-c/s-butadiene  insertion  into  the  anti- 17  ^-butenyl  nickel(II)  bond  according  to 
the  proposed  7r-allyl  mechanism.  ©  1996  John  Wiley  &  Sons,  Inc. 


Introduction 

The  stereospecific  polymerization  of  butadiene 
can  be  directed  to  the  desired  1,4-czs-poly- 
butadiene.  This  is  an  important  process  in  the 
chemical  industry  for  producing  so-called  syn¬ 
thetic  rubber.  Allyl-nickel  compounds  and  allyl 
*To  whom  correspondence  should  be  addressed. 


compounds  of  other  transition  metal  ions  catalyze 
the  C — C  bond  formation  with  high  cis  selectivity. 
The  reaction  mechanism  is  much  more  compli¬ 
cated  than  it  is  for  mono-olefins  (Ziegler-Natta 
polymerization).  From  the  experimental  observa¬ 
tions  at  the  moment  there  are  two  mechanisms  in 
the  very  complex  reaction  scheme  proposed.  The 
allylic  end  of  the  growing  polymer  chain  interacts 
with  a  butadiene  monomer,  both  coordinated  in 
the  sphere  of  a  catalytic  center.  The  cr-allyl  mecha- 
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nism  first  suggested  by  Cossee  and  Arlman  [1] 
describes  the  C — C  bond  formation  by  a  nucle¬ 
ophilic  attack  of  the  r^kcoordinated  butenyl  end  to 
the  coordinated  butadiene.  The  second  mechanism 
is  the  TT-allyl  insertion  introduced  by  Taube  et  al 
[2]. 

The  aim  of  this  research  is  to  investigate  the 
mechanism  of  the  ir-allyl  insertion  by  reliable  ab 
initio  calculations  of  those  allyl-/butadiene-nickel 
species  (models  of  key  structures)  in  their  equilib¬ 
rium  structures  of  educts  and  products.  The  sec¬ 
ond  task  is  to  locate  and  characterize  the  transition 
states  for  those  reactions. 

In  a  previous  study  [3]  we  reported  the  reliabil¬ 
ity  of  density  functional  theory  (DFT)  and 
Hartree-Fock  (HF)  calculations  in  describing  the 
structure  of  such  experimentally  very  well  known 
compounds  like  l)zs(Tjkallyl)nickel  and  syn- 
crotyl-cyclooctadiene-nickel,  and  intermediates  of 
(7jkfl//y/Xbutadiene)nickel  [4].  In  the  HF  calcula¬ 
tions  different  basis  sets  of  double-zeta  (DZ)  and 
triple-zeta  (TZ)  quality  and  pseudopotentials  were 
used,  but  the  results  at  the  HF-level  are  not  as 
good  as  the  corresponding  DFT  calculations.  As  an 
important  conclusion  for  the  calculation  of 
organo-nickel  compounds,  the  reliability  of  the 
DFT  approach  with  nonlocal  gradient  corrections 
is  of  the  same  good  quality  as  post-HF  results,  but 
not  so  time  consuming. 

The  structural  models  for  the  calculations  had 
been  taken  from  the  mechanistic  investigations  and 
x-ray  studies  of  Taube  et  al.  [5]  of  the  cationic 
Ci2-allyl  nickel(II)  complex  [  NffCij  Hj,)] 
(B(C^H;^XCF^)2)h].  They  have  proved  conclusively 
that  the  cationic  polybutadienyl  butadiene 
nickel(II)  complex  is  the  real 

catalyst  for  the  1,4-cis  polymerization  of  butadiene. 
A  generally  accepted  mechanistic  scheme  for  the 
catalysis  of  the  cis-  and  trans-1,4  polymerization  of 
butadiene  by  allyl  nickelfll)  complexes  was  pro¬ 
posed.  The  different  reactivity  of  the  anti-  and  the 
syn-butenyl  nickel(II)  complex  with  respect  to  the 
mode  of  butadiene  coordination  should  show 
whether  the  insertion  process  is  a  cis  or  trans 
insertion  step  of  the  catalytic  reaction. 


Computational  Details 

The  approximate  density  functional  calculations 
reported  here  were  performed  with  the  DGauss 
program  and  the  UniChem  interface  [6]  on  a  CRAY 
supercomputer. 


All  calculations  were  carried  out  using  the  local 
spin  density  approximation  (LDA)  with  Slater's 
exchange  functional  [7(a),(b)]  and  the  Vosko- 
Wilk-Nusair  parameterization  on  the  homoge¬ 
neous  electron  gas  for  correlation  [7(c)],  aug¬ 
mented  by  gradient  corrections  to  the  exchange- 
correlation  (xc)  potential.  Gradient  corrections  for 
exchange  were  based  on  the  functional  of  Becke 
[7(d)]  and  for  correlation  were  based  on  Perdew 
[7(e)],  Lee-Yang-Parr  [7(f)],  and  Stoll- 
Pavlidou-Preuss  [7(g)].  The  Becke-Perdew  func¬ 
tionals  were  either  added  variationally  within  the 
self-consistent  field  (SCF)  procedure  (LDA/BP- 
NLSCF)  or  added  perturbatively  to  the  LDA  ener¬ 
gies  (LDA/BP).  The  other  correlation  functionals 
were  added  together  with  the  Becke  functional 
either  variationally  (LDA/BLYP-NLSCF)  or  as 
perturbation  (LDA/BSPP). 

All  electron  Gaussian  orbital  basis  sets  were 
used  for  all  atoms.  The  calculations  were  per¬ 
formed  with  two  different  basis  sets  from  the 
DGauss  basis-set  library,  denoted  DZVP  and 
TZVP.  Our  standard  DZVP  basis  is  a  15s /9p /5d 
set  contracted  to  (63321/531/41)  for  nickel  [8(a)],  a 
9s/5p/ld  set  contracted  to  (621/41/1)  for  carbon 
[8(b)],  and  a  4s/lp  set  contracted  to  (31/1)  for 
hydrogen  [8(b)].  A  corresponding  auxiliary  basis 
set  was  used  for  the  fitting  of  the  charge  density 
[8(b)].  The  TZVP  basis  set  consisted  of  orbital 
function  with  the  contraction  patterns 
(63321/5211/41),  (7111/411/1),  and  (311/1)  for 
Ni  [8(a)],  C  [8(b)],  and  H  [8(b)],  respectively. 

The  numerical  integration  was  done  using  a 
standard  fine-meshed  grid  of  adequate  quality. 
The  energetic  effect  of  tighter  meshed  grids  was 
estimated  to  be  less  than  0.2  kcal/mol,  as  indi¬ 
cated  by  single-point  calculations  with  more  than 
twice  as  many  integration  points  as  in  our  stan¬ 
dard  fine  grid. 

The  geometry  optimization  and  the  saddle  point 
search  were  performed  at  the  LDA  and  the 
LDA/BP-NLSCF  level  of  approximation  by  utiliz¬ 
ing  analytical  gradients/hessians  according  to 
standard  algorithms.  No  symmetry  constraints 
were  imposed  in  any  optimization.  The  stationary 
points  were  exactly  identified  by  the  curvature  of 
the  potential  surface  at  these  points  corresponding 
to  the  eigenvalues  of  the  analytically  calculated 
hessians. 

Single-point  calculations  at  the  optimized  sta¬ 
tionary  points  at  the  LSDA/BP-NLSCF  level  with 
our  standard  basis  set  were  performed  using  the 
LDA/BLYP-NLSCF  functional  as  well  as  the  larger 
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TZVP  basis  in  connection  with  the  LDA/BP- 
NLSCF  functional.  Also  the  LDA/BP  and 
LDA/BSPP  functionals  were  utilized  in  single¬ 
point  calculations  at  the  optimized  stationary 
points  at  the  LDA  level  with  our  standard  basis 
set. 


Results  and  Discussion 

We  shall  here  present  the  results  from  our  DFT 
calculations  on  the  cationic  ,  I, 

[iC,Hs)iC2H,)NiiC,He)]\  II.  and 
[{CyH-^^)NiiC^H()V ,  III,  complexes  (see  Fig.  1). 


These  were  chosen  as  models  of  the  real  active 
catalyst,  the  cationic  polybutadienyl  butadiene 
nickel(II)  complex  [R-C3HiNi(C^H^)r ■  The  struc¬ 
tures  of  the  educts  were  derived  from  preliminary 
investigations  [9]  of  the  rj^-butenyl  coordination  in 
anti-  and  syn-conformation  of  the  corresponding 
cationic  C^-allyl  nickeKlI)  complexes.  In  each  of 
the  above  model  systems  two  arrangements  were 
investigated;  the  "supine"  (the  reacting  r/Vh^' 
butadiene  and  77^-butenyl  group  oriented  back  to 
back)  and  the  "prone"  (reacting  ijVh^-butadiene 
and  Tj^-butenyl  group  oriented  back  to  front). 

In  the  smallest  KC^Hs)Ni(CiH,)V  complex,  I, 
no  distinction  between  anti  and  syn  coordination 


^alculltid  energy  profile  for  the  ^-allyl  insertion  in  the  cationic  [(C3H,)Ni(C,He)]^  system  (kcal/mo^ 
?htstru?ure^  V'butadiene  (a),  ^^-butadiene  (b)  educt,  transition  state  (c),  and  product  (d)  m  supme  (1) 

and  prone  (2)  orientation.  - - - - - - - - 


Systenn 

LDA/BP- 

NLSCF^’ 

LDA*' 

LDA/BLYP- 

NLSCF'= 

LDA/BP** 

LDA/BSPP** 

la 

1b 

1c 

Id 

2a 

2b 

2c 

2d 

0.0 

17.7 

24.0 

7.0 

-6.9 

18.6 

18.6 

-0.3 

0.0 

32.2 

24.7 

11.4 

-2.2 

32.8 

14.3 

4.0 

0.0 

12.8 

26.3 

7.8 

-6.6 

13.6 

21.6 

0.7 

0.0 

23.3 

27.0 

12.1 

-2.1 

23.5 

18.3 

4.6 

0.0 

10.1 

30.0 

12.5 

-1.6 

9.7 

23.9 

5.0 

^The  supine  T/'^-butaidiene  eijuct  (1a)  was  chosen  as  arbitrary  reference  point. 
Complete  optimized  structures. 

^Single-point  calculations  using  LDA/BP-NLSCF  optimized  structures. 
^Single-point  calculations  using  LDA  optimized  structures. 
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of  the  17^-butenyl  group  is  possible.  Also,  we  do 
not  expect  large  differences  between  the  educts, 
products,  and  transition  states  in  supine  and  prone 
arrangement,  respectively,  considering  the  lack  of 
coordinative  saturation  of  nickel  in  this  simple 
model. 

The  more  realistic  KC3HsXC2H^)NKC^H(,)V 
complex,  II,  contains  an  additional  ethylene  ligand 
in  an  orientation  above  the  nickel  that  takes  into 
account  the  specific  orientation  of  an  7?^-butenyl 
group  in  the  anti  conformation.  Here  it  is  impor¬ 
tant  to  see  how  the  ethylene  ligand  stabilizes  the 
transition  states  for  the  supine  and  prone  configu¬ 
ration  in  a  different  way. 

In  the  biggest  model  [(CyH^^)Ni(C^H^)]\  III, 
under  investigation  here,  the  stabilizing  ethylene 
itself  is  part  of  the  growing  polymer  chain. 

An  indication  for  the  reliability  of  the  standard 
DZVP  basis  is  given  by  single-point  calculations  at 
the  LDA/BP-NLSCF  optimized  structures  using 
the  larger  TZVP  basis.  The  relative  energetic 
changes  are  sufficiently  small,  about  1  kcal/mol 
for  the  supine  and  up  to  4  kcal/mol  for  the  prone 
arrangement. 

C  VTIOMC  [(C3H,,)Ni(C4H«)r  COMPLEX 

Single-point  LDA/BLYP-NLSCF  calculations 
give  quite  similar  energetics  as  for  the  LDA/BP- 
NLSCF  level.  The  insertion  reaction  is  about  8 
kcal/ mol  endothermic,  and  the  activation  energies 


were  calculated  to  be  of  the  same  order  for  supine 
and  prone  orientations  (26  and  28  kcal/mol,  re¬ 
spectively).  Only  the  rj^-butadiene  educts  (lb,  2b) 
are  about  5  kcal/mol  more  stabilized  relative  to 
the  T7^-butadiene  educts  (la,  2a),  distinctly  below 
the  transition  states  (see  Table  I).  Although  struc¬ 
tural  relaxation  was  not  included,  we  would  not 
expect  any  dramatic  changes,  if  a  complete 
LDA/BLYP-NLSCF  optimization  was  performed. 

At  the  LDA  level  there  is  a  clear  kinetic  and 
thermodynamic  preference  for  the  prone  versus 
the  supine  orientation.  The  supine  reaction  is  about 
twice  as  endothermic  as  the  prone  reaction,  11 
kcal/ mol  and  6  kcal/ mol,  respectively.  The  activa¬ 
tion  energies  were  calculated  to  be  25  kcal/mol 
(supine)  and  16  kcal/mol  (prone),  and  the  r?^- 
butadiene  educts  (lb,  2b)  are  clearly  above  the 
transition  states.  These  considerable  differences 
with  respect  to  the  gradient-corrected  functionals 
can  hardly  be  accepted  from  a  chemical  point  of 
view  for  the  similar  reaction  models  lacking  any 
steric  hindrance.  We  guess  the  inadequate  xc-treat- 
ment  at  the  LDA  level  is  responsible  for  this  fail¬ 
ure.  Perturbatively  adding  the  BP  corrections  at 
the  LDA-optimized  stationary  points  (LDA/BP) 
cannot  overcome  this  behavior.  The  insertion  turns 
out  to  be  12  or  7  kcal/mol  endothermic  and  activa¬ 
tion  energies  of  27  or  20  kcal/mol  for  the  supine  or 
prone  orientation.  Also  the  LDA/BSPP  level  was 
unable  to  give  a  suitable  energetic  description.  So 
we  cannot  agree  with  the  commonly  used  mixed 


TABLE  II 


Th^ef rf the  -iT-allyl  insertion  in  the  cationic  [(CgHsKCaHjNKC^He)]"^  system  (kcal/mol).^ 

S  nrnnftfr  ^  -butadiene  (a),  in  -butadiene  (b)  educt,  transition  state  (c),  and  product  (d)  in  supine 
v3;  and  prone  (4)  orientation.  ^ 


System 

LDA/BP- 

NLSCF*^ 

LDA*" 

LDA/BLYP- 

NLSCF^ 

LDA/BP'* 

LDA/BSPP'* 

3a 

3b 

3c 

3d 

4a 

4b 

4c 

4d 

0.0 

3.6 

26.4 

-8.5 

2.8 

5.3 

6.7 

-14.4 

0.0 

12.6 

26.9 
-8.4 

3.9 

11.8 

6.9 
-7.4 

0.0 

-1.3 

26.8 

-7.7 

2.9 

0.6 

9.0 

-13.4 

0.0 

5.4 

27.8 

-6.7 

4.0 

4.6 

9.9 

-8.2 

0.0 

-6.6 

28.6 

-4.7 

3.7 

-7.3 

14.2 

-10.1 

^The  supine  r^'^-butacdiene  educt  (3a)  was  chosen  as  arbitrary  reference  point 
Complete  optimizecJ  structures. 

^Single-point  calculations  using  LDA/BP-NLSCF  optimized  structures. 
Single-point  calculations  using  LDA  optimized  structures. 
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computational  treatment,  to  perform  optimizations 
at  the  LDA  level  followed  by  a  perturbative  addi¬ 
tion  of  gradient  corrections. 

CATIONIC  [(CsH5)(C2H4)Ni(C4He)]+ 

COMPLEX 

The  optimized  structures  of  the  17  ^-butadiene 
educt  (3a,  4a),  the  17  ^-butadiene  educt  (3b,  4b),  the 
transition  state  (3c,  4c),  and  the  product  (3d,  4d) 
for  the  supine/prone  arrangement  are  compared 
in  Table  II,  respectively.  All  optimized  structures 
have  no  symmetry. 

The  most  remarkable  differences  between  the 
supine  and  prone  coordination  mode,  influenced 
by  the  ethylene  ligand,  occurred  for  the  transition 
states  (3c,  4c).  They  differ  in  the  two  possible 
geometrical  arrangements,  observed  for  the  d^ 
transition  metal  configuration.  The  supine  transi¬ 
tion  state  3c  can  be  described  as  tetragonal  pyra¬ 
midal  and  the  prone  system  4c  as  trigonal  bipyra- 
midal.  The  transition  states  were  located  at  a  car¬ 
bon  to  carbon  distance  of  2.155  and  2.079  A,  for  3c, 
4c,  respectively,  for  the  performed  a-bond.  These 
are  noticeably  larger  than  for  Ic,  2c,  indicating  an 
earlier  occurrence  at  the  potential  surfaces  than  for 
Ic,  2c.  By  comparing  the  transition  states  for  supine 
(3c)  and  prone  (4c)  orientation,  it  can  be  concluded 
that  the  former  appears  product-like  whereas  4c 
appears  educt-like. 

The  simpler  [iC^H^)Ni(C^H()V  model  (dis¬ 
cussed  in  the  previous  section)  is  able  to  give 
essential  insights  into  the  mechanism,  derived  from 
the  located  transition  state  structures.  But  only  by 
the  inclusion  of  an  additional  ethylene  ligand  could 
we  overcome  the  lack  of  coordinative  unsaturation 
at  the  nickel  center.  A  different  picture  between 
the  supine  and  prone  arrangement  results.  The 
coordinative  interaction  of  the  ethylene  with  Ni(II) 
is  very  similar  at  each  stage  of  the  reaction,  as 
indicated  by  the  nearly  identical  ligand  geometry 
distortion,  regardless  of  whether  the  allylic  and 
butadiene  moieties  are  supine  or  prone  oriented. 

As  found  for  the  [C^H^^NiV  system  the  Ni-C 
distances  are  found  to  be  noticeably  shorter  at  the 
LDA  than  at  the  LDA/BP-NLSCF  level.  The 
above-discussed  geometrical  aspects  for  the  educts, 
products,  and  transition  states  are  found  also  with¬ 
out  including  gradient  corrections.  Greater  devia¬ 
tions  from  the  LDA/BP-NLSCF  geometries  occur 
for  the  transition  states  3c,  4c.  Two  different  geo¬ 


metrical  arrangements  are  revealed,  tetragonal 
pyramidal  with  2.084  A  for  3c  and  trigonal  bipyra- 
midal  with  2.079  A  for  the  CC  (r-bond  for  4c. 

In  free  optimizations  starting  downhill  from 
marginally  distorted  transition  state  geometries, 
that  is  with  an  elongated  and  shortened  CC  dis¬ 
tance  (which  is  mainly  involved  in  the  reaction 
coordinate),  proved  that  the  two  transition  states 
correspond  to  the  77^-butadiene  educts  (3a,  4a)  and 
the  products  (3d,  4d),  respectively. 

The  calculated  energy  profile  at  several  levels  of 
theory  using  the  standard  DZVP  basis  is  reported 
in  Table  II.  From  the  above-discussed  geometrical 
aspects  of  the  insertion  reaction  we  would  expect  a 
noticeable  energetic  difference  especially  for  the 
transition  states,  resulting  in  quite  different  activa¬ 
tion  energies  for  supine  and  prone  arrangements. 
Due  to  the  inclusion  of  the  ethylene  ligand,  Tol- 
man's  18-16  electron  rule  should  be  fulfilled;  we 
also  expect  more  realistic  reaction  energies,  com¬ 
pared  with  the  experiment. 

First  we  discuss  the  energetics  for  the  complete 
optimized  structures  at  the  LDA/BP-NLSCF  level. 
In  contrast  to  the  simpler 
model  the  insertion  reaction  was  calculated  to  be 
exothermic  in  both  arrangements,  8.5  kcal/ mol  for 
supine  and  about  twice  as  much,  i.e.,  17  kcal/mol 
(Table  II),  for  prone.  Also  the  (3a,  4a)  and 
(3b,  4b)  educts  are  energetically  similar,  with  the 
'>7Leducts  favored  by  about  3  kcal/mol.  So  from 
the  energetic  point  of  view  both  educts  are  eligible 
as  reactants  for  the  true  insertion  reaction,  but  the 
T7^-educts  should  be  ruled  out  according  to  deduc¬ 
tions  from  the  principle  of  least  structure  variation. 
The  most  important  difference  was  found  for  the 
transition  states.  Whereas  for  the  supine  mode  a 


TABLE  III _ - 

Calculated  energy  profile  at  the  LDA  /  BP-NLSCF 
level  for  the  Ti-allyl  insertion  in  the  [(anti-CyHi,) 
Ni(C4H6)]*  system  (kcal/mol).®  The  structures  are 
T) ^-butadiene  (a),  rj ^-butadiene  (b)  educt,  transition 
state  (c),  and  product  (d).  _ 


Structure  Supine  (5)  Prone  (6) 


3  0.0  3-9 

b  11.5  12.8 

c  25.2  12.2 

d  -4.7  -1.8 


®The  supine  ‘'-butadiene  educt  {5a)  was  chosen  as  refer¬ 
ence  point. 
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similar  activation  energy  of  about  26  kcal/ mol  as  Quite  similar  energetics  result  from  single-point 

for  the  [(CjHslMfQH^)]^  model  arises,  only  LDA/BLYP-NLSCF  calculations  using  the  opti- 

about  4  kcal/mol  was  calculated  for  the  prone  mized  LDA/BP-NLSCF  structures.  The  prone  in¬ 
mode.  So  besides  the  thermodynamic  preference,  sertion  was  twice  as  exothermic  as  supine,  i.e., 

the  prone  orientation  should  also  be  kinetically  about  16  and  8  kcal/mol,  and  the  activation  ener- 

quite  feasible.  gieg  were  calculated  to  be  about  27  or  6  kcal/mol 

The  calculated  reaction  energies  are  not  influ-  for  supine  or  prone.  The  largest  deviation  of  ap- 

e^ed  by  using  the  larger  TZVP  basis  at  the  proximately  5  kcal/mol  appears  in  the  case  of  the 

LDA/BP-NLSCF  level.  Also  the  activation  energy  17^-educts.  At  the  LDA/BP-NLSCF  level  the  tj*- 

for  supine  remains  the  same,  whereas  for  prone  educts  are  slightly  more  stable  than  the  TjLgducts 

about  6  kcal/mol  was  calculated.  but  at  the  LDA/BLYP-NLSCF  level  that  relation  is 


T|4-Educt(18  e")  AE=  0.0  kcal/mol  Product(16  e')  AE  = -4.7  kcal/mol 


Ti2-Edu(a(16e-)  AE=  11.5  kcal/mol  Transition  State  AE  =  25.2  kcal/mol 

FIGURE  2.  Modeling  of  the  Tr-allyl  insertion  mechanism  (antibutenyl  configuration)  (SUPINE)[Ni(T73,T7Ac7H,d(T72/77‘*. 
c/s-C4Hs)]+  (DZVP/A1 -Basis  VWN  BP  NLSCF  XC  grid-fine,  relative  energies,  ANTI-Butenyl  SUPINE  n'^-educt  is  the 
reference  point). 
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turned  around.  The  energetic  effects  caused  by  a 
complete  optimization  at  the  LDA/BLYP-NLSCF 
level  were  estimated  to  be  negligible. 

The  relative  energetics  of  the  insertion  reaction 
are  surprisingly  well  described  at  the  LDA  level 
compared  to  the  more  reliable  gradient-corrected 
functionals.  The  prone  insertion  is  thermodynami¬ 
cally  preferred  over  supine,  although  by  only  about 
3  kcal/mol.  The  calculated  activation  barriers  of 
about  27  or  3  kcal/mol  for  supine  or  prone  agree 
quite  well  with  the  LDA/BP-NLSCF  values.  On 
the  other  hand  unlike  the  gradient-corrected  func¬ 
tionals,  a  rather  large  energy  difference  between 
the  17^-  and  r/^-educts  appears.  Taking  the  strik¬ 


ingly  good  energetically  description  at  the  LDA 
level  into  accoimt,  than  the  fairly  good  agreement 
between  the  perturbatively  added  BP  corrections 
(LDA/BP)  and  the  full  self-consistent  BP  treat¬ 
ment  (LDA/BP-NLSCF)  is  not  so  unexpected.  Al¬ 
though  we  do  not  have  any  real  measure  for  the 
reliability  of  the  calculated  energetics  for  the  in¬ 
vestigated  systems,  for  example,  by  means  of 
highly  correlated  ab  initio  methods,  at  the  moment 
we  are  worried  about  the  reliability  of  the 
LDA/BSPP  calculated  energetics.  More  elaborate 
calculations  on  small  organyl-nickel  compounds  at 
the  coupled-cluster  singles  and  doubles  (CCSD{T)) 
level  have  been  started. 


f 


T|4.Educt(18  e')  AE=  3.9  kcal/mol 


Product(  1 6  e“)  AE  =  -1 .8  kcal/mol 


Ti2-Educt(16  e“)  AE  =  12.8  kcal/mol  transition  state  AE  =  12.1  kcal/mol 


FIGURE  3.  Modeling  of  the  Tr-allyl  insertion  mechanism  (antibutenyl  configuration)  (PRONE) 

cis-C^H^)V  (DZVP/AI-Basis  VWN  BP  NLSCF  XC  grid-fine,  relative  energies,  ANTI-Butenyl  SUPINE  T?^-Educt  is  the 

reference  point). 
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At  the  LDA/BSPP  level,  which  should  be  suit¬ 
able  to  overcome  the  error  introduced  by  self-in¬ 
teraction,  the  17^-educt  results  considerably  more 
stable  than  the  ry^-educt. 

A  detailed  mechanistic  interpretation  and  com¬ 
parison  of  these  models  (1,  2,  3,  4)  and  their  role  in 
describing  the  catalytic  chemical  insertion  reaction 
of  butadiene  will  be  given  in  a  separate  more 
chemical-oriented  work  [4]. 

CATIOIXIC  [(C7Hu)rVi(C4H^)]+  COMPLEX 

In  conclusion  of  the  models  described  above, 
the  additional  ethylene  ligand  gave  a  reliable  pic¬ 
ture  of  the  insertion  process.  Here  we  want  to 
investigate  its  effect,  to  be  part  of  the  growing 
polymer  chain.  The  calculations  were  performed  at 
the  LDA/BP-NLSCF  level  for  an  antibutenyl  con¬ 
formation  of  the  growing  end  of  the  polymer, 
which  transforms  during  the  insertion  to  a  cis  unit 
of  the  polymer.  This  model  (5,  6)  takes  into  ac¬ 
count  all  the  essential  coordination  as  it  is  pro¬ 
posed  for  the  reaction  mechanism  [5]  for  the  real 
catalyst. 

It  was  found  that  the  orientation  of  the  77^- 
olefinic  part  of  the  polymer  is  quite  similar  to  the 
ethylene  ligand  in  the  models  discussed  (3,  4) 
above.  Furthermore  the  allyl  part  in  the  educts  is 
very  similar  to  those  in  the  products,  which  is 
important  for  the  overall  process. 

The  supine  orientation  gives  slightly  more  sta¬ 
ble  educts  (5a)  and  products  (5d)  (see  Table  III), 
but  the  activation  energy  for  the  prone  orientation 
is  essentially  smaller  (only  12  kcal/mol)  than  for 
the  supine  (25  kcal/mol).  Therefore  the  reaction 
starts  from  the  prone  orientation,  which  is  not  the 
most  stable  one.  This  agrees  with  nuclear  magnetic 
resonance  (NMR)  investigations  of  labeled  com¬ 
pounds  in  the  1,4-cis-polybutadiene  [10],  which 
had  been  interpreted  to  arise  from  prone  orienta¬ 
tions. 

The  optimized  structures  of  educts,  transition 
states,  and  products  for  supine  (Fig.  2)  and  prone 
orientation  (Fig.  3)  are  summarized. 


Conclusions 

The  aim  of  this  research  was  to  study  the  inser¬ 
tion  of  s-cis-butadiene  into  the  anti-T^Tbute^yl- 
nickel(II)  bond  attempting  to  support  the  proposed 
77-allyl  insertion  mechanism  for  the  1,4-cis  poly¬ 
merization  of  butadiene.  The  investigations  were 


model  system  [MC^H^r 
transition  state 


modeJ  system  [NiCg  Kjgl* 
transition  state 


model  system 

transition  state 


FIGURE  4,  Calculated  energy  profiles  (kcal/mol)  for 
the  Insertion  of  butadiene  into  the  allyl-nickel  bond  for 
supine  and  prone  orientation  in  the  three  model 
compounds  (I,  II,  III). 
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done  on  model  compounds  of  increasing  complex¬ 
ity,  the  ,  [(031^5X021^4) 

M(04H6)]+,  and  [(07Hii)M(04H6)]-"  system, 
modeling  the  ligand-free  cationic  butenyl  nickel(II) 
complex. 

For  the  [(03H5)M(04H5)]^  complex  no  clear 
difference  for  both  the  main  geometrical  parame¬ 
ters  and  the  reaction  profile  between  the  supine 
and  prone  coordination  mode  was  appeared.  The 
necessity  of  the  incorporation  of  the  next  double 
bond  of  the  growing  polymer  chain  for  a  correct 
picture  of  the  geometrical  and  energetical  aspects 
was  clear  demonstrated  by  the  [(C3H5)(C2H4) 
Ni(QH,)V  and  [{CyH,,)Ni(CMV  systems.  A 
clear  difference  between  the  supine  and  prone 
orientation  has  occurred  for  these  systems,  espe¬ 
cially  realized  at  the  transition  states.  The 
product-like  supine  transition  state  3c,  5c  can  be 
described  as  tetragonal  pyramidal  and  the  rather 
educt-like  prone  transition  state  4c,  6c  as  trigonal 
bipyramidal  structure. 

As  a  result  of  the  lack  of  coordinative  saturation 
of  Ni(II)  in  the  [iC3Hs)Ni(CiH(,)r  system,  the 
insertion  was  calculated  to  be  endothermic  with  an 
activation  barrier  of  about  25  kcal/mol.  For  the 
[(C3H5)(C2H4)M(C4H5)]''  model  a  noticeable 
thermodynamic  and  kinetic  preference  of  the  prone 
over  the  supine  orientation  has  occurred.  The  in¬ 
sertion  was  determined  to  be  exothermic  by  about 
8.5  or  17  kcal/mol  with  a  barrier  of  about  26  or  4 
kcal/mol  for  supine  or  prone,  respectively.  For  the 
biggest  model  [(C'^ the  insertion  is 
5  or  6  kcal/mol  and  the  barrier  is  25  or  8  kcal/mol 
for  supine  or  prone.  The  insertion  reaction  pro¬ 
ceeds  from  the  prone  orientation.  Figure  4  com¬ 
pares  the  energy  profiles  for  supine  and  prone 
orientations  of  all  three  models  at  the  LDA/BP- 
NLSCF  level. 

The  transition  states  can  be  characterized  by  a 
nearly  complete  change  in  hybridization  of  the 
reaction  centers  at  the  carbon  atoms  of  the  newly 
to  form  C — C  cr-bond  from  sp^  to  sp^.  As  a  conse¬ 
quence  this  C — C  tr-bond  is  coordinated  lower  and 
moved  away  from  the  nickel  coordination  sphere, 
while  the  other  ligands  remain  7r-coordinated  to 
Ni(II). 

From  a  theoretical  point  of  view  the  usage  of 
gradient-corrected  functionals  is  recommended  for 
a  reliable  description  of  the  investigated  reactions. 
Very  similar  energy  profiles  were  derived  with 
both  the  LDA/BP-NLSCF  and  the  LDA/BLYP- 
NLSCF  treatment.  Also  for  the  geometries  of  the 


optimized  structures  we  would  expect  no  signifi¬ 
cant  difference  between  these  functionals.  AJ- 
though  the  nickel-ligand  interaction  was  not  prop¬ 
erly  described  at  the  LDA  level,  the  relative  geo¬ 
metrical  changes  are  comparable  to  the  former  one. 
For  the  estimation  of  the  energetics  the  LDA,  even 
by  perturbatively  adding  of  gradient  corrections,  is 
not  able  to  give  a  well-balanced  energetic  descrip¬ 
tion. 

In  further  studies  we  will  extend  our  investiga¬ 
tions  on  one  hand  to  the  use  of  more  elaborated 
methods  to  study  the  reliability  of  the  DFT  calcula¬ 
tions  and  on  the  other  hand  we  want  to  continue 
in  study  the  catalytic  reaction  scheme  of  butadiene 
in  monoligand  nickel(II)  complexes. 
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ABSTRACT  _ _ _ _ 

The  interaction  of  two  iron  atoms  with  molecular  nitrogen  was  studied  by  means  of 
density  functional  techniques.  Calculations  were  of  the  all-electron  type,  and  both 
conventional  local  and  gradient-dependent  approximate  (GDA)  models  were  used.  A 
ground  state  (GS)  of  linear  structure  was  found  for  Fe2-N2  with  2S  -I-  1  =  7;  whereas  a 
distorted  tetrahedral  structure,  being  also  a  septuple!,  was  located  at  4.0  and  14.3 
kcal/mol  above  the  GS,  at  the  local  and  GDA  levels  of  theory,  respectively.  The  N-N 
bond  is  moderately  perturbed  in  the  GS,  but  it  is  strongly  activated  in  the  tetrahedral 
mode:  It  has  bond  orders  of  2.6  and  1.5,  vibrational  frequencies  of  2148  and  1496  cm“’, 
and  equilibrium  bond  lengths  of  1.14  and  1.24  A,  for  the  linear  and  tetrahedral  geometries, 
respectively.  These  values  are  3.0,  2359  cm”^,  and  1.095  A,  for  free  N2.  At  GDA  level  of 
theory,  the  Fe2-N2  binding  energy  is  15  kcal/ mol,  which  is  bigger  than  that  of  Fe-N2  (9 
kcal/mol).  The  rr-back  donation,  in  the  linear  GS,  is  of  0.31  electrons,  but  the  total  charge 
transfer,  from  Fe2  to  N2,  is  only  0.05  units.  This  is  relevant  in  comparison  with  the 
tetrahedral  mode,  where  the  Fe2  to  N2  total  charge  transfer  is  of  0.45  electrons,  yielding 
a  stronger  activated  N2  moiety.  ©  1996  John  Wiley  &  Sons,  Inc. 


Introduction 

The  experimental  and  theoretical  study  of  the 
interaction  of  molecular  species  with  transi¬ 
tion  metals  (TM)  is  nowadays  an  active  field  of 
research.  Large  efforts  are  devoted  to  determine 
the  mechanism  of  activation  that  some  molecules 
(such  as  N2,  N2,  O2,  etc.)  experience  in  the  pres¬ 
ences  of  TMs  [1-6].  Surfaces,  organometallic  com- 
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plexes,  and  enzymes  represent  some  catalytic  sys¬ 
tems  responsible  for  many  important  chemical 
reactions  that  occur  in  biological  and  industrial 
processes.  They  usually  contain  one  or  two  t3qpes 
of  TMs  surroimded  by  some  ligands  [1]  (e.g.,  nitro- 
genase  contains  Fe,  Mo,  S,  and  H  atoms).  So  far,  it 
is  not  totally  clear  which  metalhc  site  is  preferred 
for  the  activation;  also  the  role  played  by  the 
ligands  attached  to,  or  surroimding  the  metallic 
centers  is  not  known.  Nonetheless  it  is  expected 
that  both  the  metal-metal  and  metal-ligand  inter¬ 
actions  within  the  enzyme  or  complex  will  pro- 
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mote  a  electronic  state  of  optimal  catalytic  capabil¬ 
ity.  The  study  of  the  catalytic  properties  has  been 
focused  on  a  fragment  of  the  whole  system.  This 
small  piece  commonly  contains  a  TM  atom  joined 
to  some  ligands,  in  the  case  of  enzymes,  or  joined 
to  a  small  number  of  TM  atoms,  in  the  case  of 
metallic  surfaces;  this  TM  atom  will  be  labeled  as 
TM’'.  This  kind  of  approach  reduces  the  problem 
to  the  study  of  a  TM*-L  system,  which  is  more 
amenable  for  the  search  of  lowest  energy  struc¬ 
tures,  transition  states,  activation  energies,  etc., 
that  occurs  during  the  catalytic  processes.  A  limi¬ 
tation  of  this  approach  is  in  the  choice  of  a  struc¬ 
ture  for  the  active  site,  which  is  more  difficult  to 
select  for  an  enzyme. 

On  the  other  hand,  a  theoretical  study  of  the 
whole  enzyme-molecule  system  is  beyond  the 
state  of  the  art  both  of  the  computational  method¬ 
ologies  and  of  the  available  computational  facili¬ 
ties.  One  reason  for  such  limitation  originates  from 
the  complicated  nature  of  the  exchange-correlation 
(XC)  that  arises  when  TM  atoms  are  involved.  For 
instance,  it  is  well  known  that  the  Hartree-Fock 
(FIF)  level  of  theory  is  a  very  bad  starting  point  for 
TMs,  and  that  their  treatment  at  least  requires  a 
second  order  Moller-Plesset  perturbation  theory 
(MP2)  approach  (at  this  level,  the  nonlocal  XC 
operator  scales  as  N^),  or  even  higher,  but  with  a 
consequent  higher  order  scaling,  for  a  reasonable 
accounting  of  XC.  On  the  other  hand,  density  func¬ 
tional  theory  (DFT)  accounts  for  XC  of  many  elec¬ 
tron  systems  through  local,  and  presumably,  more 
accurate  gradient-corrected  functionals  and  at  the 
same  time  scale  as  N'"  (or  even  lower  when  cutoffs 
are  employed).  Then,  DFT  base  methods  may  be 
used  for  the  study  of  molecules  interacting  wit  TM 
systems  [3,  4]. 

In  an  attempt  to  understand  the  activation  of 
molecular  nitrogen  by  systems  containing  Fe 
atoms — organometallic  complexes  [1]  and  en¬ 
zymes  (i.e.,  nitrogenase) — we  have  recently  started 
a  theoretical  study  of  the  interaction  of  N2  with  Fe, 
Fez,  FS2S2/  FeMoSz,  and  up  in  order  to  reach  a 
cluster's  size  where  the  activation  mechanism  has 
been  switched  on.  The  goal  is  to  obtain  a  character¬ 
ization  of  the  site  that  is  responsible  of  the  Nz 
fixation  and  to  get  insight  about  the  role  that 
the  metal-metal  and  metal-ligand  interactions, 
wherein  the  cluster,  play  in  that  kind  of  .process. 

In  a  previous  work  [6],  we  reported  the  results 
of  Fe-Nz.  Two  lowest  energy  states  were  found 
(linear  and  triangular  triplets),  at  which  the  bond¬ 
ing  of  Nz  is  strongly  perturbed  by  the  Fe  atom  [6]. 


Now,  we  are  going  to  discuss  the  lowest  energy 
structures  that  result  from  the  interaction  of  Fez 
with  Nz. 


Computational  Procedure 

The  study  was  done  through  first-principles 
all-electron  calculations  with  the  program  deMon- 
KS  [7]:  a  Linear  Combination  of  Gaussian-type  Or¬ 
bitals  (LCGTO)  DFT-based  method.  Here,  the  local 
spin  density  approximation  (LSD A)  was  included 
as  in  Vosko  et  al.  [8]  while  the  gradient-dependent 
approximations  (GDA)  were  those  of  Perdew  and 
Wang  for  exchange  [9],  and  Perdew  for  correlation 
[10].  The  GDA  potential  was  included  in  the 
Kohn-Sham  potential  during  the  self-consistent- 
field  (SCF)  procedure  [7].  Orbital  basis  sets  of 
DZVP2  quality  [(63321/5211*/41 -F  )  for  Fe  and 
(721/51/1*)  for  N],  optimized  explicitly  for  DFT 
calculations  [11],  were  used.  They  contain  p- 
polarization  and  d-diffuse  functions.  Additional 
auxiliary  bases  of  Gaussian-type  orbitals  were  em¬ 
ployed  to  describe  the  change  density  (CD)  and 
the  XC  potential.  During  the  iterative  step,  the  CD 
is  fitted  analytically,  while  the  XC  potential  is 
fitted  numerically  on  a  fine  grid.  At  the  end  of  the 
SCF  procedure,  the  XC  contributions  to  the  energy 
and  energy  gradients  are  calculated  by  numerical 
integration  on  an  augmented  set  of  grid  points 
[12].  By  means  of  the  Broyden-Fletcher-Gold- 
farb-Shanno  algorithm  [13],  the  geometries  were 
optimized  by  minimizing  the  norm  of  the  gradient 
(with  a  10“^  a.u.  threshold).  A  tolerance  conver¬ 
gence  criteria  of  10^*^  a.u.  was  used  for  the  total 
energy  and  of  10“^  a.u.  for  the  electronic  density. 
These  criteria  are  needed  for  a  correct  determina¬ 
tion  of  the  lowest  energy  structure  for  a  given 
TM-L  geometry,  since  it  was  observed  that,  in 
some  cases,  there  are  several  coordination  modes 
within  a  small  energy  range  [3,  4].  Finally,  a  vibra¬ 
tional  analysis,  under  the  harmonic  approxima¬ 
tion,  was  done  for  the  optimized  geometries.  This 
gives  information  about  the  stability  or  nature,  a 
minimum  or  a  transition  state,  of  the  optimized 
geometry. 


Results  and  Discussions 

We  have  started  the  study  of  Fcz-Nz  by  choos¬ 
ing  some  input  geometries  (see  Fig.  1).  The  geome¬ 
try  optimization  of  these  starting  structures  was 
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Fe2  Fel  N3  N4 


coordination  modes. 


done  for  several  electronic  states  of  different  total 
spin  S  or  multiplicity  M  =  2S  +  1.  In  all  cases,  the 
initial  N-N,  Fe-N,  and  Fe-Fe  internuclear  dis¬ 
tances  were  1.095,  2.0,  and  2.0  A,  respectively.  This 
initial  guess  yields  a  pletora  of  possibilities  for  the 
Fe2-N2  coordination  modes.  Figure  2  shows  some 
optimized  geometries.  Visualization  techniques,  as 
implemented  in  UniChem  [14],  were  used  to  fol¬ 
low  the  evolution  of  the  Fe2-N2  structures  during 
the  geometry  optimization  procedure.  Initial  and 
final  structures,  as  obtained  with  UniChem,  are 
shown  in  Figures  1  and  2.  Also  the  density  contour 
plots  were  obtained  with  the  use  of  UniChem, 
coupled,  in  this  case,  with  the  program  Dgauss 
[15],  which  was  used  for  the  generation  of  the 
electronic  density  of  the  optimized  structure.  The 
density  contour  plots  for  the  GSs  of  Fe2  and  N2  are 


reported  in  Figure  3,  and  those  for  some  located 
Fe2-N2  structures  are  reported  in  Figure  4. 

In  general,  high-spin  states  are  energetically 
more  favored  for  Fe2~N2  (which  shows  prefer¬ 
ences  for  M  =  7  and  M  =  5  states,  with  the  former 
lying  deeper  in  energy)  as  compared  with  Fe2-N2, 
which  has  preferences  for  M  =  3  and  M  =  5  states 
[6].  Some  of  these  lowest  energy  structures  are 
contained  within  an  small  energy  range  (of  about  5 
kcal/mol),  and  they  have  different  dimensionality 
and/or  multiplicity;  see  Figure  2.  The  values  in 
parenthesis  show  the  relative  stability  at  GDA 
level,  which,  in  principle,  is  a  better  approach  than 
LSDA.  These  two  levels  of  theory  yield  a  different 
energy  position  for  some  states,  which  reveals  both 
the  complicated  nature  of  XC  in  systems  involving 
TM  atoms  and  the  complicated  nature  of  the  po¬ 
tential  energy  surface  of  Fe2-N2.  For  example,  the 
two-dimensional  septuple!  is  a  transition  state, 
since  its  vibrational  analysis  yields  an  imaginary 
frequency,  but  while  LSDA  locates  it  at  -t-7.6 
kcal /mol,  with  respect  to  the  GS,  GDA  locates  this 
state  at  only  +2.7  kcal/mol  above  the  GS.  More¬ 
over,  there  is  agreement  between  GDA  and  LSDA 
in  that  the  GS  is  the  linear  septuple!;  but  these  two 
approaches  give  a  different  ordering  for  the  re¬ 
maining  sates.  Now,  we  are  going  to  discuss  the 
main  features  of  the  Fe2-N2  structures.  Table  I 
shows  the  calculated  equilibrium  bond  lengths, 
and  Table  II  shows  the  vibrational  frequencies, 
0)^,  as  well  as  the  and  Aco  changes  experi¬ 
enced  by  N2.  Table  III  presents  a  Mulliken  popula¬ 
tion  analysis  for  the  valence  electrons  of  the  GS; 
while  Table  IV  contains  the  gross  populations  that 
the  N  and  Fe  atoms  have  in  the  Fe2~N2  structures. 
Finally,  the  binding  energies,  computed  at 
LSDA  and  GDA  levels  of  theory,  are  reported  in 
Table  V. 

Fe2-I\2,  THE  E/VD-O/V  GROUND  STATE 

A  linear  structure,  with  M  =  7,  was  found  as 
the  GS  for  Fe2-N2.  The  linear  quintet  was  found 
=  16  kcal/mol  higher  in  energy.  In  the  Fe2"-N2 
end  on  mode,  the  and  of  N2  are,  with 
respect  to  free  N2,  enlarged  and  reduced  by  0.019 
A  (0.017  A)  and  by  195  cm“^  (179  cm"^),  respec¬ 
tively,  at  LSDA  (GDA)  level  of  theory.  These  re¬ 
sults  imply  a  reduction  of  the  multiple  bond  of  N2. 
Indeed,  a  Mayer  bond  order  (BO)  analysis  [16] 
yields  a  BO  of  2.6  for  N2  in  Fe2-N2.  (free  N2  has 
BO  =  3,0).  That  is,  the  bonding  of  the  N2  moiety 
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Angular  (M=7)  +7.6  (+2.7) 
(side-on  Transition  State) 


N4 


Tetrahedral  (M=7)  +4.0  (+14.3) 


Fe2 


quasi  side-on  mode 
(M=7)  +5.3  (+2.6) 


quasi  side-on  mode 
(M=:5)  +9.1  (+12.7) 


Linear  (M=5)  +15.7  (+17.2) 


FIGURE  2.  Optimized  geometries  for  the  lowest  energy  structures  of  Fe2“N2.  LSDA  and  GDA  (values  in  parenthesis) 
relative  energies  are  indicated  as  well  as  the  LSDA  equilibrium  bond  lengths. 


has  been  reduced  by  less  than  a  half  bond.  In 
linear  TM-L  systems  a  synergistic  ligand  to  metal 
(a  cr-donation)  and  metal-to-ligand  (a  7r-back  do¬ 
nation)  charge  transfer  mechanism  is  operative. 
This  last  is  a  measure  of  the  TM-L  bond  strength. 
In  the  GS  of  Fe2-N2  there  is  a  cr-donation  of  0.325 
electrons  and  a  7r-back  donation  of  0.314  electrons; 
see  Table  II.  (These  values  are  smaller  than  those 
of  the  triplet  linear  GS  of  Fe2~N2,  where  there  are 
(T-  and  TT-back  donations  of  0.37  and  0.73  electrons, 
respectively  [6].  The  smaller  7r-back  donation  in 
Fe2-N2  is  due  to  the  fact  that  the  Fe  atom,  which 
is  bonding  directly  to  N2,  is  also  bonding,  through 
some  TT-electrons,  with  the  other  Fe  atom.)  This  tt 
metal-to-ligand  charge  transfer,  although  a  modest 


value,  accounts  for  the  high  stability  exhibited  by 
the  linear  septuple!.  The  total  density  contour  plot 
for  the  GS  [see  Fig.  4(a)]  reveals  some  accumula¬ 
tion  of  charge  density  in  the  metal  (Fei)-ligand 
(N3)  region,  which  accounts  for  a  high  bond 
strength.  In  fact,  in  this  one-dimensional  geometry 
is  where  the  Fe-N  distance  reaches  its  shortest 
value  (see  Fig.  2)  and  its  highest  binding  energy. 
Fe2-N2  have  a  metal-ligand  of  30.3  and  14.6 
kcal/mol  at  the  LSDA  and  GDA  schemes,  respec¬ 
tively.  (These  DJs  were  computed  with  respect  to 
the  GSs  of  Fe2  and  N2  free  molecules.  Note  that 
the  quintet  linear  state  of  Fe2-N2  in  unbound,  at 
GDA  level  of  theory.)  On  the  other  hand,  there  is  a 
very  small  total  charge  transfer  from  Fe2  to  N2, 
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which  is  of  about  0.05  units;  see  Table  III.  This  last 
fact  accounts  for  the  relatively  small  perturbation 
of  the  N-N  bonding  by  Fe2  (the  transferred  elec¬ 
trons  goes  into  antibonding  7r-t5^e  molecular  or¬ 
bitals  of  N2).  Summarizing,  the  main  findings  for 
the  computed  GS  are:  the  metal-ligand  for  the 
linear  septuplet  is  the  biggest  one,  (see  Table  V), 
but  the  perturbation  experienced  by  N2  is  rela¬ 
tively  small. 

Fe2-IN2>  A  SIDE-ON  TRANSITION  STATE 

The  two-dimensional  (2D)  septuplet  of  Fe2-N2 
was  located  as  a  transition  state  (TS),  since  its 
vibrational  analysis  yields  an  imaginary  frequency. 
The  N-N  bonding  experiences  a  bigger  perturba¬ 
tion  in  the  TS  than  in  the  GS.  For  example,  in  the 
TS  the  N-N  distance  (1.16  A)  and  the  (from  Fe2  to 
N2)  total  charge  transfer  (0.1  units)  are  bigger  than 
the  corresponding  values  for  the  GS.  The  2D  septu¬ 
plet  may  be  the  starting  point  that  gives  rise  to 
Fe2-N2  coordination  modes  in  which  the  N2  moi¬ 
ety  is  more  strongly  perturbed  by  Fe2.  Below,  we 
will  show  these  kind  of  states. 


Fe2-N2,  A  QVASI  SIDE-ON,  M=7, 

COORDINATION  MODE 

As  shown  above,  the  side-on  mode  is  unstable. 
We  have  found  that  this  type  of  coordination  (in 
which  the  N  centers  are  mostly  bonding  to  a  single 
Fe  atom)  is  stabilized  by  bending  the  N2  molecule 
out  of  the  plane  by  about  54°.  This  yields  the 
structure  labeled  as  "quasi  side-on"  in  Figure  2. 
This  structure  is  stable  in  both  septuplet  and  quin¬ 
tet  (which  is  bent  by  63°  out  of  the  plane)  states 
(with  the  septuplet  lower  in  energy),  since  their 
vibrational  analysis  does  not  produce  imaginary 
values.  In  the  septuplet  quasi  side-on  mode,  the 
and  0)^  of  N2  are  1.18  A  and  1902  cm  ^  respec¬ 
tively.  These  values  produce  A  R  and  A  co  shifts  of 
+  0.058  A  and  -441  cm“^  respectively.  These 
changes  are  bigger  than  those  of  the  linear  GS,  and 
they  reflect  a  moderate  but  important  reduction  of 
the  N-N  bond  strength.  In  fact,  in  the  M  =  7  quasi 
side-on  structure  there  is  a  total  charge  transfer, 
from  Fe2  to  N2,  of  0.14  electrons.  As  expected, 
most  of  this  charge,  0.13  units,  is  transferred  from 
the  Fe  atom  that  is  directly  bonded  to  the  N2 
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FIGURE  4.  Total  density  contour  plots  for  (a)  linear  GS,  (b)  quasi-side  on,  and  (c)  tetrahedral  coordination  modes  of 
Fe2-N2. 


moiety.  However,  we  have  observed  that  there  is 
some  interaction,  presumably  of  bonding  type, 
among  the  iron  atom  (labeled  as  Fe2)  that  is  far 
away  from  N2,  and  the  nitrogen  centers.  This  is 
reflected  by  the  total  electronic  density  contour 
plot,  displayed  in  Figure  4(b).  This  kind  of  interac¬ 
tion  could  originate  the  appearance  of  directional 
forces  that  would  move  the  N  centers  into  a  posi¬ 
tion  at  which  they  are  able  to  interact  more  equally, 
and  more  effectively,  with  the  two  Fe  centers. 
Below,  we  will  address  this  type  of  mode. 

THE  TETRAHEDRAL  i\I=7  AND  5, 
COORDINATION  MODES 

A  tetrahedral-like  approach  of  N2  to  Fe2  [see 
Fig.  1(c)]  yields  a  distorted  tetrahedron,  as  an 


stable  structure  for  Fe2-N2  (see  Fig.  2).  High  spin 
states,  M  =  7  and  5,  occur  in  this  geometry.  They 
are  quasi-degenerate  at  LSDA  level  of  theory.  But 
the  introduction  of  GDA  corrections  puts  the  quin¬ 
tet  at  8.9  kcal/mol  above  the  septuplet.  In  these 
states,  the  two  N  atoms  are  equally  bonded  to  both 
of  the  two  Fe  atoms.  The  Fe-N  R/s  are  equal  to 
1.92  A^(1.87  A),  and  the  N“N  distance  is  of  1.26  A 
(1.28  A)  for  the  M  =  7  (M  =  5)  state.  This  gives, 
with  respect  to  free  N2,  AR  =  0.14  A  for  M  =  7, 
and  AR  =  0.17  A  for  M  =  5.  In  these  tetrahedral 
modes  occurs  the  biggest  enlargements  of  the  N-N 
bond  lengths,  which  means  a  substantial  weaken¬ 
ing  of  the  N-N  bond.  This  is  also  implied  by  the 
huge  reductions  of  the  N2  frequencies  (in  Fe2-N2). 
These  A  o)  changes  are,  with  respect  to  the  free  N2 
value  (2343  cm“^),  —847  and  —951  cm~^,  for  the 
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TABLE  I _ _ _ 

Equilibrium  bond  lengths  for  the  calculated  structures  of  Fe2-N2.  AR  changes  of  Nj  are  also  indicated. 
GDA  values  are  shown  in  parenthesis. 


Structures 

M 

N-N 

Re  (A) 

Fe-N 

Fe-Fe 

^R  (N-N) 

Linear 

7 

1.137  (1.146) 

1.837  (1.929) 

2.018  (2.016) 

0.019(0.017) 

5 

1.139  (1.147) 

1.846  (1.951) 

1 .853  (1 .856) 

0.021  (0.018) 

Tetrahedral 

7 

1.256  (1.221) 

1.924  (1.87) 

2.068  (2.017) 

0.138  (0.092) 

5 

1.283  (1.301) 

1.867  (1.90) 

2.041  (2.030) 

0.165  (0.172) 

Angular 

7 

1.164  (1.165) 

1.966(2.109) 

2.036  (2.023) 

0.046  (0.036) 

Quasi  side-on 

7 

1.176  (1.167) 

1.923  (2.104) 

2.024  (2.024) 

0.058  (0.038) 

5 

1.187  (1.194) 

1.868  (1.930) 

1.916  (1.928) 

0.069  (0.065) 

Planar 

7 

1.227  (1.239) 

1.810  (1.852) 

2.108  (2.102) 

0.109  (0.110) 

5 

1.251  (1.267) 

1 .727  (1 .756) 

2.108  (2.059) 

0.133  (138) 

Calculated  at  LSDA^ 

1.118  (1.129) 

0.020  (0.031) 

Experimental 

1.098 

®For  free  N2. 

For  free  N2;  from  Refs.  5  and  17. 

M  =  7  and  5  states,  respectively.  In  fact,  the  N-N 

that,  in  this  septuplet  coordination  mode,  the  N2 

bond  strength  (in  Fe2“ 

N2)  is  1.5, 

which  is  a  much 

moiety  is  strongly  perturbed  or  truly  activated  by 

weaker  bond  that  the  triple  bond  of  free  N2.  Also, 

the  Fe  atoms.  This  activation  is  slightly  bigger  in 

there  are  large  total  charge  transfers  from  Fe2  to 

the  quintet  state. 

which  is  higher 

in  energy  than 

N2:  They  are  equal  to  0,44  and  0.52  electrons  for 

the  septuplet  one.  It  is  to  be  remarked  that  the 

the  M  =  7  and  5  states,  respectively.  These  trans- 

biggest  structural  changes  (experienced  by  N2)  are 

ferred  electrons  go  into  high-energy  antibonding 

coupled  with  huge  total  charge  transfers  (from  Fe2 

molecular  orbitals  of  N 

2-  The  density  contour  plots. 

to  N2).  In  fact,  this  occurs  in  the  quintet  tetrahe- 

for  the  M  =  7  tetrahedral  state  [see  Fig.  4(c)],  re- 

dral  state,  and  to 

a  lesser  extent  in  the  septuplet 

veal  that  the  electronic  charge  is 

highly  polarized 

one.  However,  at  GDA  level  of  theory,  the  for 

toward  the  N  atoms.  In  fact,  this  picture  implies 

the  septuplet  is  very  small,  about  0.3  kcal/mol. 

TABLE  II 

Vibrational  frequencies, 

Wg,  for  the  calculated  Fe2-N2 

structures. 

a>e  (cm"^) 

Structures 

M 

N-N 

Fe-N 

Fe-Fe 

Ad) 

Linear 

7 

2148  (2071) 

450  (387) 

332  (332) 

-195  (-179) 

5 

2136 

368 

491 

-207 

Tetrahedral 

7 

1496 

515,  431 

438 

-847 

5 

1392 

601 , 626 

373 

-951 

Angular 

7 

(1934) 

(208,  322) 

(415) 

(-316) 

Quasi  side-on 

7 

1902 

369,  434 

379 

-441 

5 

1837 

492,  502 

423 

-506 

Planar 

7 

1589 

499,  579 

328 

-754 

5 

1455 

336,  635 

373 

-888 

Calculated  at  LSDA^ 

2343  (2250) 

-16  (-109) 

Experimental^ 

2359 

®  For  free  Nj. 

For  free  Ngi  from  Refs.  5  and  17. 
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TABLE  III _ 

Eigenvalues  and  occupation  numbers,  per  atom  and  per  angular  momentum  component,  for  the  valence 
electrons  of  the  ground  state  of  Fe2-N2. 


Fe1  Fe2  N3  N4 


Orb. 

n, 

S 

P 

d 

s 

P 

d 

S 

P 

s 

P 

a 

1(T 

1 

-28.009 

0.01 

-0.02 

0.37 

0.06 

0.46 

0.13 

2a- 

1 

-15.212 

— 

— 

— 

0.03 

0.02 

0.03 

0.42 

0.25 

0.24 

0.02 

Itt 

2 

-11.763 

— 

— 

___ 

— 

0.01 

— 

0.54 

— 

0.43 

3cr 

1 

- 1 1 .232 

— 

— 

0.01 

0.04 

0.03 

0.03 

0.02 

0.21 

0.22 

0.44 

277 

2 

-7.213 

— 

— 

0.57 

— 

— 

0.41 

— 

— 

— 

0.02 

4a 

1 

-7.175 

— 

— 

0.64 

__ 

0.04 

0.30 

0.01 

0.01 

— 

0.01 

^8 

2 

-6.742 

— 

— 

0.77 

— 

— 

0.23 

— 

— 

— 

— 

23 

2 

-6.146 

— 

0.23 

— 

— 

0.77 

— 

— 

— 

— 

5a 

1 

-5.802 

0.64 

— 

-0.02 

0.32 

0.03 

0.04 

— 

— 

— 

— 

377 

2 

-5.392 

— 

0.01 

0.40 

— 

0.02 

0.47 

— 

0.03 

— 

0.07 

6cr 

1 

-5.049 

0.14 

— 

0.31 

0.07 

0.02 

0.47 

— 

-0.01 

— 

— 

477 

ft 

0 

-2.995 

— 

0.06 

0.02 

— 

0.10 

0.08 

— 

0.35 

— 

0.39 

P 

lo- 

1 

-27.966 

— 

— 

. — 

0.01 

-0.02 

— 

0.38 

0.06 

0.46 

0.13 

2o- 

1 

-15.157 

— 

— 

0.03 

0.02 

0.02 

0.42 

0.27 

0.24 

0.02 

I77 

2 

- 1 1 .667 

— 

— 

— 

— 

0.01 

— 

0.55 

— 

0.42 

30- 

1 

-11.120 

— 

— 

0.01 

0.04 

0.03 

0.03 

0.02 

0.21 

0.22 

0.44 

4a- 

1 

-5.573 

0.33 

— 

0.26 

0.10 

0.13 

0.14 

0.01 

0.02 

— 

0.01 

277 

2 

-5.370 

— 

0.01 

0.39 

— 

0.01 

0.54 

0.01 

— 

0.04 

So¬ 

1 

-5.017 

0.25 

0.01 

0.23 

0.22 

-0.01 

0.31 

— 

-0.01 

— 

— 

ls 

1 

-4.233 

— 

— 

0.47 

— 

— 

0.53 

— 

— 

— 

23 

0 

-3.703 

— 

-- 

0.38 

— 

0.06 

0.12 

— 

0.18 

— 

0.18 

cr-donation  0,325 

TT-back-donation  0.314 


while  there  is  no  positive  for  the  quintet  state 
(it  is  about  —8.6  kcal/mol).  The  enlargement  of 
the  cluster  or  improvement  of  the  model  (i.e.,  the 
use  of  Fe2S2  or  FeMoS2)  in  order  to  obtain  a  better 
description  of  the  active  site  may  enhance  the 
stabilization  of  these  types  of  coordination  modes. 
We  are  searching  currently  these  possibilities;  re¬ 
sults  will  be  reported  elsewhere. 


Fe2-N2,  the  planar,  M=  5  AND  7, 

COORDINATION  MODES 

A  parallel  approach  of  N2  to  Fe2  produces  two 
planar  coordination  modes  of  Fe2-N2  with  M  =  5 
and  7.  These  two  states  differ  by  only  =  2 
kcal/mol;  see  Figure  2.  The  two  N  atoms  in  these 
planar  modes  are  equally  bonded  to  the  two  Fe 


TABLE  IV _ 

Mulliken  populations  for  the  lowest  energy  states  of  Fe2“N2.  The  dipole  moment,  p,  is  also  indicated. 


Linear  Tetrahedral  Quasi  side-on 


Atom 

M-7 

M  =  7 

M  =  5 

M  =  7 

M  =  5 

Fel 

Fe2 

N3 

N4 

P 

26.00  (26.03) 

25.95  (25.93) 

7.03  (  7.02) 

7.02  (  7.01) 

1 .058  (0.605) 

25.77  (25.79) 

25.78  (25.87) 

7.22  (  7.16) 

7.22  (  7.19) 

1.105  (1.104) 

25.74  (25.72) 
25.74  (25.73) 

7.26  (  7.28) 

7.26  (  7.28) 
3.379 

25.87  (25.90) 
25.99  (26.01) 

7.07  (  7.05) 

7.07  (  7.05) 

1.561  (0.992) 

25.89  (25.86) 
25.95  (25.97) 
7.08  (  7.09) 
7.08  (  7.09) 
2.005  (1.823) 
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TABLE  V _ 

Binding  energies,  for  the  computed  states 
of  Fe2-N2. 


Fe2-N2 

M 

De 

LSDA 

(Real  /  mol) 

GDA 

Linear 

7 

30.258 

14.627 

5 

14.577 

-2.541 

Tetrahedra! 

7 

26.274 

0.311 

5 

25.673 

8.566 

Angular 

7 

22.659 

11.918 

Quasi  side-on 

7 

24.962 

12.088 

5 

21.146 

1.972 

Planar 

7 

18.104 

-4.486 

5 

27.661 

-2.234 

centers.  The  N-N  distance  is  of  1.25  A  (1.23  A)  for 
the  M  5  {M  =  7)  state.  This  gives  (with  respect 
to  free  N2)  enlargements  of  0.13  A  for  M  =  5  and 
of  0.11  A  for  M  =  7 ,  for  the  N-N  R^.  There  are 
also  big  reductions  (888  cm“^  for  M  =  5  and  754 
for  M  =  7)  for  the  co  value  of  N2.  Consis¬ 
tently,  there  are  large  total  charge  transfers,  from 
Fe2  to  N2;  they  are  0.40  and  0.38  electrons  for  the 
M  =  5  and  7  states,  respectively.  These  changes  of 
N2  are  relatively  big  (particularly  those  for  M  ==  5), 
but  smaller  than  those  observed  in  the  tetrahedral 
modes,  and  implies  a  substantial  reduction  of  the 
N-N  bond. 


Conclusions 

A  systematic  study  of  the  Fe2-N2  coordination 
modes  was  done  by  means  of  the  program  deMon- 
KS,  a  DFT-based  method.  The  calculated  ground 
state  corresponds  to  a  linear  geometry,  with  M  = 
7,  in  which  the  metal-ligand  binding  energy 
reaches  its  highest  value,  and  where  N2  shows  a 
relatively  small  perturbation  by  the  Fe  atoms.  A 
highest  activation  of  N2  occurs  in  the  tetrahedral 
Fe2“N2  geometry,  which  is  stable  in  both  quintet 
and  septuplet  high  spin  states.  Indeed,  the  bond 
length  and  vibrational  frequency  of  N2  are  consid¬ 
erably  enlarged  and  reduced,  respectively,  in  these 
tetrahedra  modes.  These  changes  indicates  that  N2 
is  truly  activated  in  this  type  of  geometries,  where 
the  two  N  atoms  interact  equally  with  the  two  Fe 
centers.  These  results  suggest  a  strong  correlation 
between  the  electronic  charge  transfer  (from  the 
metal  to  the  moiety)  and  the  activation  experi¬ 
enced  by  N2  (measured  by  the  extent  of  its  struc¬ 


tural  changes).  Big  structural  changes  of  N2  were 
also  observed  in  the  planar  coordination  modes, 
although  to  a  lesser  extent  than  those  in  the  tetra- 
hedrals. 

Going  to  a  bigger  cluster,  in  order  to  achieve  an 
improvement  in  the  description  of  the  active  site, 
our  preliminary  results  of  Fe2S2-N2  indicates  that 
the  sulfur  atoms  further  enhance  the  activation 
experienced  by  N2.  Studies  to  determine  the  role 
played  by  both  sulfur  and  molybdenum  atoms  in 
the  catalytic  process  are  in  progress  in  our  labora¬ 
tory;  results  will  be  published  elsewhere  [18]. 
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ABSTRACT _ 

Density  functional  theory  (DFT)  calculations  of  the  electronic  structure  and  properties  of 
Fe,  Ni,  and  Cu  carbonyls  with  one  and  five  transition-element  atoms  are  reported,  due  to 
their  importance  in  heterogeneous  catalysis  (especially  in  Fischer-Tropsch  synthesis).  The 
local  density  approximation  (LDA)  with  the  Vosko-Wilk-Nussain  (VWN)  correlation 
functional  as  well  as  the  generalized  gradient  approximation  (GGA)  using  Becke's  1988 
exchange  functional  and  Perdew's  1986  correlation  fimctional  (BP  method)  were  applied 
to  the  study  of  the  clusters  using  a  triple-zeta  plus  polarization  (TZP)  basis  set.  Triplet 
and  quintuplet  states  for  FeCO,  singlet  and  triplet  for  NiCO,  and  doublet  for  CuCO  were 
considered,  and  optimization  of  energy  at  the  LDA/BP  level  for  these  cases  was  done. 
Dissociation  energies  in  M  and  CO  fragments,  frequencies,  and  population  analysis  were 
performed.  Geometry  optimization  of  the  distance  between  the  C  atom  and  the  clusters 
M5  (with  geometries  taken  from  the  crystals)  were  carried  out  and  the  dissociation 
energy  in  fragments  M5  and  CO  as  well  as  population  analysis  at  the  optimized 
geometries  were  also  done.  Comparison  between  our  results  and  some  others  published 
using  ab  initio  and  density  functional  methods  were  performed.  ©  1996  John  Wiley  & 
Sons,  Inc. 
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Introduction 

Carbonyls  of  transition  elements  are  a  very 
important  class  of  compounds  for  several 
reasons.  One  of  these  is  that  they  are  considered  to 
be  the  first  step  in  the  reaction  of  CO  with  H2 
catalyzed  by  transition  elements  (Fischer-Tropsch 
synthesis)  which,  from  the  industrial  point  of  view, 
is  of  paramount  importance  due  to  the  large  num¬ 
ber  of  chemicals  that  can  be  obtained  from  it.  The 
initial  step  of  the  chain  of  reactions  is  the 
chemisorption  of  CO  on  materials  which  contain 
such  elements.  The  simplest  model  for  these  mate¬ 
rials  is  one  atom.  That  is  where  the  importance  of 
the  MCO  compounds  comes  from.  There  are  many 
experimental  and  theoretical  studies  of  these  car¬ 
bonyls,  and  our  theoretical  study  uses  one  of  the 
important  methodologies:  the  density  functional 
theory  (DFT)  in  one  of  the  successful  versions 
developed  at  present. 

As  a  further  step  it  is  natural  to  consider,  as 
suitable  models  for  real  materials,  clusters  or  crys¬ 
tals  of  the  elements  involved  and  to  use  the  appro¬ 
priate  computational  techniques. 

In  order  to  compare  the  results  of  the  MCO  and 
M„CO  calculations  (n  =  2, 3, . . . ),  we  must  use  the 
same  computational  level  in  both  cases,  and,  there¬ 
fore  the  cluster  size  must  not  be  too  large.  These 
calculations  must  provide  accurate  predictions  at 
least  for  the  equilibrium  molecular  geometries, 
dissociation  energies,  and  vibrational  frequencies. 
For  this  reason  we  have  studied,  besides  the  MCO, 
the  M5CO  compounds  with  symmetry  C4V  that 
involve  transition-atom  clusters  with  geometries 
taken  from  the  crystal.  Models  with  fewer  or  larger 
number  of  atoms  are  scarce  in  the  current  litera¬ 
ture  [1-3], 

The  transition  elements  considered  in  this  re¬ 
port  Cu,  Ni,  and  Fe,  are  important  in  the 
Fischer-Tropsch  synthesis  and  present  quite  dif¬ 
ferent  difficulties.  The  Cu  case  is  the  simplest 
because  of  its  electronic  structure  as  compared  to 
Ni  and  Fe. 

Important  points  in  our  study  are  the  prediction 
of  the  place  where .  the  CO  will  be  preferently 
adsorbed,  the  geometry  of  the  adsorbate-substrate 
and  the  energy  involved  in  such  adsorption. 

We  will  focus  our  interest  in  the  geometrical 
and  energetical  aspects  of  the  systems  under  con¬ 
sideration  rather  than  in  the  bonding  of  the  car¬ 


bonyls  that  are  extensively  considered  in  the  litera¬ 
ture. 


Method 

The  calculations  reported  in  this  work  are  based 
on  the  Kohn-Sham  approach  to  the  DFT  with  the 
methodology  of  Baerends  and  co-workers  [4,  5]. 

The  density  fxmctional,  also  called  the  ex- 
change-and-correlation  (XC)  fimctional,  consists  of 
a  local  part  [local  density  approximation  (LDA)] 
and  a  so-called  gradient  correction  [generalized 
gradient  approximation  (GGA)].  Initially,  the  LDA 
used  by  the  calculation  of  the  exchange  energy 
Ex(LDA)  was  that  proposed  by  Slater  [6]  (Xa  ap¬ 
proximation);  version  that  includes  also  a  correla¬ 
tion  part,  which  is  used  in  this  report,  is  that 
proposed  by  Vosko,  Wilk,  and  Nusair  (VWN)  [7]. 

The  GGA  introduces  terms  depending  on  the 
gradient  of  the  electron  density  that  account  for 
inhomogeneity  in  order  to  describe  the  nonuni¬ 
form  electron  gas  in  the  case  of  atomic  systems. 
GGA  is  the  sum  of  two  terms:  the  exchange  and 
the  correlation  ones.  The  introduction  of  these 
terms,  correcting  LDA,  usually  gives  better  results 
for  comparison  with  experimental  data  for  molecu¬ 
lar  geometries,  bond  energies,  etc. 

We  have  several  possibilities  using  the  ADF 
program  for  the  GGA  calculations.  The  one  used  in 
this  report  is  the  Becke's  calculation  [8]  for  the 
exchange  term  and  the  Perdew's  [9]  for  the  corre¬ 
lation.  The  method  is  called  LDA/BP. 

Slater  orbitals  (STO)  basis  set  for  the  molecular 
orbital  (MO)  expansion  linear  combination  of 
atomic  orbitals  (LCAO)  method  were  used.  For  the 
transition  elements  an  uncontracted  triple  zeta 
(three  3d  and  three  4s)  and  one  4p  were  used  as 
the  valence  basis  set  and  2  s  and  1  p  for  the  frozen 
core  shell  (FC).  In  order  to  establish  the  orthogo¬ 
nality  of  the  valence  MO,  with  the  FC,  it  is  neces¬ 
sary  to  add  s  and  p  orbitals  to  the  valence  set.  We 
add  4s  and  3p  and  we  have  in  this  way  14  STO  for 
the  valence  set. 

For  the  main  elements  (C  and  O)  we  set  up  also 
a  triple  zeta  basis  set  with  3  s  and  3p  and  one 
polarization  d  STO;  we  add  also  Is  to  have  the 
valence  orbitals  orthogonal  to  the  FC  Is.  We  have, 
therefore,  8  STO  for  the  valence  basis  set. 

We  can  consider  the  molecular  symmetry  in 
order  to  have  the  overlap  and  Fock  matrices 
block-diagonal  by  using  symmetry-adapted  linear 
combinations  of  the  above  mentioned  STO, 
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A  set  of  auxiliary  s,  p,  d,  /,  and  g  STO  centered 
on  each  nucleus  were  used  to  fit  the  electronic 
density.  For  each  transition  metal  atom  34  STO 
were  used  and  for  C  and  O  20  STO  were  used.  In 
this  way  integrals  involving  electronic  density 
(such  as  Coulomb  potentials)  were  evaluated  us¬ 
ing  these  auxiliary  functions  and  then  taking  linear 
combination  of  these  integrals.  The  coefficients  of 
these  linear  combinations  were  evaluated  in  each 
SCF  cycle. 

The  matrix  elements  of  the  overlap  and  Fock 
matrices  were  calculated  by  numerical  integration. 
The  success  of  the  DFT  methods  such  as  the  ADF 
is  due,  among  other  reasons,  to  the  existence  of 
accurate  and  very  efficient  numerical  methods  for 
the  calculation  of  integrals  using  STO  functions 
[10]. 

Instead  of  calculating  total  energies,  as  most 
methods  do,  ADF  calculates  bond  energies  accord¬ 
ing  to  Ziegler's  procedure  [11].  These  bond  ener¬ 
gies  are  the  difference  between  the  total  energy  of 
a  molecule  (or  atom)  and  those  of  the  atoms  (or 
fragments)  in  established  reference  states. 

Optimization  of  the  geometry  at  the  LDA/BP 
level  was  done  for  all  the  geometrical  parameters 
in  the  case  of  the  MCO  compounds:  triplet  and 
quintuplet  for  FeCO,  singlet  and  triplet  for  NiCO, 
and  doublet  for  CuCO. 

Mulliken  population  analysis  was  carried  out 
and  the  intermolecular  charge  distribution  and  re¬ 
lated  dipole  moment  as  well  as  the  vibrational 
frequencies  were  calculated  for  each  molecule. 

We  have  several  possibilities  for  the  M5  clusters 
in  the  case  of  M5CO  compounds.  Ni  and  Cu  crys¬ 
tals  have  fee  (face-centered  cubic)  elementary  cells 
and  the  Fe  has  a  bcc  (body-centered  cubic)  one. 

In  the  first  two  cases  we  have  selected  two 
cluster  models,  one  that  we  call  "planar"  and 
symbolized  by  (5,  0)  that  has  the  five  atoms  on  the 
top  face.  The  other  that  is  called  "hollow"  and 
symbolized  by  (4,  1)  has  the  four  center  atoms  of 
the  lateral  cubic  faces  and  one  atom  located  at  the 
center  of  the  bottom  face  (see  Figs.  1  and  2).  We 
have  used  the  nomenclature  given  in  [2]. 

In  the  case  of  Fe5  we  have  selected  the  hollow 
(4, 1)  model,  taking  the  four  Fe  atoms  of  the  square 
top  face  and  the  central  cubic  Fe  atom. 

The  CO  group  was  oriented  normal  to  the  M4 
plane  and  passing  through  its  center.  In  one  calcu¬ 
lation  (CuCO)  we  have  tilted  the  CO  group,  to 
take  into  account  the  nonlinearity  of  the  CuCO 
molecule,  and  the  CuCO  plane  was  taken  parallel 


T° 

#  C 


FIGURE  1. 


to  one  side  of  the  M4  square,  but  the  results 
obtained  do  not  change  significant. 

Optimization  of  the  distance  d  between  the  C 
atom  and  the  M4  plane,  with  the  CO  distance 
unchanged  at  the  value  optimized  in  the  MCO 
calculation,  was  the  first  step  of  the  calculation. 
This  was  done  at  the  selected  electronic  configura¬ 
tion  with  the  lowest  energy. 

All  calculations  were  done  at  the  unrestricted 
level,  giving  initially  the  total  number  of  alpha 
and  beta  spin  electrons.  We  checked  the  electronic 
configuration  after  the  self-consistent  field  (SCF) 
convergence  to  see  if  the  lowest  unoccupied 
molecular  orbital  (LUMO)  is  lower  in  energy  than 
the  highest  occupied  molecular  orbital  (HOMO) 
both  for  alpha  and  beta  spin. 

If  this  was  not  satisfied,  we  changed  the  config¬ 
uration  accordingly  (making  sure  that  the  energy 
decreased)  until  we  got  the  lowest  value. 
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We  have  the  very  cumbersome  problem  of  the 
oscillation  in  energy  when  working  with  transition 
elements,  where  occupied  and  empty  orbital  levels 
are  very  close  in  energy.  This  is  particularly  diffi¬ 
cult  in  the  case  of  the  M5  clusters  for  Ni  and  Fe 
cases.  The  level  shift  technique  of  Saunders  and 
Hillier  in  many  cases  solves  the  problem. 

Two  procedures  were  used  to  obtain  the  sum- 
of-fragments  energy  one  was  to  increase  the 

M-C  distance  until  a  constant  value  of  the  energy 
was  reached  (asymptotic  energy)  and  the  other 
was  to  calculate  directly  the  energies  of  the  frag¬ 
ments  M  (or  M5)  and  CO. 


Results  and  Discussion 

MCO  (lVI  =  Fe,I\i,Cu)  CASE 

Molecular  Geonielries 

Optimized  geometries,  as  well  as  bond  and  dis¬ 
sociation  energies  and  electronic  configurations  are 
summarized  in  Table  I. 

All  calculations  reported  were  done  at  the 
LDA/BP  level.  It  is  well  known  [12]  that  the 
predictions  for  molecular  geometries  are  improved 
when  the  BP  corrections  are  included  in  the  LDA 
methodology,  resulting  in  values  very  close  to  the 
experimental  ones.  In  general,  DFT  methods  are  in 
many  cases  far  superior  to  Hartree-Fock  (HF)  ones, 
in  particular  for  metal-ligand  bond  lengths. 

For  CuCO  a  linear  geometry  was  supposed  [13], 
but  the  bond  energy  between  Cu  and  CO  was 
almost  null.  Optimization  of  the  energy  without 
this  restriction  gives  a  nonlinear  geometry,  as  was 
pointed  out  by  Fournier  [14],  with  a  significantly 
higher  bond  energy.  His  calculation  method  is 


very  similar  to  the  ADF  that  we  are  using  in  this 
report. 

Our  results  (Table  I)  were  similar  to  the  ones 
from  [14]  (1.858/1.160/142.8),  where  the  symbol 
(d/d' /a)  means  d  =  d(MC),  d'  =  d(CO),  and  a  = 
angle(M-C-O). 

Summarizing  the  results  we  have: 

(1.906/1.160/136.7)  Table  I 
(1.858/1.160/142.8)  [14] 

(1.963/1.132/153.1)  [15] 

The  result  from  [15]  is  the  most  exact  of  the  several 
reported  there. 

The  ground  state  was  2 A'  with  the  electronic 
configuration  (11  4/10  4)  in  the  symmetry  group 
Cs  where  the  symbol  means  11  electrons  in  a'  and 
4  in  a"  orbitals  for  alpha  spin  and  10  electrons  in 
a'  and  4  in  a”  orbitals  for  beta  spin. 

For  NiCO  we  have  considered  the  two  lowest 
energy  states:  the  ground  state  whose  geometry  is 
linear  and  is  a  singlet  (12)  with  the  configuration 
(6  6  2/6  6  2)  (6  electrons  in  a  orbitals,  6  in  tt 
orbitals,  and  2  in  6  orbitals  both  for  alpha  and  beta 
spin),  and  the  first  electronic  excited  state  is  a 
triplet  with  a  bent  geometry  with  the  configuration 
(11  4/9  4), 

Many  calculations  for  NiCO  have  been  reported 
in  the  literature.  Among  others,  we  consider  those 
of  references  [14]  and  [16]  (DFT)  ([14]  at  the  LDA 
level  and  [16]  at  the  LDA/BP  level),  [13]  (ab  initio 
CASSCF),  [17]  (ab  initio  MRSDCI-CIPSI)  in  order 
to  compare  with  those  of  Table  I. 

A  general  review  about  ab  initio  calculations  of 
structure  and  molecular  properties  of  transition- 
metal  organometallics  is  given  in  [18]. 


TABLE  I _ 

Molecular  geometries,  bond  and  dissociation  energies,  and  electronic  configurations  of  the 
MCO  compounds. 


Molecule/ state 

Molecular  geometry 

Bond 

energy 

Dissociation 

energy® 

Electronic  configuration 

d(MC){A) 

d(CO)(A) 

<  MCO  (deg) 

E  (a.u.) 

Dg  (kcal  /  mo!) 

CuCO(2A') 

1.9057 

1.1600 

136.7 

-0.575237 

16.05  (16.03) 

(11 4/10  4) 

NiCOdS) 

1.6614 

1.1692 

180.0 

-0.660224 

63.93  (62.84) 

(6  6  2/6  6  2) 

NiCO(3A') 

1 .8575 

1.1633 

143.2 

-0.605832 

31.10  (28.25) 

(11  4/9  4) 

FeCOOS) 

1 .6972 

1.1731 

180.0 

-0.753607 

58.74  (55.34) 

(6  6  2/6  6  0) 

FeCO(5A") 

1.8626 

1.1656 

164.6 

-0.741648 

35.70  (40.34) 

(11  4/8  3) 

^Parentheses  values  calculated  from 
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For  the 


For  the 


linear  NiCCXlS) 

we  have: 

(1.661/1.169) 

Table  I 

(1.663/1.715) 

[16] 

(1.637/1.167) 

[14] 

(1.693/1.164) 

[13] 

bent  NiCO(3A'): 

(1.876/1.163/143.2)  Table  I 
(1.788/1.162/147.5)  [14] 


As  we  can  see  from  these  results,  the  general 
agreement  is  rather  good.  It  must  be  pointed  out 
that  our  calculations  are  all  at  the  LDA/BP  level 
and  that  without  the  BP  corrections  the 
metal-ligand  bond  lengths  are  underestimated  by 
an  average  of  0.05  A,  and  the  BP  corrections  re¬ 
duce  the  error  to  an  average  of  0.01  A  [12].  Bond 
angles,  in  general,  are  not  influenced  significantly 
by  these  corrections. 

We  observe  from  our  results  an  increase  of  0.22 
A  for  the  Ni-C  and  a  reduction  of  0.006  A  for  the 
d(C-O)  distances  as  well  as  a  change  in  the  bond 
angle  Ni-C-O  from  180°  to  143°  when  we  go  from 
the  ground  to  the  first  excited  state. 

In  the  case  of  FeCO  we  have  a  linear  triplet  3S 
ground  state  and  a  bent  quintuplet  5A"  first  ex¬ 
cited  state.  A  comparison  between  reported  ge¬ 
ometries  and  ours  is  given  below, 

FeCO(32): 


(1.697/1.173) 

Table  I 

(1.660/1.175) 

[14] 

(1.809/1.190) 

[17] 

(2.039/1.256) 

[19] 

FeCO(5A"): 


(1 ,863 /1 .166 /164.6)  Table  I 
(1.808/1.160/158.1)  [14] 

(1.900/1.201/180)  [17] 

The  agreement  between  [14]  and  ours  is  rather 
good  (taking  into  account  the  above-mentioned 
calculated  levels).  The  bond  distances  of  [17]  are  a 
little  bit  higher  and,  furthermore,  the  geometry  of 
the  excited  state  was,  apparently,  taken  to  be  lin¬ 
eal. 


Dissociation  Energies 

Bond  energies  for  the  MCO  compounds  are 
given  in  Table  I  for  the  above-mentioned  states  at 
the  equilibrium  geometry. 

The  asyny  totic  energy  was  obtained  at  approxi¬ 
mately  4.0  A  where  we  can  consider  the  molecule 
dissociated  into  the  M  and  CO  fragments.  The 
identification  of  the  state  of  these  fragments  is 
interesting.  This  is  done  by  calculating  the  energies 
of  selected  states  of  these  fragments  that  reproduce 
these  constant  energies  (that  we  call  asymptotic 
energies). 

Starting  with  CuCO  we  have  an  asymptotic 
(bond)  energy  of  -0.54968  a.u.  For  the  fragments 
we  have  E{Cu(slrf9)}  =  -0.00941  a.u.  (it  is  not 
zero  because  this  is  not  the  reference  state  chosen 
for  the  Cu  atom)  and  E{CO(1.1600)}  =  —0,539719 
a.u,,  and  their  sum  is  -0,54966  a.u.,  in  very  good 
agreement  with  the  asymptotic  energy.  The  differ¬ 
ence  between  the  minimum  reported  in  Table  I 
and  the  asymptotic  energy  is  16.05  kcal/mol  (1 
a.u.  =  627.5  kcal/mol).  The  analog  if  using  E^^f  is 
16.03  kcal/mol. 

In  this  way  we  identify  the  state  of  the  products 
as  Cu(sld9)  and  CO(lS)  (the  ground  state  for  CO). 
The  experimental  value  reported  in  [17]  is  16.6 
kcal/mol,  in  very  close  agreement  with  ours.  Re¬ 
ported  values  are  20.3  (19.6)  kcal/mol  in  [16],  19.4 
kcal/mol  in  [17]  (but  the  linear  geometry  was 
considered  here).  A  value  of  22.72  kcal/mol  was 
quoted  in  [12]  where  the  ADF  method  was  used 
(but  in  [1]  in  which  an  earlier  version  of  the  ADF 
method  was  used  a  value  of  12  kcal/mol  was 
given).  However,  only  in  [16]  and  ours,  a  bent 
geometry  was  assumed. 

In  the  case  of  NiCO,  we  consider  the  singlet 
ground  state  IS  with  a  configuration  (6  6  2/6  6 
2)  and  the  corresponding  asymptotic  energy 
is  -  0.55853  a.u.  In  this  case,  the  SCF  convergence 
is  difficult  to  reach,  and  it  was  necessary  to  intro¬ 
duce  a  level  shift  of  0.1  a.u.  So  we  have  perhaps, 
an  error  in  this  value.  The  E^^f  of  the  fragments  Ni 
and  CO  that  fits  better  this  value  is  the  one  corre¬ 
sponding  to  Ni(slrf9)  with  a  bond  energy  (with 
respect  to  the  reference  state)  of  -0.021245  a.u. 
that  added  to  -0.538842  a.u.  corresponding  to 
CO(1.1692  A)  gives  -  0.560087  a.u.,  in  relative  good 
agreement  with  the  asymptotic  energy.  With  the 
equilibrium  value  of  Table  I  and  the  sum  of  frag¬ 
ments  we  have  62.84  kcal /mol. 

This  is  quite  a  high  value  compared  with  those 
reported  in  the  literature.  However,  a  higher  value 
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of  79.0  kcal/mol  by  a  DFT  method  was  reported  in 
[20].  In  [16]  a  value  of  54.5  kcal/mol  is  given,  but 
as  a  result  of  the  most  accurate  calculations  and 
experiments  a  value  of  about  40  kcal/mol  seems 
to  be  quite  acceptable.  For  example,  a  Coupled 
Cluster  Singles,  Doubles  (Triples)  (CCSD(T))  calcu¬ 
lation  with  a  large  basis  gives  a  value  of  38.0 
kcal/mol  [21]. 

The  lowest  excited  triplet  state  is  33.8  kcal/mol 
above  the  ground  state  (36  kcal/mol  according  to 
[16]).  As  we  said,  the  equilibrium  geometry  is  bent 
(symmetry  Cs).  The  electronic  configuration  is  (11 
4/9  4)  and  the  asymptotic  energy  is  -0.556276 
a.u.  The  values  of  are  —0.560799  a.u.  if  we 
adopt  for  CO  (C-O  distance  equal  to  1,1633  A)  the 
value  —0.539554  a.u,  and  for  Ni(sld9)  the  value 
—  0.009941  a.u.  and  —0.572375  a.u.  if  consider  in¬ 
stead  the  Ni(s2rf8)  state,  whose  energy  is 
-0.032821  a.u.  The  asymptotic  energy  is  closest  to 
the  first  value,  so  we  can  identify  the  dissociated 
state  of  the  Ni  atom  as  the  sld9  one. 

With  the  asymptotic  value  the  dissociation  en¬ 
ergy  is  31.10  kcal/mol,  but  if  we  consider  the 
ones,  this  result  moves  to  28.25  kcal/mol  for 
Ni{sld9)  and  21.00  kcal/mol  for  Ni(s2d8).  These 
values  are  compared  with  18  kcal/mol  found  in 
[14].  In  [13]  the  geometry  considered  for  this  ex¬ 
cited  state  is  a  linear  one. 

Let  us  consider  now  the  states  of  FeCO.  The 
ground  state  is  a  linear  triplet  (32)  with  an  asymp¬ 
totic  energy  of  -0.6600  a.u.  and  its  configuration 
is  (6  6  2/6  6  0). 

The  states  of  the  fragments  Fe  and  CO  that  give 
the  best  fit  for  this  value  are  CO(l2  +  ;  1.1731) 
with  a  bond  energy  of  —0.538511  a.u.  and  Fe(sl^f7) 
quintuplet  with  a  bond  energy  of  —0.104308  a.u. 
The  EgQjr  value  is,  therefore,  —0.665422  a.u.  giving 
a  dissociation  energy  of  55.34  kcal/mol.  If  we  use 


instead  the  asymptotic  energy,  it  changes  to  58.74 
kcal/mol. 

Reported  values  for  this  energy  are  49(33) 
kcal/mol  [14],  30.9  kcal/mol  [17]  (experimental 
values  between  25.1  and  36.9  are  reported  here), 
and  28.6  kcal/mol  [13]. 

Our  calculated  value  is  about  18  kcal/mol 
higher  relative  to  the  average  value  of  40  kcal/mol. 
This  is  surprising  because  DFT  methods  are  con¬ 
sidered  to  yield  results  at  a  very  good  level  of 
accuracy  when  the  BP  corrections  are  made  (at  the 
LDA  level,  however,  errors  of  about  100%  are 
common)  [12]. 

Something  similar  happens  in  the  case  of  NiCO 
ground  state,  whose  corresponding  difference  is 
about  23  kcal/mol. 

The  first  electronic  excited  state  for  FeCO  is  the 
bent  quintuplet  FeCC)(5A" )  with  the  configuration 
(11  4/8  3),  7.50  kcal/mol  above  the  ground  state. 
The  asymptotic  energy  is  -  0.684752  a.u.  when  we 
specify  15  and  11  as  the  total  number  of  alpha  and 
beta  electrons,  respectively,  and  consider  the  ge¬ 
ometry  at  diEe-C)  =  5.0  A  as  linear.  The  CO 
ground-state  energy  is  —0.539340  a.u.  If  we  adopt 
the  si  d7  quintet  state  for  Fe,  the  sum  of  the 
fragments  energy  is  -0.666251  a.u.  On  the  other 
hand,  if  we  choose  the  s2d6  quintet  state  for  Fe, 
we  get  a  value  of  —0.677357  a.u.  The  asymptotic 
energy  is  closer  to  the  last  value,  whence  we  can 
correlate  the  dissociated  state  of  Fe  with  the  quin¬ 
tet  s2d6.  We  have,  consequently,  the  value  40.34 
kcal /mol  for  the  dissociation  energy  using  the  E^^f 
value  and  35.70  kcal/mol  using  the  asymptotic 
energy. 

Reported  values  are:  31  kcal/mol  in  [14],  13,4 
kcal/mol  in  [17],  and  22.6  and  18.5  kcal/mol  in 
[13];  the  last  value  was  obtained  at  higher  compu¬ 
tational  level  then  the  previous  22.6  kcal/mol.  In 


TABLE  II _ 

Atomic  charges  and  dipole  moments  for  the  MCO  compounds. 


Atomic  charges 


Molecule/ state 

M(s) 

M(p) 

M(d) 

M 

c 

0 

Dipole  moment  (D) 

CuCO(2) 

0.1011 

-0.2214 

0.2235 

0.1032 

0.2064 

-0.3096 

0.7014 

NiCOd) 

0.3042 

-0.0566 

-0.1100 

0.1376 

0.2404 

-0.3780 

3.5023 

NiCO(3) 

0.0461 

-0.2581 

0.3251 

0.1132 

0.2136 

-0.3268 

0.9997 

FeCO(3) 

0.0304 

-0.0655 

0.2290 

0.1939 

0.1866 

-0.3805 

3.2350 

FeCO(5) 

0.1407 

-0.3826 

0.4315 

0.1896 

0.1488 

-0.3384 

0.7694 
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the  two  last  references  the  FeCO  molecule  was 
supposed  to  be  linear. 


Atomic  Charges  and  Dipole  Moments 

We  consider  in  this  report  the  Mulliken's  popu¬ 
lation  analysis  as  the  method  of  choice  for  obtain¬ 
ing  the  charge  distribution  in  the  molecule.  In 
order  to  understand  the  data  that  are  displayed  in 
Tables  IT  HI,  and  IV  we  consider  the  case  of  the 
CuCO  molecule  at  the  equilibrium  geometry. 

The  Elementary  Atomic  Basis  Functions  (BAS) 
population  is  given  for  the  orbitals  s,  p  and  d,  in 
this  order,  for  Cu,  C,  and  O  and  for  alpha  and  beta 
spin: 


Alpha  spin  Beta 

spin 

Cu 

s 

1 .7301  Cu 

s 

1.1688 

p 

3.1669 

p 

3.0545 

d 

4.8968 

d 

4.8797 

C 

s 

0.7750  C 

s 

0.7279 

p 

1 .2042 

p 

0.9994 

d 

0.0445 

d 

0.0427 

0 

s 

0.9364  0 

s 

0.9362 

p 

2.2171 

p 

2.1613 

d 

0.0291 

d 

0.0295 

The 

differences  between  the  corresponding 

numbers  are  the  spin  polarization  that 

are  shown 

in  Table  IV  for  the  metal  (Cu 

in  our 

case).  The 

TABLE 

III 

Spin  densities  on  the  metal  atom  for 

MCO  compounds. 

Molecule /state 

cl(s) 

Cl(p) 

cl(d) 

CuCO(2) 

0.5613 

0.1124 

-0.0171 

NiCO(3) 

0.5581 

0.1440 

1 .0537 

FeCO(3) 

-0.2202 

-0.0133 

2.4964 

FeCO(5) 

0.7241 

0.2590 

3.1361 

TABLE 

IV 

Overlap  populations  C-M  for  the 

MCO  compounds. 

Molecule /state 

Alfa 

Beta 

Total 

CuCO(2) 

0.1293 

0.1447 

0.2740 

NiCOd) 

0.2076 

0.2076 

0.4152 

NiCO(3) 

0.1439 

0.1789 

0.3228 

FeCO(3) 

0.1877 

0.2124 

0.4001 

FeCO(5) 

0.1674 

0.2032 

0.3706 

corresponding  sums  for  Cu  are  2.8989  for  s  or¬ 
bitals,  6.2214  for  p,  and  9.7765  for  d,  and  they  give 
the  total  (alpha  +  beta  spin)  population  as  the 
valence  contribution.  If  we  adopt  the  valence  con¬ 
figuration:  (3s)2(3p)6(4s)l(3d)10,  then  we  have 
3s,6p,10d  electrons  that  give  us  the  net  electronic 
charge  0.1011  for  s  orbitals,  -0.2214  for  p,  and 
0.2235  for  d.  These  are  the  values  in  Table  II 
labeled  as  M(s)M(p)M(d).  The  sum  of  these  three 
numbers  gives  the  value  0.1032  labeled  M  in  Table 
II. 

This  number  can  also  be  obtained  from  the  total 
alpha  +  beta  population  for  Cu  that  amoimts,  in 
our  case,  to  18.8968.  For  Cu  (atomic  number  29)  we 
have  10  core  electrons  and,  therefore,  19  valence 
electrons.  The  net  charge  for  Cu  is,  therefore, 
0.1032.  Similarly,  we  can  obtain  0.2064  and  -  0.3096 
as  the  net  charges  for  C  and  O,  respectively. 

In  Table  II  we  also  report  the  dipole  moment  (in 
Debye),  and  it  is  interesting  to  compare  some 
dipole  moments  reported  in  the  literature  with 
ours. 

This  comparison  give  the  results: 

CuCO(2): 


0.7014  D 
0.64  D 
3.49  D 

3.74  D  (2.74  D) 


Table  II 

[16]  (0.68  D  at  LDA  level) 

[17]  (linear  geometry) 

[13]  (linear  geometry) 


NiCOd): 


3.5023  D  Table  II 

3.42  D  [16] 

3.18  D  (4.55  D)  [13] 

NiCCXS): 

0.9997  D  Table  II 

2.69  D  (1.63  D)  [13] 

FeCCXS): 

3.2350  D  Table  II 
1.38  D  [17] 

2.18  D  [19] 

FeCO(5): 


0,7694  D  Table  II 

1.55  D  [17] 

0.762  D  (1.85  D)  [13] 
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As  we  can  see  from  these  results,  the  agreement 
with  DFT  calculations  of  [16]  is  quite  good,  but  the 
other  calculation  gives  results  coincident  with  ours 
only  in  a  few  cases. 

Overlap  Populations 

We  consider  only  the  overlap  populations  be¬ 
tween  the  C  and  the  M  atoms,  which  are  a  mea¬ 
sure  of  the  C-“0  bond  strength.  We  expect  to  find, 
therefore,  a  relationship  between  this  population 
and  the  dissociation  energy  of  the  CO  group.  From 
the  data  of  Tables  I,  III,  IV  we  have,  taken  in 
increasing  order  the  overlap  populations: 

CuCO  (0.2740, 16.03)  (overlap  population 

and  dissociation 
energy,  in 
parentheses, 
respectively) 

NiCO(3)  (0.3228, 28.25) 

FeCO(5)  (0.3706, 40.34) 

FeCO(3)  (0.4001, 55.34) 

NiCO(l)  (0.4152,62.84) 

This  relationship  can  be  graphed  and  the  corre¬ 
sponding  curve  is  quite  smooth.  CuCO  gives  the 
lowest  overlap  population  and  NiCO(l)  the  high¬ 
est. 

Vibrational  Frequencies 

Vibrational  frequencies  (in  cm“^)  for  the  MCO 
molecules  are  displayed  in  Table  V  as  well  as 
intensities  (in  parentheses)  in  km/mol.  We  com¬ 
pare  these  values  with  those  reported  in  the  litera¬ 
ture.  The  three  values  considered  are:  for  the 

bending  M-C-O  mode,  /2  for  the  stretching  of  the 
M-C  bond,  and  the  highest  corresponding  to 
the  stretching  of  the  C-O  bond. 

Comparison  with  vibrational  frequencies  re¬ 
ported  in  the  literature  give  the  results: 

CuCa2): 

231(13)  386(7)  1951(848)  Table  V 

237(23)  439(6)  1978(906)  [16] 

—  363  2075  [15] 

—  320  2010  (experimental 

value  quoted 
in  [17]) 


243(18)  517(7)  2026(861)  [14]  (here  an 

experimental 
value  of  2042 
for/3  is 
quoted) . 

NiCOd)  (we  have  a  great  deal  of  calculations  to 
compare  with  from  which  we  have  made  the  fol¬ 
lowing  selection): 

342(10)  617(0.2)  2005(530)  Table  V 

311(12)  614(2)  2019(565)  [16]  (an 

experimental 
value  of 
2028  for  /3  is 
quoted) 

324(11)  646(2)  2076(524)  [14] 

—  —  2010-2074  [19] 

NiCO(3): 

243(20)  460(9)  1939(861)  Table  IV 

265(25)  551(10)  2017(871)  [14] 

FeCO(3): 

357(5)  574(26)  1948(986)  Table  V 

341(13)  638(5)  2020(680)  [14]  (an 

experimental 
value  of  1930 
is  quoted 
for/3). 

546  1986  [17] 

320  1546  [19] 

FeCCXS): 

153(51)  462(6)  1960(551)  Table  V 

241(45)  542(10)  2003(659)  [14] 

552  1876  [17] 

We  conclude  that  the  agreement  of  our  values 
with  the  experimental  ones  are  very  satisfactory  as 
well  as  with  those  from  [14].  Our  calculations  are 
at  the  LDA/BP  level  (those  of  [14]  are  at  the  LDA 
level),  and  it  is  considered  that  LDA  corrections 
(such  as  the  GGA  ones)  usually  improve  the  re¬ 
sults  of  the  frequencies  obtained  by  DFT  methods 
[22]. 
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MgCO  (M  =  re,lNi,Cu)  CASES 

In  Tables  VI  and  VII  the  relevant  parameters 
that  we  have  calculated  in  this  report  are  dis¬ 
played. 

In  Table  VI  we  report  the  optimized  distance 
from  the  C  atom  to  the  center  of  the  M4  square 
keeping  the  CO  distance  fixed  at  the  value  of  the 
corresponding  MCO  compound.  Only  in  the  case 
of  CuCO  have  we  optimized  this  distance. 

In  the  cases  of  CU5CO  and  NijCO  we  have  the 
two  choices  for  the  cluster  M5  those  of  (5,  0)  and 
(4, 1);  for  the  FesCO  we  have  only  the  (4, 1)  cluster 


geometry  among  the  M5  clusters  of  C4V  symme¬ 
try. 

We  report  bond  energies  at  equilibrium  and  E^f 
energies.  The  difference  between  these  two  values 
is  the  vertical  dissociation  energy  that  is  also  re¬ 
ported  as  well  as  the  corresponding  configuration 
using  the  symmetry  C4V  (symmetries  are 
flj,  a2,  b^,  b2,  Cl,  in  that  order).  We  have  also  calcu¬ 
lated  asymptotic  energies  and  checked  that  they 
agree  with  the  corresponding  energies.  The 
energy  of  CO  was  obtained  at  the  bond  distance 
used  in  the  M5CO  calculations.  We  subtracted 


TABLE  V  _ - _ _ _ _ _ 

Vibrational  freauencies  f  (cm  ’)  and  (in  parentheses)  intensity  in  (km /mol)  for  MCO  compounds. 

Molecule /state 

stretch.  (MC) 

f2  bend. 

fg  Stretch.  CO 

CuCO(2) 

231  (13) 

386  (7) 

1951  (848) 

NiCO(l) 

342  (10) 

617  (0.2) 

2005  (530) 

NiCO(3) 

243  (20) 

460  (9) 

1939  (861) 

FeCO(3) 

357  (5) 

574  (26) 

1948  (986) 

FeCO(5) 

153  (51) 

462  (6) 

1960  (551) 

TABLE  VI  - - - - - - 

Distance  d  from  C  to  the  M4  plane  (A),  bond  energy  £  (a.u.),  E^ot  (a.u.),  dissociation  energy  (kcal  /  mol), 


and  electronic  configuration  for  the  M5CO  compounds. _ 

Molecule /state  d  E  E3,,  D,  Configuration 


Cu5CO(2Ai)(5,0) 

1.80 

-0.791656 

Cu5CO(2Ei)(5,0) 

1.90 

-0.793778 

Cu5CO(2Ai)(4,1) 

0.85 

-0.707294 

Cu5CO(2Ei)(4,1) 

1.73 

-0.719386 

NisCO(1)(5,0) 

1.70 

-0.956774 

Ni5CO(1)(4,1) 

0.00 

-0.863319 

Ni5CO(3)(5,0) 

1.80 

-0.957368 

Ni5CO(3)(4,1) 

0.10 

-0.850991 

Fe5CO(3)(4,1) 

1.05 

-1.543196 

Fe5CO(5)(4,1) 

1.15 

- 1 .585505 

-0.752472 

24.59 

(13  3  7  4  26/12  3  7  4  26) 

-0.783390 

6.52 

(13  3  7  4  26/13  3  7  4  25) 

-0.673491 

21.21 

(13  3  7  4  26/12  3  7  4  26) 

-0.713274 

3.86 

(13  3  7  4  26/13  3  7  4  25) 

-0.893778 

33.37 

(12  3  7  4  24/12  2  7  4  25) 

-0.820598 

19.07 

(12  3  7  4  24/12  2  7  4  25) 

-0.896044 

38.48 

(12  3  6  4  26/12  3  6  3  25) 

-0.820598 

19.07 

(12  3  6  4  26/12  3  6  3  25) 

-1.498714 

27.86 

(12  2  4  6  22/12  2  3  5  22) 

-1.498714 

54.56 

(11  3  3  6  24/12  1  3  6  21) 

TABLE  VII _ _ 

Net  atomic  charges  and  dipole  moments  (D)  of  the  M5CO  compounds. 


Dipole 


Molecule /state 

q(Mi) 

q(M) 

q(C) 

q(0) 

q(CO) 

moment 

Cu5CO(2)(5,0) 

-0.2842 

0.0729 

0.3324 

-0.3400 

-0.0076 

0.4747 

Cu5CO(2)(4,1) 

0.1049 

0.0183 

0.1478 

-0.3258 

-0.1780 

0.4241 

Fe5CO(3)(4,1) 

-0.0026 

0.0460 

0.2124 

-0.3940 

-0.1816 

0.1990 

Fe5CO(5)(4,1) 

0.0422 

0.0321 

0.2281 

-0.3989 

-0.1708 

0.5176 

Ni5CO(1)(5,0) 

-0.1815 

0.0660 

0.2963 

-0.3786 

-0.0823 

0.0920 

Ni5CO(1)(4,1) 

0.1669 

0.0458 

0.0171 

-0.3671 

-0.3500 

0.3019 

Ni5CO(3)(5,0) 

-0.1995 

0.0655 

0.2937 

-0.3562 

-0.0625 

0.0976 

Ni5CO(3)(4,1) 

0.1419 

0.0552 

-0.0248 

-0.3380 

-0.3682 

0.8143 
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from  the  occupation  numbers  for  the  alpha  and 
beta  spin  the  corresponding  ones  to  CO  in  the 
symmetry  C4V,  that  are  (3  0  0  0  2/3  0  0  0  2/ 
and  we  obtained  the  occupational  numbers  for  the 
M5  fragment. 

Consider  as  an  example  the  case  of  the  Fe5CO 
quintuplet.  The  bond  energy  at  3.0  A  is  - 1.490355 
a.u.  and  the  configuration  is  (11  3  3  6  24/12  1  3 
6  21).  The  configuration  for  the  Fe5  fragment  is, 
therefore  (9  333  22/9  1  36  19),  and  its  bond 
energy  is  —0.953107  a.u.  The  CO  bond  energy  at 
the  CO  distance  of  1.17  A  is  -  0.538758  a.u.,  whence 
the  EgQf  is  -1.491865  A,  agreeing  with  the  calcu¬ 
lated  energy  at  3.0  A. 

The  reason  for  this  checking  is  that  in  some 
cases  we  had  to  shift  the  virtual  orbitals  because  of 
convergence  problems,  and  this  might  introduce 
some  uncertainty  in  the  energy.  These  problems 
appeared  very  often  at  large  bond  distances  when 
we  wanted  to  calculate  the  asymptotic  energies. 

It  is  interesting  to  assess  the  influence  on  the 
energy  of  the  position  of  the  ring  M4  when  it  is 
moved  along  the  axis  of  the  MCO  molecule  con¬ 
sidered  in  the  linear  geometry. 

At  long  distances  between  the  center  of  the  M4 
square  and  the  M  of  the  MCO,  the  energy  of  the 
system  is  the  sum  of  the  energies  of  MCO  and  M4. 
This  is  natural  because  the  interaction  between  the 
two  fragments  is  negligible.  When  the  two  frag¬ 
ments  become  closer,  the  energy  drops  when,  as  in 
this  case,  there  is  an  attractive  interaction  between 
the  two  molecules.  This  continues  until  an  equilib¬ 
rium  situation  is  set  up,  and  then  the  energy 
begins  to  increase. 

In  the  case  of  NiCO  at  long  distances  between 
the  square  Ni4  and  the  NiCO  the  energy  is  -0.818 
a.u.,  that  is,  the  sum  of  the  energies  of  the  NiCO 
(1)  -0.660  (from  Table  I)  and  of  the  square  Ni4 
that  was  calculated  at  the  experimental  geometry 
of  the  Ni  crystal  and  was  -0.158  a.u. 

When  the  Ni4  square  moves  toward  the  Ni 
atom  of  NiCO,  the  energy  drops  until  a  minimum 
is  reached  at  which  the  energy  is  -0.956774  a.u. 
This  happens  when  the  Ni  of  the  NiCO  is  in  the 
center  of  the  Ni4  square  (d  ~  1.70  A)  and  we  are 
in  the  case  of  the  cluster  (5,  1)  (see  Table  VI). 

When  the  Ni4  square  continues  to  move  toward 
the  C  atom  and  the  Ni  atom  of  the  NiCO  is  on  the 
other  side  of  this  square,  the  energy  begins  to 
increase,  and  this  continues  until  the  distance  of 
this  fifth  Ni  atom  from  the  Ni4  square  is  the 
corresponding  to  the  Ni(4,l)  cluster.  In  this  case 
the  energy  increases  up  to  the  value  -0.863619 


a.u.  according  to  Table  VI  and  the  distance  d  of  the 
C  atom  from  the  center  of  the  Ni4  square  is  0.0  A. 
This  particular  distance  in  the  case  of  Cu^CO  is 
0.85  A  and  in  the  case  of  Ni5CC)(3)  is  0.65  A. 

Bond  Distances  and  Dissociation  Energy 

Let  us  consider,  in  the  first  place  the  case  of 
CU5CO.  There  are  two  studies  from  Baerends  et  al. 
[1,  2]  in  which  this  compound  with  the  geometries 
(5,0)  and  (4,1)  and  symmetries  C4V  were  studied. 
This  was  done  at  the  LDA  level  instead  of  at  the 
LDA/BP  as  was  done  in  this  report. 

According  to  Tables  I  and  VI  the  distance  C-Cu^ 
changes  from  1.91  to  1,80  A  when  we  move  from 
CuCO  to  CU5CO  (5,0)  due  to  additional  effect  of 
the  CU4  group  and,  at  the  same  time,  the  dissocia¬ 
tion  energy  is  increased  from  16.0  to  24.6  kcal/mol, 
meaning  a  more  stable  C-Cu^  bond.  On  the  other 
hand,  when  we  consider  the  cluster  (4,1)  whose 
equilibrium  distance  from  the  C  to  the  CU4  plane 
is  0.85  A  and  taking  into  account  that  the  distance 
from  the  Cu^  to  that  plane  is  1.795  A,  the  distance 
C-Cu^  is  2.60  A,  larger  than  the  corresponding  one 
to  the  (5,0)  case,  and  the  dissociation  energy  is 
therefore  lower  (21.2  kcal/mol  in  Table  VI). 

Contrary  to  the  conclusion  reached  in  [1],  the 
most  stable  place  for  CO  to  interact  with  the  Cu 
crystal  is  the  top  position,  and  the  hollow  position 
would  be  a  little  bit  less  preferred.  In  [1]  the 
distances  from  C  to  the  CU4  plane  are^l.93  A  (ours 
is  1.80  A)  for  the  (5,0)  case  and  1.39  A  (1.217  A  in 
[2])  for  the  (4,1)  one  (ours  is  0.85^  A).  The  experi¬ 
mental  distance  reported  is  1,90  A,  agreeing  with 
the  top  site  adsorption.  The  dissociation  energies 
reported  in  [1]  are  very  different  from  ours  (14.6 
kcal/mol)  (22.72  kcal/mol  in  [12])  (ours  is  24.6 
kcal/mol)  for  the  (5,0)  case  and  30.9  kcal/mol 
(38.0  kcal/mol  in  [12])  (ours  21.2  kcal/mol)  for  the 
(4,1)  one.  The  experimental  one  is  15  kcal/mol,  as 
reported  in  [1]. 

We  consider  now  the  Ni5CO(l)  case.  Going  from 
NiCO  to  Ni5CO  (5,0)  the  distance  C-Ni;^  changes 
from  1.66  to  1.70  A  and  the  dissociation  energy 
from  63.9  to  39,52  kcal/mol,  meaning  that  the 
bond  C-Ni|  is  weaker  when  the  Ni4  group  inter¬ 
acts  with  NiCO,  and  therefore  the  distance  C-Ni^ 
must  increase  somewhat,  in  agreement  with  our 
results. 

When  Ni4  moves  toward  the  C  atom  the  cluster 
(4,1)  is  set  up  with  a  distance  of  1.76  A  from  Ni^  to 
the  Ni4  plane,  then  this  C  atom  is  on  the  Ni4  plane 
(d  =  0  according  to  the  Table  VI).  The  distance 
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C-Nij  is  1.76  A,  a  little  longer  than  in  the  (5,0) 
case,  and  the  dissociation  energy  is  lowered  from 
39.52  k  in  the  (5,0)  case  to  22.1  kcal/mol. 

The  only  reported  data  we  found  were  in  [23], 
where  a  value  of  2.04  A  of  the  C-Nij  is  given  for 
the  (5,0)  case  (ours  is  1.70  A),  but  we  have  used 
the  BP  correction. 

Moving  to  the  NijCOO)  case,  we  find  the  inter¬ 
esting  fact  that  for  the  cluster  (5,0)  this  triplet  state 
is  a  little  more  stable  than  the  singlet  one  by  1.42 
kcal/mol,  contrary  to  the  NiCO(3)  where  the  sin¬ 
glet  is  more  stable  than  the  triplet  by  34.13 
kcal/mol.  This  situation  is  opposite  to  the  case  of 
the  (4,1)  cluster,  where  the  singlet  is  more  stable 
than  the  triplet  by  7.79  kcal/ mol.  ^ 

The  C-Nj  distance  is  changed  from  1.88  A  in 
NiCO  to  1.80  A  in  NijCO  (5,0)  and  to  2.41  A  in  the 
(4,1)  case.  The  corresponding  dissociation  energies 
are  31.10  kcal/mol  from  Table  I  and  38.48  kcal/mol 
for  the  (5,0)  case  and  15.0  kcal  for  the  (4,1)  one. 
This  shows  us  that  the  behavior  for  the  triplet  state 
of  NijCO  is  different  from  the  singlet  case.  The 
maximum  interaction  of  the  Ni4  group  with  NiCO, 
that  occurs  in  the  (5,0)  cluster,  shortens  the  C-Nij 
bond  and  therefore  increases  the  bond  energy.  For 
the  (4,1)  case  the  distance  from  C  to  the  Ni4  plane 
is  0.65  A  and  as  a  consequence  the  distance  to  Nij 
is  2.41  A,  which  is  longer  than  the  corresponding 
to  Ni-C  in  NiCO  (1.88  A).  The  bond  energy  re¬ 
duces  to  15.0  kcal/mol,  less  than  the  31.10 
kcal/mol  value  corresponding  to  the  NiCO  case. 

Similarly  to  the  CuCO  case,  for  both,  the 
NiCO(l)  and  NiCC)(3),  the  top  position  from  the 
energetical  point  of  view  is  preferred  to  the  hollow 
one. 

For  FejCO  we  have  only  the  case  (4,1).  We 
observe  from  Table  VI  that  the  Fe5CO  quintuplet 
is  more  stable  than  the  triplet  by  28.2  kcal/ mol. 


The  fifth  Fe  atom  is  1.435  A  from  the  Fe4  plane, 
therefore,  according  to  Table  VI,  the  distance  C— Fcj 
is  2.485  A  for  the  triplet  FegCO  and  2.535  for  the 
quintuplet. 

For  the  triplet  the  distance  C-Fe  changes  from 
1.70  to  2.49  A  and  the  dissociation  energy  drops 
from  58.74  kcal/mol  for  the  triplet  FeCO  to  27.86 
kcal/mol  for  the  triplet  FegCO.  This  means  that 
the  C-Fe  bond  is  now  weaker.  In  the  case  of  the 
quintuplet,  the  distance  C-Fej  is  also  increased 
from  1.86  to  2.54  A,  but  the  corresponding  dissoci¬ 
ation  energy  does  not  drop,  as  in  the  previous 
case,  but  increases  from  35.70  to  56.12  kcal/mol. 
This  could  be  rationalized  if  the  effect  of  the  Fe4 
group  were  greater  than  the  change  of  the  C— Fej 
distance,  so  the  direction  in  which  the  change  of 
dissociation  energy  occurs  would  be  more  influ¬ 
enced  by  the  presence  of  this  Fe4  group  than  for 
the  change  in  the  Fe-C  distance.  It  is  also  interest¬ 
ing  to  note  that,  for  the  FeCO  case,  the  triplet  is 
more  stable  than  the  quintuplet,  contrary  to  the 
FejCO  case  where  the  quintuplet  is  more  stable 
than  the  triplet. 

Atomic  Charges  and  Dipole  Moments 

In  Table  VII  the  net  gross  atomic  charges  of  the 
metal  atom  Mj  and  the  rest  the  M4  atoms,  as  well 
as  C,  O,  and  CO,  and  the  dipole  moments  are 
displayed. 

Observing  those  values  we  conclude: 

1.  Going  from  the  MCO  to  the  M5CO(5,0)  cases, 
there  is  a  significant  lowering  of  the  negative 
CO  charge,  meaning  that  electronic  charge  is 
flowing  from  CO  to  the  M5  group.  Dipole 
moments  are  also  reduced,  meaning  that  the 
electric  charge  is  more  smoothly  distributed. 


TABLE  VIII _ _ 

Overlap  populations  C-M  of  M5CO  compounds. 


Molecule/ state 


Cu5CO(2)(5,0) 

Cu5CO(2)(4,1) 

Ni5CO(1)(5,0) 

Ni5CO(1)(4,1) 

Ni5CO(3)(5.0) 

Ni5CO(3)(4,1) 

Fe5CO(3)(4,1) 

Fe5CO(5)(4,1) 


Alpha  Beta 


CM,  CMa  CM, 


0.1495 

0.0042 

0.1560 

-0.0161 

0.0374 

0.0430 

0.1640 

0.0199 

0.1742 

0.1452 

0.0403 

0.1442 

0.1753 

0.0177 

0.1787 

0.1436 

0.0545 

0.1167 

0.0355 

0.0496 

-0.0118 

0.0125 

0.0633 

-0.0224 

Total 


CM2 

CM^ 

CM2 

CM 

0.0226 

0.3055 

0.0268 

0.4127 

0.0690 

0.0269 

0.1067 

0.4537 

0.0226 

0.3382 

0.0425 

0.5082 

0.0476 

0.2894 

0.0879 

0.6410 

0.0202 

0.3540 

0.0379 

0.5056 

0.0603 

0.2603 

0.1148 

0.7195 

0.0575 

0.0237 

0.1071 

0.4521 

0.064 

-0.0099 

0.1273 

0.4993 
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2.  Unlike  the  MCO,  in  the  M5CO(44)  cases  what 
takes  place  is  an  increase  in  the  electronic 
charge  of  the  CO  group,  except  in  the  Fe5CO 
triplet  and  quintuplet  cases,  where  there  is 
only  a  small  reduction  in  this  charge  and  the 
dipole  moments  are  reduced  too. 

3.  In  the  singlet  and  triplet  Ni5CO  cases,  com¬ 
paring  the  (5,0)  with  the  similar  (4,1)  cases, 
we  observe  that  the  electronic  charge  of  the 
CO  group  is  increased  as  well  as  the  dipole 
moments.  On  the  other  hand,  in  the  CugCO 
cases,  both  the  CO  electric  charge  and  the 
dipole  moments,  remain  almost  unchanged. 

Overlap  populations 

In  Table  VIII  the  overlap  populations  of  the 
C-M^  and  the  four  C-M2  bonds  for  alpha  and 
beta  spin  are  shown,  as  well  as  the  total  alpha  plus 
beta  spin.  The  total  overlap  population  between  C 
and  M5  is  CM  =  CM^  4-  4CM2.  This  is  displayed 
in  the  last  column  and  is  the  corresponding  to  the 
C~M  bond  of  the  MCO  compounds. 

As  in  the  case  of  the  MCO  compounds  we  can 
observe  a  close  relationship  between  the  total  over¬ 
lap  population  of  the  C  and  the  M5  cluster  and  the 
dissociation  energy.  In  the  case  of  the  (5,0)  clusters 
we  have  that  increased  values  of  the  overlap  popu¬ 
lations  are  in  the  order  Cu5CO(2Ei)(0.3420), 
Cu5C)(2A|)(0.4127), . . . ,  and  the  highest  values  are 
those  of  the  Ni5CO  singlet  (0.5082)  and  triplet 
(0.5086),  and  they  have  also  a  very  similar  dissoci¬ 
ation  energy.  In  the  case  of  the  (4,1)  clusters  the 
situation  is  not  so  clear. 


Conclusions 

The  DFT  method  at  the  LDA/BP  level  is  very 
satisfactory  for  the  predictions  of  equilibrium  ge¬ 
ometries,  dissociation  energies,  and  vibrational  fre¬ 
quencies,  as  well  as  dipole  moments  for  MCO 
compounds  when  compared  to  experimental  val¬ 
ues. 

For  M5CO  compounds  the  theoretical  values, 
given  in  this  report,  are  comparable  with  the  few 
experimental  values  available. 

In  particular  our  calculations  have,  established 
that: 

1.  CuCO  is  bent  in  the  doublet  ground  state, 
and  this  is  also  true  for  triplet  NiCO  and 


quintuplet  FeCO.  The  ground  states  for  NiCO 
(singlet)  and  FeCO  (triplet)  are  linear. 

2.  Dissociation  energies  calculated  using  bond 
asymptotic  energies  are,  in  general,  quite 
close  to  those  obtained  using  the  sum-of- 
fragments  energies. 

3.  There  is  a  close  relationship  between  the  dis¬ 
sociation  energy  and  the  overlap  population 
of  the  M-C  bond  for  the  MCO  and  M5CO 
(5,0)  compounds. 

4.  Bond  energies  are  lower  in  the  (5,0)  cases 
compared  to  the  (4,1)  ones.  The  dissociation 
energies  are  lower  for  the  (4,1)  cases  com¬ 
pared  to  the  (5,0)  ones. 

5.  CU5CO  has  two  states  with  almost  the  same 
bond  energy  but  with  very  different  dissocia¬ 
tion  energies  in  the  case  (5,0).  On  the  con¬ 
trary,  in  the  (4,1)  case,  the  state  2E-^  is  much 
more  stable  and  has  much  lower  dissociation 
energy  than  the  2A^  state. 

6.  The  (4,1)  case  of  NigCO,  similarly  to  NiCO, 
has  the  singlet  more  stable  than  the  triplet, 
but  this  behavior  is  reversed  in  the  (5,0)  case. 

7.  For  Fe5CO,  the  quintuplet  is  more  stable  than 
the  triplet,  opposite  to  FeCO  where  the  triplet 
is  more  stable  than  the  quintuplet. 

ACKNOWLEDGMENTS 

The  support  of  the  CONICYT  (Consejo  Nacional 
de  Investigaciones  Cientificas  y  Tecnologicas)  of 
Uruguay  is  gratefully  acknowledged. 


References 

1.  D.  Post  and  E.  J.  Baerends,  J.  Chem.  Phys.  78,  5663  (1983). 

2.  G.  te  Velde  and  E.  J.  Baerends,  Chem.  Phys.  177,  399  (1993). 

3.  A.  C.  Pavao,  M.  Braga,  C.  A.  Taft,  B.  L.  Hammond,  and  W. 
A.  Lester,  Jr.  Phys.  Rev  B  44,  1910  (1991). 

4.  ADF  2.0.1,  SCM,  Theorethical  Chemistry  (Wrije  Universiteit, 
Amsterdam)  E.  J.  Baerends,  D.  E.  Ellis,  and  P.  Ross,  Chem. 
Phys.  2,  41  (1973);  G.  te  Velde  and  E.  J.  Baerends,  J.  Comp, 
Phys.  99,  84  (1992). 

5.  W.  Kolin  and  L.  J.  Sham,  Phys.  Rev,  A.  140,  1133  (1965). 

6.  J.  C.  Slater,  Phys.  Rev.  81,  381  (1952). 

7.  S,  J,  Vosko,  L.  Wilk,  and  M.  Nusair,  Can.  J.  Phys.  58,  1200 
(1980). 

8.  A.  D.  Becke,  Phys.  Rev,  A  38,  3098  (1988). 

9.  J.  P.  Perdew,  Phys.  Rev.  B  33,  8822  (1986);  errata:  B  34,  7406 
(1986). 

10.  P.  M,  Boerrigter,  G.  te  Velde,  and  E.  J.  Baerends,  Int.  J. 
Quantum  Chem.  33,  87  (1988). 


1440 


QUANTUM  CHEMISTRY  SYMPOSIUM  NO.  30 


PROPERTIES  OF  MCO  AND  M5CO  CARBONYLS 


11.  T.  Ziegler  and  A.  Rauk,  Theor.  Chim.  Acta  46,  1  (1977). 

12.  L.  Fang  and  T.  Ziegler,  J.  Chem.  Phys.  95,  7401  (1991), 

13.  C.  W,  Bauschlicher  Jr.,  P.  S.  Bagus,  C.  J.  Nelin,  and  B.  O. 
Ross,  J.  Chem.  Phys.  85,  354  (1986). 

14.  R.  Fournier,  J.  Chem.  Phys.  99,  1801  (1993). 

15.  C.  W.  Bauschlicher,  J.  Chem,  Phys.  100,  1215  (1993). 

16.  R.  Fournier,  J.  Chem.  Phys.  98,  8041  (1993). 

17.  G.  Berthier,  A.  Daoudi  and  M.  Suard,  J.  Mol.  Struct  (Theo- 
chem)  179,  407  (1988). 

18.  A.  Veillard,  Chem.  Rev.  91,  743  (1991). 


19.  C.  W.  Bauschlicher,  Jr.  and  L.  G.  M.  Pettersson,  J.  Chem. 
Phys.  87,  2129  (1987). 

20.  B.  I.  Dunlap,  FI.  L.  Yu,  and  P.  R.  Antoniewicz,  Phys.  Rev.  A 
25,  25  (1982). 

21.  M.  R.  A.  Blomberg,  P.  E.  M.  Siegbahn,  T.  J.  Lee,  A.  P. 
Rendell,  and  J.  E.  Rice,  J.  Chem.  Phys.  95,  5898  (1991). 

22.  T.  Ziegler,  Chem.  Rev.  91,  651  (1991). 

23.  N.  Rosch,  in  Cluster  Models  for  Surface  Phenomena,  G.  Pac¬ 
chioni,  P.  S.  Bagus,  F.  Parmigianni,  Eds.  (NATO  ASI  Series, 
Series  B,  Phys.  Vol.  283,  Plenum,  New  York,  1992)  p.  251. 


INTERNATIONAL  JOURNAL  OF  QUANTUM  CHEMISTRY 


1441 


Ensemble  Density  Functional  Theory 
for  Inhomogeneous  Fractional  Quantum 
Hall  Systems 


O.  HEINONEN,*  M.  I.  LUBIN,  AND  M.  D.  JOHNSON 

Department  of  Physics,  University  of  Central  Florida,  Orlando,  Florida  32816-2385:  e-mail  for  O.H.: 
ogh  @phy  sics, ucf.edu 

Received  February  26,  1996;  revised  manuscript  received  March  19,  1996;  accepted  March  19,  1996 


ABSTRACT _ 

The  fractional  quantum  Hall  effect  (FQHE)  occurs  at  a  certain  magnetic  field  strengths 
F*(n)  in  a  two-dimensional  electron  gas  of  density  n  at  strong  magnetic  fields 
perpendicular  to  the  plane  of  the  electron  gas.  At  these  magnetic  fields  strengths,  the 
system  is  incompressible,  i.e.,  there  is  a  finite  cost  in  energy  for  creating  charge  density 
fluctuations  in  the  bulk,  while  the  boundary  of  the  electron  gas  has  gapless  modes  of 
density  waves.  The  bulk  energy  gap  arises  because  of  the  strong  electron-electron 
interactions.  While  there  are  very  good  models  for  infinite  homogeneous  systems  and  for 
the  gapless  excitations  of  the  boundary  of  the  electron  gas,  computational  methods  to 
accurately  model  finite,  inhomogeneous  systems  with  more  than  about  10  electrons  have 
not  been  available  until  very  recently.  We  will  here  review  an  ensemble  density 
functional  approach  to  studying  the  ground  state  of  large  inhomogeneous  spin-polarized 
FQHE  systems.  ©  1996  John  Wiley  &  Sons,  Inc. 


Introduction 

The  fractional  quantum  Hall  effect  (FQHE)  is 
manifested  in  a  two-dimensional  electron  gas 
(2DEG)  in  a  strong  magnetic  field  oriented  perpen¬ 
dicular  to  the  plane  of  the  electrons  [1].  The  effect 
was  discovered  as  a  transport  anomaly.  This  is  still 
the  ''Hallmark"  of  the  effect,  even  though  there 

*  To  whom  correspondence  should  be  addressed. 


are  now  a  host  of  other  phenomena  associated 
with  the  effect,  which  have  been  studied  experi¬ 
mentally.  In  a  transport  measurement  it  is  noted 
that  at  certain  strengths  B*(n),  which  depend  on 
the  density  n  of  the  2DEG,  current  can  flow  with¬ 
out  any  dissipation.  That  is,  there  is  no  voltage 
drop  along  the  flow  of  the  current.  At  the  same 
time,  the  Hall  voltage  perpendicular  to  both  the 
direction  of  the  current  and  of  the  magnetic  field  is 
observed  to  attain  a  quantized  value  for  a  small, 
but  finite,  range  of  magnetic  field  or  density,  de- 
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pending  on  which  quantity  is  varied  in  the  experi¬ 
ment.  The  effect  is  understood  to  be  the  result  of 
an  excitation  gap  in  the  spectrum  of  an  infinite 
2DEG  at  these  magnetic  fields,  so  that  there  is  a 
finite  cost  in  energy  to  be  paid  for  making  density 
fluctuations  in  the  system.  This  means  that  the 
2DEG  is  incompressible.  In  general,  the  magnetic 
field  strengths  at  which  the  quantum  Hall 

transport  anomalies  are  observed  are  related  to  the 
density  through  the  filling  factor  v  =  2iT\\n,  with 
Iq  =  ^Jhc/(eB)  the  magnetic  length.  The  quantum 
Hall  effect  was  first  discovered  [2]  at  integer  filling 
factors.  In  this  integer  quantum  Hall  effect,  the 
energy  gap  is  nothing  but  the  kinetic  energy  gap 
hoj^.  =  heB/im'^'c).  Later,  the  fractional  quantum 
Hall  effect  was  discovered  [3]  at  certain  rational 
filling  factors  of  the  form  v  =  p /q,  with  p  and  q 
relative  primes,  and  q  odd.  In  the  FQHE,  the 
excitation  gap  is  a  consequence  of  the  strong  elec¬ 
tron-electron  interactions. 

Our  understanding  of  the  origin  of  the  FQHE 
started  with  Laughlin's  seminal  study  of  1983  [4], 
which  dealt  with  the  simplest  fractions  v  =  \  /ni, 
with  m  an  odd  integer.  At  these  values  of  v,  there 
are  on  the  average  m  magnetic  flux  quanta  (Fq  = 
hc/e  per  electron.  In  that  work,  Laughlin  con¬ 
structed  a  variational  wave  function  for  spin- 
polarized  systems  in  strong  magnetic  fields,  strong 
enough  that  the  splitting  between  the  mag¬ 
netic  subbands,  or  Landau  levels,  can  be  taken  to 
be  infinite.  The  wave  function  can  then  be  con¬ 
structed  from  single-particle  states  entirely  within 
the  lowest  Landau  level.  Laughlin  wrote  the  varia¬ 
tional  wave  function  as 


«  n(z,  -  Zy)"'exp 

i.i 


1 

4 


(1) 


where  m  is  an  odd  integer,  and  Zj  =  Xj  +  i\j-  is  the 
coordinate  of  the  jth  electron  in  complex  notation. 
This  wave  function  is  an  eigenstate  of  angular 
momentum.  Laughlin  went  on  to  demonstrate  that 
the  system  having  the  wave  function  of  Eq.  (1)  is 
an  incompressible  liquid  with  p  =  I /m,  m  odd, 
and  with  an  energy  gap  to  excitations,  and  that  the 
elementary  excitations  are  fractionally  charged 
quasi-holes  or  quasi-particles  of  charge  = 
±e/ m.  The  origin  of  the  energy  gap  can  be  under¬ 
stood  in  the  so-called  pseudo-potential  representa¬ 
tion  of  the  electron-electron  interactions  [5].  Here, 
the  electron-electron  interaction  y(ry  -  r^  be¬ 
tween  electrons  i  and  j  is  decomposed  into 
strengths  Vi  in  relative  angular  momentum  chan¬ 


nels  /  =  0, 1, 2, . . . ,  of  the  two  electrons.  For  any 
realistic  interaction  —  r^,  it  turns  out  that 
Vq  >  >  V2  . , ,  .  Consider  the  case  In  this 

case,  the  lowest  angular  momentum  pseudo¬ 
potential  that  enters  into  the  m  =  3  Laughlin  de¬ 
scription  is  the  interaction  energy  of  two  elec¬ 
trons  of  unit  relative  angular  momentum  (in  units 
of  h),  (Even  relative  angular  momenta  are  not 
permissible  for  spin-polarized  electron  wave  func¬ 
tions,  since  they  have  to  be  antisymmetric  under 
interchange  of  electron  coordinates).  The  Laughlin 
wave  function  is  a  very  cleverly  constructed  highly 
correlated  state  which  completely  excludes  unit 
relative  angular  momentum  between  any  two  elec¬ 
trons,  and  is  furthermore  the  only  state  which 
satisfies  this  property  at  v  =  I,  Therefore,  any  ex¬ 
cited  state  must  contain  some  electrons  with  unit 
relative  angular  momentum.  The  energy  gap  is 
due  to  the  cost  of  this,  and  hence  is  of  order 
Note  that  because  of  the  nature  of  the  correlations 
between  the  electrons,  which  are  contained  in  the 
factors  (zy  -  ZjY',  the  Laughlin  wave  function  can¬ 
not  be  expressed  as  a  single  Slater  determinant  of 
single-particle  states  in  the  lowest  Landau  level. 
Figure  1  depicts  the  exchange-correlation  energy 
per  particle  for  infinite,  homogeneous  FQHE  sys¬ 
tems  vs.  filling  factor.  The  cusps  at  filling  factors 
^  =  h  h  h  ^  =  f  have  been  included  to  scale, 
and  these  filling  factors  marked  by  vertical  lines 
for  clarity.  Note  that  these  cusps  are  barely  visible 
on  this  scale,  yet  they  are  responsible  for  all  the 
physics  of  the  FQHE! 

More  modern  theories  of  the  FQHE  are  based 
on  the  so-called  composite  fermions.  This  idea  was 
originated  by  Jain  [6]  who  noted  that  the  m  =  3 
Laughlin  wave  function  can  be  written  as  n(zy  - 
2^^%,  where  %  =  n(z,.  -  zpexp[-  iUzf]  is 
the  Slater  determinant  wave  function  of  a  filled 
lowest  Landau  level.  Because  of  constraints  on  the 
Hilbert  space,  is  an  exact  eigenstate  of  an 
interacting  electron  system  at  v  =  1.  Although 
need  not  in  general  be  the  ground  state  of  the 
interacting  system,  it  is  if  the  Landau  level  split¬ 
ting  hco^  is  much  greater  than  the  scale  of  the 
Coulomb  interaction,  and  if  the  external 

potential  is  sufficiently  well  behaved  [7],  e.g.,  it  is 
caused  by  a  uniform  positive  background  charge 
density,  and  we  will  assume  that  these  conditions 
are  satisfied.  Multiplication  by  the  Jastrow  factor 
n(zy  -  Zy)^  makes  the  total  wave  function  ^3  van¬ 
ish  as  the  cube  of  the  separation  of  two  electrons 
when  they  approach  each  other,  rather  than  lin¬ 
early.  In  the  two-dimensional  world  of  the  FQHE, 
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Exchange-correlation  energy 


FIGURE  1 .  The  ground-state  energy  per  particle  of  an  infinite,  homogeneous,  spin-polarized  FQHE  system  is 
depicted  as  a  function  of  filling  factor.  The  cusps  at  i^=  5,  f ,  |,  and  1^=  f  are  included  (these  filling  factors  are 
indicated  by  vertical  lines  for  clarity). 


a  zero  of  the  wave  function  at  a  fixed  point  Zq 
produces  a  phase  factor  of  27r  when  an  electron 
adiabatically  encircles  Zq.  This  is  equivalent  to 
adding  a  flux-tube  containing  a  single-flux  quan¬ 
tum  <I>o  at  Zq.  This  is  purely  an  Aharonov-Bohm 
phase — no  magnetic  field  is  added  anywhere  ex¬ 
cept  at  Zq,  where  the  wave  function  vanishes. 
Therefore,  the  FQHE  at  wr  =  3  can  be  interpreted 
as  a  system  of  electrons  at  r”  =  1,  but  with  two  flux 
quanta  added  to  the  position  of  each  electron. 
These  composite  objects,  electrons  plus  an  integer 
number  of  flux  quanta,  are  called  composite 
fermions.  So  one  can  say  that  the  fractional  quan¬ 
tum  Hall  effect  at  =  |  is  an  integer  quantum 
Hall  effect  (v  =  1)  of  composite  fermions.  This 
generation  of  FQHE  states  from  composite 
fermions  was  subsequently  generalized  to  all 
FQHE  fractions.  Qne  way  of  studying  FQHE  sys¬ 
tems  of  electrons  theoretically  is  to  perform  a  sin¬ 
gular  gauge  transformation  [8],  which  is  a  gauge 
transformation  in  which  an  odd  number  of  flux 
quanta  is  added  to  the  position  of  each  electron. 
The  transformed  wave  function  is  thus  a  wave 
function  for  composite  fermions.  In  a  Lagrangian 
formulation,  a  term  has  to  be  added  to  the  La¬ 


grangian  to  ensure  that  the  flux  tubes  indeed  are 
located  at  the  electrons.  The  resulting  term  in  the 
Lagrangian  is  called  the  Chern-Simons  term,  and 
is  well  known  from  earlier  topological  field  theo¬ 
ries. 


Finite  Systems 

We  have  outlined  above  how  the  electron-elec¬ 
tron  interactions  in  an  infinite,  homogeneous  sys¬ 
tem  produce  the  excitation  gap.  It  is  important  to 
note  that  these  gaps  are  only  for  excitations  in  the 
bulk  of  the  system.  When  a  system  is  bounded 
there  must  be  gapless  excitations  located  at  the 
boundaries  of  the  system.  The  following  simple 
argument,  due  to  MacDonald  [7],  demonstrates 
this  point.  Consider  a  finite  system  in  which  the 
chemical  potential  jx  lies  in  the  bulk  "charge  gap," 
i.e.,  we  have  to  pay  the  price  of  the  energy  gap  to 
introduce  particles  to  the  bulk  of  the  system.  Now 
imagine  that  the  chemical  potential  is  increased  an 
infinitesimal  amount  8fx,  and  consider  the  result¬ 
ing  change  in  the  current  density.  In  the  bulk,  the 
current  density  cannot  change  since  Syu,  is  infinites- 
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imal  and  cannot  overcome  the  energy  gap  in  the 
bulk.  It  follows  that  if  there  is  a  change  in  the 
current  density  as  a  response  to  this  must  be 
located  at  the  edges  of  the  system.  Current  conser¬ 
vation  also  requires  that  if  there  is  a  resulting 
change  in  the  current  along  the  edge,  this  change 
must  be  constant  along  the  edge.  We  can  relate  the 
change  in  current  dl  to  the  change  in  orbital  mag¬ 
netization  through 


SI  =  ^8M,  (2) 


where  A  is  the  total  area  of  the  system.  This 
relation  is  nothing  but  the  equation  for  the  mag¬ 
netic  moment  of  a  current  loop.  But  we  can  write 
5M  in  terms  of  S/u  using  a  Maxwell  relation: 


SM 


dM 

dfJL 


S/JL 


dN 

Jb 


(3) 


where  N  is  the  total  number  of  electrons.  By 
combining  Eqs.  (2)  and  (3)  we  arrive  at 


81 

8jji 


(4) 


Since  dn/dB  is  nonzero,  it  follows  that  8I/8fi  is 
nonzero,  and  we  conclude  that  there  must  be  gap¬ 
less  excitations  of  the  system,  and  that  these  exci¬ 
tations  must  be  localized  to  the  edges.  Since  all 
experimental  systems  are  finite  and  inhomoge¬ 
neous,  the  low-energy  properties  probed  by  exper¬ 
iments  must  be  determined  by  the  gapless  edge 
excitations.  Advances  in  semiconductor  nanofabri¬ 
cation  technologies  have  lead  to  the  possibility  of 
manufacturing  systems  which  are  extremely  inho¬ 
mogeneous  and,  in  practice,  dominated  by  edges. 
As  an  example,  recent  experiments  have  even  been 
performed  on  tiny  quantum  dots,  with  about  30 
electrons  on  them  [9,  10].  In  order  to  accurately 
understand  the  experiments  and  inhomogeneous 
FQHE  systems  in  general,  we  must  have  a  way  of 
accurately  calculating  their  properties.  Certain  as¬ 
pects  of  inhomogeneous  FQHE  systems  have  been 
studied  by  different  techniques.  For  example,  field 
theories  can  be  constructed  to  study  the  low- 
energy  limit  of  the  gapless  edge  excitations  [11]. 
Composite  fermion  methods  have  been  used  in  a 
Hartree  approximation  to  study  finite  FQHE  sys¬ 
tems  [12,  13].  In  this  approach,  the  Chern- Simons 
term,  arising  from  the  singular  gauge  transforma¬ 
tion,  and  the  electron- electron  interaction  are 
treated  in  a  self-consistent  Hartree  approximation. 


The  hope  is  then  that  the  most  important  aspects 
of  the  electron-electron  correlations  are  included 
in  this  approximation.  Near  v  =  1,  at  which  the 
Slater  determinant  is  the  exact  grormd  state,  it 
makes  sense  to  use  the  Hartree-Fock  approxima¬ 
tion,  and  the  stability  of  a  quantum  dot  at  ?/  =  1  as 
a  function  of  confining  potential  has  been  studied 
in  this  approximation  [14,  15].  The  edge  structure 
has  also  been  studied  using  semiclassical  methods 
[16,  17]  in  which  the  electron-electron  interaction 
is  included  at  the  Hartree  level,  and  it  is  further¬ 
more  assumed  that  all  potentials  vary  on  a  length 
scale  much  larger  than  /g.  Beenakker  [16],  and 
Chklovskii,  and  Glazman  [17]  demonstrated  that 
the  edge  of  an  integer  quantum  Hall  system,  in 
which  the  correlation  energies  between  the  elec¬ 
trons  can  be  ignored,  consists  of  a  sequence  of 
compressible  and  incompressible  strips.  Imagine 
going  from  the  bulk  of  an  integer  quantum  Hall 
effect,  with  infilled  Landau  levels  in  the  bulk  (so 
that  the  chemical  potential  lies  above  the  energy  of 
these  V  Landau  levels)  toward  an  edge.  There  is  an 
external  potential  confining  the  system,  and  this 
potential  rises  toward  the  edge,  causing  the  Lan¬ 
dau  levels  to  bend  upward  near  the  edge.  The 
compressible  strips  occur  where  the  chemical  po¬ 
tential  crosses  a  Landau  level.  There  are  then  both 
empty  and  occupied  single-particle  states  avail¬ 
able,  and  the  electron  gas  can  screen  the  external 
potential  perfectly.  Eventually  this  Landau  level 
bends  upward,  rising  above  the  chemical  potential. 
There  are  then  no  more  empty  single  particle  states, 
and  changing  the  electron  density  would  involve  a 
cost  of  energy  of  the  order  of  ho)^,  the  spacing 
between  the  Landau  levels.  Thus,  an  incompress¬ 
ible  strip  forms.  Here,  the  electrons  cannot  screen 
the  external  potential.  Further  out  on  the  edge,  the 
chemical  potential  crosses  the  next  lower  Landau 
level,  and  a  new  compressible  strip  forms.  Thus, 
whenever  the  chemical  potential  lies  between  two 
Landau  levels,  the  electron  gas  is  incompressible, 
and  the  electron  density  is  constant,  while  the  total 
potential  varies.  On  the  other  hand,  whenever  the 
chemical  potential  crosses  a  Landau  level,  the  elec¬ 
tron  gas  can  screen  perfectly  the  external  potential, 
and  the  electron  density  varies  while  the  total 
potential  remains  constant.  The  width  of  the  in¬ 
compressible  strips  is  then  determined  by  the 
length  over  which  the  confining  potential  varies  an 
amount  equal  to  the  energy  gap  ho)^.  The  origin  of 
the  compressible  and  incompressible  strips  are  the 
energy  gaps,  which  are  the  kinetic  energy  gaps 
ft  o)^  in  the  case  of  the  integer  quantum  Hall  effect. 
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But  it  is  easy  to  heuristically  generalize  the  argu¬ 
ment  to  include  the  energy  gaps  causing  the  FQHE 
[17].  The  conclusion  is  then  that  there  should  be 
compressible  and  incompressible  strips,  with  the 
density  of  the  incompressible  strips  fixed  at  the 
density  of  an  FQHE  fraction.  The  width  of  each 
incompressible  strip  is  then  fixed  by  the  length 
over  which  the  confining  potential  varies  an 
amount  equal  to  the  energy  gap  of  the  FQHE 
fraction  corresponding  to  the  density  of  that  strip. 

Finite,  inhomogeneous  systems  have  also  been 
studied  by  direct  numerical  diagonalizations  [18]. 
At  the  present,  numerical  diagonalizations  are  lim¬ 
ited  to  systems  with  about  10  electrons. 

It  is  highly  desirable  to  have  a  computational 
approach  which  accurately  includes  electron-elec¬ 
tron  correlations  and  can  handle  inhomogeneous 
systems  with  on  the  order  of  10^-10^  electrons. 
One  such  approach  is  in  principle  valid  for  any 
interacting  electron  system  is  the  density  func¬ 
tional  theory  (DFT)  [19-21].  There  have  been  some 
attempts  to  apply  density  functional  theory  to  the 
FQHE. 

Ferconi  and  Vignale  [22]  applied  current  density 
functional  theory  [23]  to  small,  parabolically  con¬ 
fined  quantum  Hall  systems  and  showed  that  the 
current  density  functional  theory  gave  good  re¬ 
sults  for  the  ground-state  energy  and  spin  polar¬ 
ization  near  v  =  1,  However,  the  energy  gaps  due 
to  correlation  effects  were  not  included  in  that 
calculation.  Ferconi,  Geller,  and  Vignale  [24]  also 
recently  studied  FQHE  systems  within  the  spirit  of 
the  DFT  using  an  extended  Thomas-Fermi  ap¬ 
proximation  at  low,  but  nonzero,  temperatures.  In 
this,  the  kinetic  energy  was  treated  as  a  local 
functional,  as  in  the  standard  Thomas-Fermi  ap¬ 
proximation,  while  the  exchange-correlation  en¬ 
ergy  was  included  in  a  local  density  approxima¬ 
tion  (LDA).  This  extended  Thomas-Fermi  approxi¬ 
mation  is  valid  in  the  limit  of  very  slowly  varying 
confining  potential.  Ferconi,  Geller,  and  Vignale 
focused  on  the  incompressible  and  compressible 
strips  at  an  edge  of  an  FQHE  system  and  obtained 
results  in  agreement  with  the  predictions  by 
Chklovskii,  Shklovskii,  and  Glazman  [17]. 

We  have  developed  for  the  fractional  quantum 
Hall  effect  an  ensemble  DFT  scheme  within  the 
local  density  approximation  and  have  applied  it  to 
spin-polarized  circularly  symmetric  quantum  dots 
[25].  In  our  approach,  the  kinetic  energy  is  treated 
exactly,  and  the  density  represented  by 
Kohn-Sham  orbitals.  The  results  are  in  good 
agreement  with  results  obtained  by  semidassical 


[16,  17,  24],  Hartree-Fock  [14,  15]  (for  cases  where 
the  correlations  do  not  play  a  major  role),  and 
exact  diagonalization  methods  [26].  Our  calcula¬ 
tions  show  that  the  exchange  and  correlation  ef¬ 
fects  of  the  FQHE  are  very  well  represented  by  the 
LDA  and  that  our  approach  provides  a  computa¬ 
tional  scheme  to  model  large  inhomogeneous 
FQHE  systems.  We  note  that  there  exist  previous 
formal  DFTs  for  strongly  correlated  systems,  in 
particular  for  high-temperature  superconductors 
[27],  and  DFT  calculations  of  high-T,  materials  [28] 
and  transition-metal  oxides  [29],  However,  ours 
are,  to  the  best  of  our  knowledge,  the  first  practical 
LDA-DFT  calculations  of  a  strongly  correlated 
system  in  strong  magnetic  fields,  and  demonstrate 
the  usefulness  of  the  LDA-DFT  in  studying  large 
inhomogeneous  FQHE  systems. 


Ensemble  Density  Functional 
Theory  Approach 

In  typical  DFT  calculations  of  systems  of  N^/ 
electrons,  the  standard  Kohn-Sham  (KS)  scheme 
[30]  is  implemented  in  which  the  particle  density 
nix)  is  expressed  in  terms  of  a  Slater  determinant 
of  N  >  KS  orbitals,  i/^^(r).  These  obey  an  effec¬ 
tive  single-particle  Schrodinger  equation  = 

which  is  solved  self-consistently  by  occupy¬ 
ing  the  Ngi  KS  orbitals  with  the  lowest  eigenval¬ 
ues,  and  iterating.  This  scheme  works  well  in 
practice  for  systems  for  which  the  true  electron 
density  can  be  represented  by  a  single  Slater  deter¬ 
minant  of  single-particle  wave  functions.  How¬ 
ever,  when  the  KS  orbitals  are  degenerate  at  the 
Fermi  energy  (which  we  identify  with  the  largest 
of  the  occupied  orbitals)  there  is  an  ambiguity 
of  how  to  occupy  these  degenerate  orbitals.  There 
exists  an  extension  of  DFT  which  is  formally  able 
to  deal  with  this  situation.  This  extension  is  called 
ensemble  DFT  [20,  21],  and  in  it,  the  density  of  the 
system  is  represented  by  an  ensemble  of  Slater 
determinants  of  KS  orbitals.  However,  while  it  can 
be  shown  using  ensemble  DFT  that  such  a  repre¬ 
sentation  of  the  density  is  rigorous,  it  cannot  be 
shown  hoTV  the  degenerate  KS  orbitals  at  the  Fermi 
energy  should  be  occupied,  i.e.,  there  has  not  been 
available  a  practical  computational  scheme  for  en¬ 
semble  density  functional  theory. 

We  argued  above  that  the  Laughlin  wave  func¬ 
tion  cannot  be  represented  as  a  Slater  determinant 
of  single-particle  wave  functions.  Therefore,  one 
may  suspect  that  the  density  of  a  general  FQHE 
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system  cannot  be  represented  by  a  single  Slater 
determinant.  We  will  now  argue  that  this  is  indeed 
the  case.  Consider  a  FQHE  system  in  the  xy  plane 
with  the  magnetic  field  along  the  z  axis.  A  circu¬ 
larly  symmetric  external  potential 
(due,  e.g.,  to  a  uniform  positive  background  charge 
density)  confines  the  systems  such  that  the  density 
is  fixed  with  a  local  filling  of  r-  =  |  up  to  an  edge 
at  Tq  (^q  »  /g)  where  the  density  falls  to  zero 
within  a  distance  of  order  Ig.  That  such  systems 
exist  is  well  demonstrated  by  the  excellent  agree¬ 
ment  between  the  Laughlin  wave  function  and 
experiments,  and  by  many  numerical  calculations 
[31,  26].  Due  to  the  circular  symmetry  we  can  label 
single-particle  orbitals  by  angular  momentum  m, 
and  by  a  Landau  level  index  n  >  0.  The  orbitals 
„(r)  are  centered  on  circles  of  radii  ~  ]/2  ml  g 
with  Gaussian  fall-offs  for  r  ^  and  r  » 

The  single-particle  orbitals  with  n  =  0  are  then  in 
the  bulk  all  degenerate,  and  the  degeneracy  is  not 
lifted  by  electron-electron  interactions  since  the 
system  is  homogeneous  in  the  bulk.  Only  the  or¬ 
bital  0  is  nonzero  at  the  origin — all  others  van¬ 
ish  at  r  =  0.  In  order  to  obtain  a  constant  density 
at  p  =  I  even  at  the  center  of  the  system,  all 
single-particle  orbitals  in  the  bulk  with  ^  =  0  must 
have  occupancies  If  the  Fermi  energy  lay  above 
the  energies  of  the  bulk  orbitals,  they  would  all  be 
filled  and  one  would  have  v  =  1.  Therefore,  to  get 
occupancies  ^  the  Fermi  energy  must  lie  at  the 
degenerate  energy  of  these  orbitals.  Thus,  in 
applying  DFT  to  the  FQHE  we  can  expect  a  huge 
degeneracy  of  KS  orbitals  at  the  Fermi  energy,  and 
they  must  all  have  fractional  occupancies.  Conse¬ 
quently,  the  particle  density  cannot  be  expressed 
in  terms  of  a  single  Slater  determinant.  Instead,  the 
density  has  to  be  constructed  from  an  ensemble  of 
Slater  determinants,  i.e.,  the  orbitals  at  the  Fermi 
energy  are  assigned  fractional  occupation  num¬ 
bers,  just  as  the  Laughlin  wave  function  for  v  =  ^ 
is  not  a  single  Slater  determinant,  but  a  highly 
correlated  state  with  average  occupancies  of  ^  of 
single-particle  states.  Therefore,  one  might  expect 
from  the  outset  that  one  has  to  use  ensemble 
density  functional  theory — the  standard 
Kohn-Sham  scheme  may  not  converge. 

Although  ensemble  DFT  has  been  developed 
formally,  there  are  in  practice  few  examples  of 
applications  and  calculations  using  ensemble  DFT 
for  ground-state  calculations.  A  significant  aspect 
of  our  work  is  that  we  have  developed  an  ensem¬ 
ble  scheme  which  is  practical  and  useful  for  the 


study  of  the  FQHE.  In  ensemble  DFT,  any  physical 
density  n(r)  can  be  represented  by  nlr)  = 
where  are  occupation  num¬ 
bers  satisfying  0  <  1,  and  the  orbitals 

satisfy  the  equation 


l2 


p  +  -A(r) 


(5) 


where  V  X  A(r)  =  B(r).  In  Eq.  (5),  V^(r)  is  the 
Hartree  interaction  of  the  two-dimensional  elec¬ 
trons,  and,  as  usual,  B)  is  the  exchange-corre¬ 
lation  potential,  defined  as  a  functional  derivative 
of  the  exchange-correlation  energy  E^^[n(r),B]  of 
the  system  with  respect  to  density: 


3£,Jn(r),B] 

8n(r) 


(We  will  hereafter  not  explicitly  indicate  the  para¬ 
metric  dependence  of  and  on  B.)  For  the 
case  of  the  FQHE,  we  know  that  the  exchange-cor¬ 
relation  potential  will  be  crucial,  as  it  contains  all 
the  effects  of  the  electron  correlations  which  cause 
the  FQHE  in  the  first  place,  and  a  major  part  of  the 
DFT  application  is  to  come  up  with  an  accurate 
model  of  £^^  and  so  of  14^,.  Leaving  this  question 
aside  for  a  moment,  and  assuming  that  we  have 
succeeded  in  doing  so,  the  practical  question  is 
then  how  to  determine  the  KS  orbitals  and  their 
occupancies  in  the  presence  in  degeneracies.  We 
devised  an  empirical  scheme,  which  means  that 
after  a  lot  of  trial  and  error  we  made  some  edu¬ 
cated  guesses  that  work.  Our  scheme  produces  a 
set  of  occupancies  for  the  KS  orbitals  which  satisfy 
some  minimum  requirements,  namely  (a)  the 
scheme  converges  to  physical  densities  (to  the  best 
of  our  knowledge)  for  FQHE  systems,  (b)  it  repro¬ 
duces  finite  temperature  DFT  distributions  at 
finite  temperatures,  and  (c)  it  reproduces  the  stan¬ 
dard  Kohn-Sham  scheme  for  systems  whose  den¬ 
sities  can  be  represented  by  a  single  Slater  deter¬ 
minant. 

In  our  scheme,  we  start  with  input  occupancies 
and  single-particle  orbitals  and  iterate  the  system 
Ngq  times  using  the  KS  scheme.  The  number  is 
chosen  large  enough  (about  20-30  in  practical  cal¬ 
culations)  that  the  density  is  close  to  the  final 
density  after  iterations.  If  the  density  of  the 
system  could  be  represented  by  a  single  Slater 
determinant  of  the  KS  orbitals,  we  would  now 
essentially  be  done.  However,  in  this  system  there 
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are  now  in  general  many  degenerate  or  near¬ 
degenerate  orbitals  at  the  Fermi  energy.  After  each 
iteration,  the  Kohn-Sham  scheme  chooses  to  oc¬ 
cupy  the  Ng]  orbitals  with  the  lowest  eigenvalues, 
corresponding  to  making  a  distinct  Slater  determi¬ 
nant  of  these  orbitals.  But  there  will  be  small 
fluctuations  in  the  density  between  each  iteration, 
which  cause  a  different  subset  of  these  (near)  de¬ 
generate  orbitals  to  be  occupied  after  each  itera¬ 
tion.  This  corresponds  to  constructing  different 
Slater  determinants  after  each  iteration,  and  the 
occupation  numbers  of  these  orbitals  are  zero 
or  unity  more  or  less  at  random  after  each  itera¬ 
tion.  This  means  that  the  computations  will  never 
converge.  However,  the  average  occupancies,  i.e., 
the  occupancies  averaged  over  many  iterations, 
become  well  defined  and  approach  a  definite  value, 
e.g.,  5  for  orbitals  localized  in  a  region  where  the 
local  filling  factor  is  close  to  v  =  |.  Therefore,  we 
use  these  average  occupancies  to  construct  an  en¬ 
semble  by  accumulating  running  average  occupan¬ 
cies  </„,„>  after  the  initial  iterations; 

I  Nit 

/=Noq  +  l 

where  f„„  i  is  the  occupation  number  (0  or  1)  of 
orbital  after  the  ith  iteration,  and  use  these  to 
calculate  densities.  Thus,  our  algorithm  essentially 
picks  a  different  (near)  degenerate  Slater  determi¬ 
nant  after  each  iteration,  and  these  determinants 
are  all  weighted  equally  in  the  ensemble.  It  is  clear 
that  this  scheme  reduces  to  the  KS  scheme  for 
which  the  density  can  be  represented  by  a  single 
Slater  determinant  of  KS  orbitals  (for  which  the  KS 
scheme  picks  only  the  one  Slater  determinant 
which  gives  the  ground  state  density)  for  N^g  large 
enough.  We  have  numerically  verified  that  a  fi¬ 
nite-temperature  version  of  our  scheme  converges 
to  a  thermal  ensemble  at  finite  temperatures  down 
to  temperatures  of  the  order  of  Vd~^h(ajk^.  We 
have  also  performed  some  Monte  Carlo  simula¬ 
tions  about  the  ensemble  obtained  by  our  scheme. 
In  these  simulations,  we  used  a  Metropolis  algo¬ 
rithm  to  randomly  change  the  occupation  numbers 
about  our  converged  solution,  keeping  the  chemi¬ 
cal  potential  fixed.  The  free  energy  of  the  new  set 
of  occupation  numbers  was  calculated  self-con- 
sistently.  If  the  free  energy  decreased,  this  set  was 
kept,  and  if  the  free  energy  increased,  the  set  was 
kept  if  a  random  number  was  smaller  than 
exp[-AF/A:BT*],  where  Af  is  the  change  in  free 


energy,  and  T*  a  fictitious  temperature.  The  re¬ 
sults  were  that  to  within  numerical  accuracy  our 
ensemble  DFT  scheme  gives  the  lowest  free  en¬ 
ergy.  As  a  condition  for  convergence,  we  typically 
demanded  that  the  difference  between  the  input 
and  output  ensemble  densities,  and  n^^fCr), 

of  one  iteration  should  satisfy 

Injr)  -  n^^^(r)lrdr  <  W-\  (7) 

Ki  ■'o 

Practical  density  functional  theory  calculations 
hinge  on  the  availability  of  good  approximations 
for  the  exchange-correlation  potential  which 
enters  in  the  effective  Schrodinger  equation  for  the 
KS  orbitals.  The  simplest,  and  probably  the  most 
commonly  used,  approximation  is  the  local  density 
approximation  (LDA).  In  this  approximation,  the 
exchange-correlation  energy  is  assumed  to  be  a 
local  function  of  density,  so  that  the  total 
exchange-correlation  energy  consists  of  contribu¬ 
tions  from  the  local  density  of  the  system.  Thus,  in 
this  approximation  one  writes  E^^/N  = 
f  dxe^^Mnir),  where  e^fv)  is  the  exchange-corre¬ 
lation  energy  per  particle  in  a  homogeneous  system 
of  constant  density  n  =  p/HttII)  and  filling  factor 
V.  In  other  words,  in  the  LDA  one  assumes  that 
the  system  is  locally  homogeneous,  i.e.,  the  system 
can  locally  be  approximated  to  have  the  energy 
per  particle  of  an  infinite,  homogeneous  system  of 
the  local  density.  This  approximation  obviously 
makes  sense  if  the  density  of  the  system  varies  on 
a  very  long  length  scale,  while  it  could  be  ques¬ 
tionable  for  systems  in  which  the  density  varies  on 
some  microscopic  length  scale.  However,  experi¬ 
ence  has  shown  that  the  LDA  often  works  surpris¬ 
ingly  well,  even  for  systems  in  which  the  electron 
density  is  strongly  inhomogeneous  [19].  In  fact,  the 
first  application  of  LDA-DFT  was  to  calculate  the 
work  function  of  simple  metals  [32],  so  these  were 
systems  which  were  terminated  with  densities 
varying  on  the  scale  of  a  Bohr  radius!  Neverthe¬ 
less,  the  LDA-DFT  gave  quite  good  results,  vastly 
superior  to  those  of  the  Hartree  or  Hartree-Fock 
approximation.  In  the  FQHE,  the  length  scale  of 
exchange-correlation  interactions  and  density  fluc¬ 
tuations  is  given  by  the  magnetic  length  /g  due  to 
the  Gaussian  fall-off  of  any  single-particle  basis  in 
which  the  interacting  Hamiltonian  is  expanded. 
The  densities  are  relatively  smooth  on  this  length 
scale,  which  gives  us  additional  hope  that  the  LDA 
will  work  well  for  the  FQHE,  too.  In  addition,  the 
cusps  in  the  exchange-correlation  energy  will  sup- 
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press  density  fluctuations,  so  in  this  sense  one  can 
actually  expect  the  basic  physics  of  the  FQHE  to 
make  the  LDA  a  good  approximation. 

In  conventional  LDA-DFT  calculation,  the  ex¬ 
change-correlation  energy  is  obtained  by  inter¬ 
polating  between  the  exchange-correlation  ener¬ 
gies  per  particle  of  systems  with  vanishing  and 
infinite  densities,  respectively,  for  which  exact  re¬ 
sults  are  known.  Analogously,  following  Rasolt 
and  Perrot  [33],  we  obtain  our  exchange-correla¬ 
tion  energy  by  interpolating  between  two  limits 
for  which  the  result  is  known  very  accurately.  In 
our  case,  the  two  limits  are  B  and  B  0, 
respectiv'^ely,  and  we  stitch  them  together  using  a 
Fade  approximant  [33].  Thus,  we  write  for  the 
exchange-correlation  energy  per  particle  of  a  uni¬ 
form  electron  gas  in  a  constant  magnetic  field: 

- TTT^ - ■  ® 

Here,  ej^  is  the  zero-magnetic  field  result  for  a 
2DEG  obtained  by  Tana  tar  and  Ceperley  [34].  The 
term  is  the  B  ^  limit,  which  is  the 

exchange-correlation  energy  of  the  FQHE  in  a  sys¬ 
tem  for  which  only  single-particle  states  in  the 
lowest  Landau  level  are  occupied.  This  contribu¬ 
tion  consists  of  two  terms.  The  first  one  is  a  smooth 
interpolation  formula  due  to  Levesque, 

Weiss,  and  MacDonald  [35]  between  ground-state 
energies  at  some  rational  fillings.  The  second  one, 
is  all-important  for  the  study  of  the  FQHE. 
This  term  contains  the  cusps  in  the  ground-state 
energy  which  cause  the  FQHE.  Here  we  have  used 
a  simple  model  which  captures  the  essential 
physics.  We  model  by  constructing  it  to  be 

zero  at  values  of  i'  =  p/q  which  display  the  FQHE. 
Near  v  =  p/q,  e^/v)  is  linear  and  has  at  v  =  p/q  a 
discontinuity  in  the  slope  related  to  the  chemical 
potential  gap  A/r  =  qi\A^\  +  |A;,|).  Here  A^  ,,  are 
the  quasiparticle  (hole)  creation  energies  which 
can  be  obtained  from  the  literature  [36,  37]  at 
fractions  v  =  p/q.  Farther  away  from  v  =  p/q, 
decays  to  zero.  Finally,  in  the  LDA  is 

obtained  from  as 


V,Ar) 


d[  ve,.^(p)] 
dv 


v{r) 


at  constant  B.  In  our  calculations,  we  restrict  our¬ 
selves  to  include  only  the  cusps  at  =  |,  |,  |  and 
v=  l,  which  are  the  strongest  fractions.  These  are 
some  the  fractions  of  the  form  v  =  p/{2p±D 


generated  by  the  so-called  model,  in  which 
only  the  pseudo-potential  is  included. 

A  technical  difficulty  arises  in  the  LDA:  The 
discontinuities  in  iri  the  LDA  give  rise  to  a 

numerical  instability.  The  reason  is  that  an  arbi¬ 
trarily  small  fluctuation  in  charge  density  close  to 
an  FQHE  fraction  give  rise  to  a  finite  change  in 
energy.  Imagine  that  the  local  filling  factor  v(r)  in 
some  neighborhood  of  a  point  r  is  very  close  to, 
but  less  than,  say,  |  after  one  iteration.  In  this 
neighborhood,  the  local  exchange-correlation  po¬ 
tential  will  then  form  a  potential  well  with  sharp 
barriers  at  the  points  around  r  where  p(r)  = 
During  the  next  iteration,  charge  will  then  be 
poured  into  this  well.  As  a  result,  the  local  filling 
factor  will  after  this  iteration  exceed  and  in  this 
neighborhood  now  forms  a  potential  barrier  of 
finite  height.  So  in  the  next  iteration,  charge  is 
removed  from  this  neighborhood,  and  so  on.  We 
can  see  that  this  leads  to  serious  convergence  prob¬ 
lems.  To  overcome  this,  we  made  the  compressibil¬ 
ity  of  the  system  finite,  but  very  small,  correspond¬ 
ing  to  a  finite,  but  very  large,  curvature  instead  of 
a  pointlike  cusp  in  at  the  FQHE  fractions.  In 
other  words,  instead  of  having  a  steplike  disconti¬ 
nuity  A/i,  in  the  chemical  potential,  it  rises 
smoothly  an  amount  A  /m  over  an  interval  y  in  the 
filling  factor.  What  worked  very  well  in  practice 
was  to  have  the  discontinuity  in  chemical  potential 
occur  over  an  interval  of  filling  factor  y  of  magni¬ 
tude  10“^.  This  corresponds  to  a  sound  velocity  of 
about  10^  m/ s  in  the  electron  gas,  which  is  3 
orders  of  magnitude  larger  than  the  Fermi  velocity 
of  a  2DEG  at  densities  typical  for  the  FQHE.  In 
general,  the  finite  compressibility  does  not  lead  to 
any  spurious  physical  effects  so  long  as  the  energy 
of  density  fluctuations  on  a  size  of  the  order  of  the 
systems  size  is  larger  than  any  other  relevant  en¬ 
ergy  in  the  problem.  The  only  noticeable  effect  is 
that  incompressible  plateaus,  at  which  the  density 
would  be  perfectly  constant  were  the  compressibil¬ 
ity  zero,  will  have  density  fluctuations  on  a  scale 
of  y.  Figure  2  depicts  used  in  our  calculations 
as  a  function  of  filling  factor. 


Applications  to  Quantum  Dots 

We  have  self-consistently  solved  the  KS  equa¬ 
tions  [Eqs.  (5)]  for  a  spin-polarized  quantum  dot  in 
a  parabolic  external  potential,  V'ext^'')  = 
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FIGURE  2.  Exchange-correlation  potential  as  function  of  filling  factor  in  units  of  e^/  (gQ/g)  for  0  <  i'  <  1.  The 
increase  in  at  an  FQHE  filling  occurs  over  a  range  of  filling  factor  of  0.004. 


by  expanding  the  KS  orbitals 
in  the  eigenstates  of 


We  use  the  cylindrical  gauge,  A(r)  =  \Br<j),  and 
include  the  four  lowest  Landau  levels  (u 
0,...,3).  We  chose  the  static  dielectric  constant 
eg  =  13.6,  appropriate  for  GaAs,  and  a  confining 
potential  of  strength  [9]  fiil  =  1.6  meV. 

The  use  of  our  LDA-DFT  scheme  is  illustrated 
by  a  study  of  the  edge  reconstruction  of  the  quan¬ 
tum  dot  as  a  function  of  magnetic  field  strength. 
As  is  known  from  Hartree-Fock  and  exact  diago- 
nalizations  [12—15,  16],  for  strong  confinement  the 
quantum  dot  forms  a  maximum  density  droplet  in 
which  the  density  is  uniform  at  v  =  1  in  the  inte¬ 
rior,  and  falls  off  rapidly  to  zero  at  r  «  'JTn  /  g  =  rg. 
As  the  magnetic  field  strength  increases,  a  "lump 
of  density  breaks  off,  leaving  a  "hole"  or  deficit  at 
about  r  =  rg.  This  effect  is  due  to  the  short-ranged 
attractive  exchange  interaction:  It  is  energetically 
favorable  to  have  a  lump  of  density  break  off  so 
that  the  system  can  take  advantage  of  the  ex¬ 
change  energy  in  the  lump.  As  B  is  further  in¬ 
creased,  the  correlations  will  cause  incompressible 


strips  with  densities  v  —  p/q  to  appear  [16,  17,  24, 
38]  on  the  edges,  and  incompressible  droplets  to 
form  in  the  bulk  at  densities  v  =  p/q.  Figure  3 
depicts  various  states  of  edge  reconstruction  ob¬ 
tained  by  us  as  the  magnetic  field  strength  is 
increased.  The  value  of  B  for  which  the  exchange 
lump  appears  compares  very  well  with  the  value 
found  by  De  Chamon  and  Wen  [15]  in 
Hartree-Fock  and  numerical  diagonalizations.  At 
higher  fields  still,  incompressible  strips  appear  at 
the  edges,  and  incompressible  droplets  are  formed 
in  the  bulk.  In  Figure  4  we  show  occupancies  for 
the  KS  orbitals  for  a  finite-temperature  calcula¬ 
tion  with  Ng]  =  40,  B  =  4.45  T,  and  T  = 
O.OOOBe'^/ieQl^k^}.  The  diamonds  depict  the  con¬ 
verged  ensemble  occupancies  for  the  KS  single¬ 
particle  states  using  our  ensemble  scheme.  At  this 
finite  temperature,  we  calculated  the  thermal  occu¬ 
pancies  of  the  KS  orbitals  after  each  iteration,  and 
these  "instantaneous"  thermal  occupancies  were 
then  averaged  using  our  ensemble  scheme.  The 
temperature  was  sufficinently  low  that  the  instan¬ 
taneous  thermal  occupancies  were  essentially  0  or 
1  before  convergence.  The  continuous  curve  shows 
the  instantaneous  thermal  occupancies  obtained 
after  a  particular  iteration  after  convergence  has 
been  achieved.  This  figures  then  clearly  shows  that 
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reconstruction  of  a  quantum  dot  as  the  magnetic  field  strength  is  increased.  Plotted  here  is  the  local 
Jlhng  factor  v(r)  for  a  parabolic  quantum  dot  with  S  U  =  1 .6  meV  and  40  electrons.  For  magnetic  field  strengths  B  <25 
T  he  dot  forms  a  rnaximum  density  droplet,  and  for  B  =  3.0  T,  an  exchange  hole  is  formed.  For  stronger  magnetic ' 
tields,  incompressible  regions  form,  separated  by  compressible  strips. 


J'^'®'^°nds)  and  instantaneous  (solid  line)  thermal  occupancies  for  N„,  -  40,  B  =  4.45  T  and 
/— 0.0003  e  /  (eg/g/Tg)  after  convergence. 
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FIGURE  5.  Eigenvalues  of  the  lowest  Landau-level  Kohn-Sham  orbitals  for  A/g,  -  40  and  6  -  4.45  T  as  a  function  of 
angular  momentum  quantum  number.  The  chemical  potential  is  indicated  by  the  dashed  line. 


our  ensemble  occupancies  (in  this  case  at  a  low, 
but  nonzero,  temperature)  converged  to  the  ther¬ 
mal  occupancies.  Note  that  this  particular  temper¬ 
ature  is  so  low  that  no  standard  finite-temperature 
scheme  could  be  used  to  achieve  convergence. 

Figure  5  depicts  the  eigenvalues  of  the  KS  or¬ 
bitals  for  Ng,  =  40,  and  B  =  4.45  T.  The  dashed 
line  indicates  the  chemical  potential  of  the  system. 
This  figure  then  shows  that  all  KS  orbitals  in  the 
bulk  are  in  fact  degenerate.  It  may  at  first  seem 
paradoxical  that  the  eigenvalues  are  degenerate  on 
an  incompressible  strip,  since,  according  to  the 
picture  by  Chklovskii,  Shklovskii,  and  Glazman 
[17],  on  such  a  strip  the  density  is  constant,  while 
the  total  potential  varies  (since  the  electrons  can¬ 
not  screen  the  external  potential).  If  the  total  po¬ 
tential  varies,  then  ought  not  the  eigenvalues  of 
the  KS  orbitals  localized  on  that  strip  vary,  too, 
since  these  then  in  general  are  subjected  to  differ¬ 
ent  potential  energies?  The  problem  with  this  ar¬ 
gument  as  applied  to  DFT  is  that  it  ignores  the 
effect  of  the  exchange-correlation  potential.  As  the 
external  and  Hartree  potentials  vary  across  the 
strip,  the  exchange-correlation  potential  varies 
across  its  discontinuity  so  as  to  completely  screen 
out  the  external  and  Hartree  potentials.  The  dis¬ 
continuity  in  does  not  mean  that  this  potential 
is  fixed  at  the  lower  limit  of  its  discontinuity  while 


the  density  is  fixed  at  an  incompressible  strip. 
What  it  does  mean  is  that  is  f^e  to  achieve 
any  value  across  its  discontinuity  so  as  to  com¬ 
pletely  screen  out  the  external  and  Hartree  poten¬ 
tials.  In  this  way,  it  is  perhaps  better  to  think  of 
incompressibility  as  the  limit  of  a  finite  compress¬ 
ibility  approaching  zero.  A  strip  can  then  remain 
incompressible  with  constant  density  so  long  as 
can  screen  the  external  and  Hartree  potentials, 
so  the  width  of  the  incompressible  strip  is  given  by 
the  distance  over  which  the  external  plus  Hartree 
potentials  varies  an  amount  given  by  the  energy 
gap  associated  with  the  density  at  that  strip.  Also, 
all  bulk  KS  states  are  degenerate  at  the  chemical 
potential.  When  a  single  particle  is  added,  the 
chemical  potential  simply  increases  a  small 
amount,  and  all  KS  orbitals  are  again  degenerate  at 
the  chemical  potential.  We  also  would  like  to  em¬ 
phasize  that  incompressible  regions  that  appear  in 
these  calculations  are  not  due  to  the  presence  of  a 
uniform  positive  background  density  which  tends 
to  fix  the  bulk  density  at  the  value  of  the  back¬ 
ground  density. 

There  is  also  another  edge  effect  caused  by 
correlations.  For  particular,  stiff  confining  poten¬ 
tials,  so-called  composite  edges  [26,  39]  can  ap¬ 
pear.  These  can  be  thought  of  as  particle-hole 
conjugates  of  uniform  incompressible  droplets. 
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Consider  a  droplet  with  a  bulk  density  corre¬ 
sponding  to  V  =  falling  off  to  zero  at  the  edge. 
An  incompressible  droplet  with  a  bulk  density  of 

=  I  is  obtained  by  particle-hole  conjugation. 
However,  at  the  edge,  the  density  will  first  rise  to 
V  =  1  (since  the  density  of  the  v  =  |  droplet  drops 
to  zero),  and  then  eventually  drop  to  zero.  Note 
that  this  argument  is  based  on  particle-hole  conju¬ 
gation,  which  is  an  exact  symmetry  of  the  lowest 
Landau  level  [40],  and  it  is  unclear  if  composite 
edges  exist  in  real  systems,  which  do  not  strictly 
obey  particle-hole  symmetry. 

Figure  6  depicts  the  particle  density  (inset)  for  a 
system  where  the  confining  potential  is  supplied 
by  a  uniform  positive  background  charge  density 
w+=2/(67rZg)  (so  that  the  corresponding  filling 
factor  is  =  |)  for  r  <  r^,  and  falling  linearly  to 
zero  over  a  distance  a  for  a  >  r^,  where  Tq  is  fixed 
by  charge  neutrality.  Thus,  the  parameter  a  is  a 
convenient  parameter  with  which  one  can  control 
the  stiffness  of  the  confining  potential  [15].  From 
this  figure,  we  see  that  for  a  =  0,  the  system  forms 
a  composite  edge,  even  though  our  system  does 
not  obey  particle-hole  symmetry.  We  therefore 
conclude  that  such  edges  can  exist  in  real  systems. 
As  a  is  increased,  the  edge  undergoes  an  instabil¬ 


ity  and  reconstruction,  and  eventually  forms  in¬ 
compressible  strips. 


Conclusion  and  Summary 

In  conclusion,  we  have  showed  that  ensemble 
density  functional  theory  can  be  applied  to  the 
FQHE.  This  opens  the  door  to  doing  realistic  calcu¬ 
lations  for  large  systems.  We  believe  that  our  re¬ 
sults  are  also  significant  in  that  they  are  the  first 
LDA-DFT  calculations  of  a  strongly  correlated 
system  in  a  strong  magnetic  field,  and  they  are  (to 
the  best  of  our  knowledge)  the  first  practical  en¬ 
semble  DFT  calculations.  There  are,  however,  still 
many  issues  that  need  to  be  resolved,  and  new 
directions  to  go.  For  example,  our  calculations 
were  of  a  spin-polarized  system.  As  is  well  known 
[7],  the  spin  degree  of  freedom  is  very  important, 
even  for  magnetic  fields  of  the  order  of  10  T.  The 
reason  is  that  the  effective  g  factor  in  GaAs  is  very 
small,  so  small  that  the  ratio  between  the  Zeeman 
splitting  and  the  cyclotron  energy  is  about  2%. 
This  leads  to  the  possibility  of  FQHE  ground  states 
which  are  not  spin-polarized.  It  also  gives  rise  to 
so-called  charge-spin-texture  excitations  near  v  = 


FIGURE  6.  Local  filling  factor  v{r)  as  a  function  of  r  (in  units  of  Ig)  for  a  system  of  45  electrons  in  a  magnetic  field  of 
B  =  5.0  T.  The  confining  potential  is  due  to  a  positive  background  charge  density  at  f  in  the  bulk,  and  falling 
linearly  to  zero  within  a  distance  a  near  the  edge.  For  a  stiff  edge  (a  =  0),  the  system  forms  a  composite  edge  with  v{r) 
rising  toward  unity  near  the  edge.  As  a  increases,  the  edge  becomes  softer  and  undergoes  a  reconstruction  to  a 
sequence  of  incompressible  strips. 
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1  /ni,  m  =  1,3, . . .  .  These  excitations  are  lower  in 
energy  than  a  simple  singlet  particle-hole  pair, 
and  believed  to  be  responsible  for  the  observed 
[41]  rapid  destruction  of  the  spin  polarization  near 
V  =  1.  Correctly  including  the  spin  degree  of  free¬ 
dom  to  account  for  this  involves  a  DFT  for  Heisen¬ 
berg  spins.  We  are  presently,  together  with  J. 
Kinaret  (Chalmers  University  of  Technology),  de¬ 
veloping  such  a  theory. 
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ABSTRACT _ 

The  Kohn-Sham  eigenvalues  were  formally  introduced  into  density  functional  theory  as 
Lagrange  multipliers  in  the  implementation  of  the  minimum  principle  for  the  total 
energy  of  a  many-electron  system.  No  general  results  are  available  concerning  the 
physical  significance  of  these  one-electron  eigenvalues  (with  the  exception  of  the  highest 
occupied  level,  which  equals  the  Fermi  energy).  Recent  ab  initio  calculations  of  dynamical 
response  in  metals  make  explicit  use  of  the  Kohn-Sham  band  structure,  and  associated 
wave  fimctions,  through  the  use  of  spectral  representations.  This  opens  up  the  possibility 
of  examining  the  significance  of  the  eigenvalues  at  an  ''empirical"  level,  i.e.,  through 
direct  comparison  with  the  results  of  spectroscopic  measurements.  A  particularly 
interesting  example  is  afforded  by  new  inelastic  x-ray  scattering  experiments  on  Al.  For 
a  special  wave  vector  transfer,  qg  ~  1.5k p,  the  measured  spectrum  provides  a  direct  mapping 
of  the  Kohn-Sham  noninteracting  spectrum.  For  a  range  of  wave  vectors  about  qQ,  the  bare 
Kohn-Sham  spectrum  still  reproduces  all  the  main  features  of  the  measurements;  this 
suggests  that,  in  this  metal,  the  Kohn-Sham  eigenvalues  are  good  approximations  to  the 
quasiparticle  energies.  We  also  discuss  the  interplay  between  Kohn-Sham  bands  and  the 
energy  of  the  "anomalous"  plasmon  in  Cs,  whose  dispersion  bears  a  signature  of  the 
excited-state  band  structure.  Finally,  and  in  a  more  formal  framework,  we  outline  the 
results  of  a  first-principles  comparison  between  quasiparticle  amplitudes  and  Kohn-Sham 
wave  functions  at  a  jellium  surface;  the  latter  turn  out  to  be  excellent  approximations  to 
the  former.  ©  1996  John  Wiley  &  Sons,  Inc. 
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Introduction 

Density  functional  theory  is  a  powerful  ap¬ 
proach  to  the  many-body  problem  of  inter¬ 
acting  electrons  [1].  In  particular,  Kohn  and  Sham 
[2]  established  a  rigorous  mapping  between  the 
intractable  problem  of  interacting  electrons  (e.g., 
the  ^10^^  valence  electrons  in  a  metal)  and  the 
much  simpler  problem  of  fictitious  noninteracting 
electrons  moving  in  an  effective,  local,  energy- 
independent  potential  which  contains  all  the 
many-body  effects  (these  effects  strongly  influence 
most  properties  of  metals).  The  great  practical  in¬ 
terest  of  this  scheme  resides  in  the  fact  that  the 
self-consistent  solution  of  the  Kohn-Sham  equa¬ 
tion  yields  the  density  for  which  the  energy  of  the 
ground  state  of  the  interacting  system  takes  on  its 
minimum  value.  The  scheme  would  be  exact,  i.e., 
the  density  would  equal  the  exact  many-body  den¬ 
sity,  if  the  exact  form  of  the  exchange-correlation 
energy  functional  E^^[n]  were  known — which,  of 
course,  it  is  not. 

The  local  density  approximation  (LDA)  is  the 
simplest  nontrivial  ansatz  for  This  approxi¬ 

mation,  already  discussed  in  the  two  original  pa¬ 
pers  on  density  functional  theory  [1,  2],  has  turned 
out  to  work  surprisingly  well  in  practice.  When 
LDA-based  density  functional  calculations  are  per¬ 
formed  strictly  within  the  confines  of  the  method 
— the  ground  state  of  the  interacting  system — the 
success  stories  have  been  many  [3~5].  Typically, 
lattice  parameters,  bulk  moduli,  phonon  frequen¬ 
cies,  and  other  observables  which  are  functionals 
of  the  ground-state  electron  density,  are  nowadays 
obtained  from  first  principles — that  is,  using  as 
input  only  the  atomic  number  for  the  species  un¬ 
der  study,  and  an  available  approximate  result  for 
the  correlation  energy  of  the  uniform  electron  gas 
— and  they  agree  with  experiment  very  well.  For 
strongly  correlated  systems  such  as  the  heavy 
Fermions,  the  high-T^-  cuprates,  and  even  for  a 
conventional  itinerant  ferromagnet  such  as  Fe,  the 
limitations  of  the  LDA,  and  its  spin-polarized 
counterpart,  LSD  A,  must  be  dealt  with  [6-8].  In 
some  cases,  gradient  corrections  to  the  LSDA  func¬ 
tional  may  improve  the  results  (e.g.,  the  ground 
state  of  Fe)  [9].  For  the  magnetic  systems,  the 
quality  of  the  results  may  even  depend  on  the 
details  of  the  parametrization  of  the  correlation 
energy  for  the  uniform  electron  gas  [10]. 


Now,  there  is  another  class  of  physical  problems 
for  which  the  information  contained  in  the 
Kohn-Sham  equation  has  been  used  rather  rou¬ 
tinely — and,  basically,  out  of  necessity.  Specifi¬ 
cally,  the  Lagrange  multipliers  of  the  theory,  the 
Kohn-Sham  eigenvalues  (typically,  evaluated 
within  the  LDA)  are  commonly  used  for  the  inter¬ 
pretation  of  spectra  obtained  via  photoemission 
[11]  (which  probes  the  occupied  states)  and  inverse 
photoemission  [12]  (which  probes  the  empty 
states).  This  procedure  has  proved  successful  in 
many  cases,  at  least  in  a  practical  sense  [11,  12]. 
However,  in  others  it  has  not — the  band-gap  prob¬ 
lem  in  semiconductors  [13,  14]  being  a  classic 
example.  [We  recall  that  the  origin  of  this  problem 
is  not  the  approximate  nature  of  the  LDA;  the 
optical  excitation  across  the  gap  is  strongly  influ¬ 
enced  by  the  dynamical  electron-hole  (exciton)  at¬ 
traction  [15],  which  is  a  many-body  process  out¬ 
side  the  one-electron  Kohn-Sham  scheme.] 

The  question  of  how  good  the  Kohn-Sham 
eigenvalues  are  as  an  approximation  to  the  excita¬ 
tion  energies  is  receiving  a  great  deal  of  attention 
these  days  [16,  17] — particularly  for  finite  systems, 
such  as  atoms  and  molecules  [16].  (In  the  case  of 
atoms,  there  is  the  additional  issue  that  the  LDA 
yields  poor  eigenvalues,  since  in  this  approxima¬ 
tion  the  one-electron  potential  decays  exponen¬ 
tially,  rather  than  as  the  inverse  power  of  the 
distance  from  the  nucleus  [3].)  Of  course,  anybody 
doing  spectroscopy  would  probably  agree  that 
what  is  needed  are  quasiparticle  band  structures 
(and  their  lifetimes),  as  these  are  the  ^'energy  lev¬ 
els''  which  photoelectrons  probe.  However,  quasi¬ 
particle  band  structure  calculations  in  the  presence 
of  the  periodic  ionic  potential  are  still  rather  rare 
[18],  and,  for  the  most  part,  confined  to  semicon¬ 
ductors  (Ref.  [19]  is  an  exception)  and  to  relatively 
simple  treatments  of  the  correlation  problem,  such 
as  the  dynamically  screened  Hartree-Fock  [20] — or 
GW  [21] — approximation.  More  elaborate  treat¬ 
ments  of  dynamical  correlations  are  at  the  present 
stage  the  province  of  discrete  models  of  highly 
correlated  materials,  such  as  the  two-dimensional 
Hubbard  model  [22]. 

In  this  article  we  examine  the  question  of  the 
interconnection  between  the  Kohn-Sham  band 
structure  and  real  electronic  excitations  in  metals 
at  both  an  "empirical"  level  and  at  a  more  "funda¬ 
mental"  level.  For  the  most  part,  we  confine  the 
presentation  to  the  empirical  category,  by  which 
we  mean  detailed  comparison  with  measured 
spectra.  Such  comparisons  are  now  becoming  possible 
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because  of  recent  advances  on  the  theoretical  front,  by 
virtue  of  which  loss  spectra  can  be  calculated  ab  initio, 
that  is,  with  use  of  a  realistic  representation  for  the 
Kohn-Sham  states  in  crystals  (as  opposed  to,  e.g., 
jellium  model-based  calculations);  see  Ref.  [23], 
and  references  therein. 

A  very  recent  inelastic  x-ray  scattering  investi¬ 
gation  of  A1  provides  an  excellent  test  case,  as  in 
that  study  a  wave  vector  transfer  is  identified  (q  = 
qQ  -  1.5k p)  for  which  the  x-ray  photons  couple 
directly  with  noninteracting  electron-hole  pairs 
[24].  We  find  that  the  "'Kohn-Sham  spectrum" 
agrees  very  well  with  the  x-ray  data  obtained  for 
this  unique  experimental  condition.  Away  from 
this  condition,  the  x-rays,  of  course,  excite  dressed 
electron-hole  pairs.  Yet,  in  general,  for  wave  vec¬ 
tors  which  are  large  enough  that  the  coherent 
aspects  of  the  electronic  response  are  suppressed, 
the  bare  Kohn-Sham  (LDA-)  spectrum  still  dis¬ 
plays  all  the  main  features  of  the  measured  spec¬ 
tra.  This  finding  suggests  that  the  unoccupied 
Kohn-Sham  bands  are  not  too  different  from 
quasiparticle  bands,  even  at  energies  as  high  as  60 
eV  above  the  Fermi  level.  (Quasiparticle  lifetimes 
are  outside  this  argument.) 

The  heaviest  alkali  metal,  Cs,  is  another  interest¬ 
ing  system  for  which  to  address  the  same  issue. 
This  possibility  arises  because  it  turns  out  that 
interband  transitions  into  empty  states  just  above 
0)^^  [25]  influence  the  measured  "anomalous"  dis¬ 
persion  [26]  of  the  Cs  plasmon  appreciably.  While 
other  physical  effects — such  as  correlation — also 
play  a  role,  even  for  small  wave  vectors  [25],  our 
presentation  highlights  the  interplay  between  se¬ 
lected  Kohn-Sham  bands  and  the  measured  plas¬ 
mon  energy.  This  interplay  is  such  that  it  leads  to 
a  negative  plasmon  dispersion  [27]  for  small  wave 
vectors. 

A  message  that  comes  out  of  these  studies  of 
electronic  excitations  in  prototype  metals  is  that 
we  expect  that  a  "many-body  loop,"  which  would 
start  with  a  Kohn-Sham  band  structure,  and  would 
proceed  by  renormalizing  the  bands  via  dynamical 
self-energy  insertions,  should  converge  rather 
quickly.  Of  course,  a  wider  variety  of  metallic 
systems  needs  to  be  investigated  along  similar 
lines. 

Finally,  we  present  a  sketch  of  a  first-principles 
comparison  [28]  between  quasiparticle  theory  and 
density  functional  theory  at  a  jellium  surface — this 
study  belongs  in  the  "fundamental"  category.  The 
emphasis  is  placed  on  states  which  are  self-bound 
to  the  surface  by  long-range  Coulomb  correlations. 


the  image  states  [29].  The  Kohn-Sham  eigenfunc¬ 
tions  for  these  states — which  must  be  calculated 
without  invoking  the  LDA,  as  this  approximation 
ignores  long-range  correlations — turn  out  to  be 
excellent  approximations  to  the  corresponding 
quasiparticle  amplitudes.  Furthermore,  the  quasi¬ 
particle  energy  shifts  are  found  to  be  quite  small. 
Thus,  at  least  as  far  as  the  manifestations  of  the 
real  part  of  the  electron  self-energy  are  concerned, 
the  Kohn-Sham  states  represent  the  quasiparticle 
states  near  the  vacuum  level  (~4  eV  above  the 
Fermi  level)  very  well. 


Linear  Response 

For  the  purposes  of  our  inquiry  into  the  possible 
experimental  manifestation  of  the  Kohn-Sham 
states,  it  is  convenient  to  begin  with  the  following 
spectral  representation  for  the  spatial  Fourier 
transform  of  the  noninteracting  electron-hole  po¬ 
larizability  (X)), 

a'g/g' ’(q; 

^  BZ  r  _  /  , 

k  n.n’  ^k,n  “  Ek  +  q,«'  + 

X<k,  +  +  q,n'} 

X{k  +  (1) 

where  is  the  normalization  volume  and  G  is 
a  vector  of  the  reciprocal  lattice.  The  label  KS  add¬ 
ed  to  the  polarizability  serves  as  a  reminder  that 
the  "noninteracting"  electrons  we  have  in  mind 
are  those  described  by  Kohn-Sham  eigenfunctions 
and  eigenvalues  ("independent"  may  be  a  better 
denomination  than  noninteracting,  since  the 
Kohn-Sham  electrons  experience  a  mean  field 
which  includes  exchange-correlation  effects).  The 
matrix  elements  in  Eq.  (1)  are  evaluated  by  ex¬ 
panding  the  one-electron  Bloch  (Kohn-Sham) 
states  in  an  appropriate  basis;  the  expansion  coeffi¬ 
cients  are  obtained  from  the  self-consistent  solu¬ 
tion  of  the  Kohn-Sham  ground  state.  In  principle, 
Eq.  (1),  and  the  discussion  which  follows,  refer  to 
an  exact  implementation  of  density  functional  the¬ 
ory;  in  practice,  most  calculations — in  particular, 
the  physical  results  discussed  in  the  next  two 
sections — start  from  a  band  structure  obtained 
within  the  LDA.  [Some  details  of  the  numerical 
evaluation  of  Eq.  (1),  and  of  the  ensuing  loss  spec- 


INTERNATIONAL  JOURNAL  OF  QUANTUM  CHEMISTRY 


1459 


EGUILUZ 


trum  (number  of  bands  included,  sampling  of  the 
Brillouin  zone,  size  of  the  response  matrices,  etc.)^ 
can  be  found  in  Refs.  [25,  30-32].] 

In  the  time-dependent  version  of  density  func¬ 
tional  theory  (this  method  is  a  relatively  recent 
development)  [33,  34],  the  true  density  response 
function  obeys  an  integral  equation  of 

the  form 

X  =  X  +  XVX,  (2) 


fxc  ~  /  dV^^(x)/dn(x),  where  V^J^n)  is  the 

exchange-correlation  potential  obtained  in  an  LDA 
ground-state  calculation. 

In  diagrammatic  approaches  to  the  electronic- 
response  problem,  it  is  rather  customary  to  intro¬ 
duce  the  random-phase  approximation  (RPA)  for 
the  irreducible  polarizability,  the  reference 

density  response  for  noninteracting  electrons  [39], 
In  that  case,  the  counterpart  of  Eq.  (5)  can  be 
written  as 


where  v  denotes  the  bare  Coulomb  interaction  and 
X  is  the  '"irreducible  polarizability,"  related  to 
^(0;KS)  equation 


=  (3) 

where  the  "vertex  correction"  is  rigorously 
given  by  the  functional  derivative  of  the  time- 
dependent  exchange-correlation  potential  [34]  with 
respect  to  the  fluctuating  density,  i.e,. 


rKS  = 
J  xc 


8V,,(xj) 

Tnix'J')  * 


(4) 


We  note  that  we  have  chosen  to  write  Eqs.  (2) 
and  (3)  in  a  form  which  facilitates  the  comparison 
with  the  diagrammatic  approach  [35].  To  this  end, 
we  furthermore  write  the  solution  of  Eq.  (2)  a  la 
Hubbard  [36],  i.e.. 


_  gKS)^,0;KS)J-’^  (5) 

a  result  which  follows  immediately  from  Eqs.  (2) 
and  (3)  upon  introducing  the  "many-body  local- 
field  factor"  G^^(q,  oj)  [35,  36],  defined  such  that 
/xc^  =  [Note  that,  while  for  the  jellium 

model  the  density  response  function  is  a  scalar, 
for  the  general  case  of  band  electrons  in  a  periodic 
crystal  potential,  all  elements  of  Eq.  (5)  must  be 
understood  as  matrices  in  reciprocal  space.] 

In  the  time-dependent  density  functional  frame¬ 
work  [33,  34],  Eq.  (5)  is  formally  exact.  However, 
not  much  is  known  explicitly  about  the  functional 
form  of  V^^(x,  t).  (For  recent  work  on  the  treatment 
of  dynamical  correlations  in  jellium,  see  Ref.  [37]; 
exchange-only  calculations  for  atoms  within  the 
so-called  optimized  effective  potential  are  dis¬ 
cussed  in  Ref.  [34].)  Most  calculations  of  response 
within  density  funttional  theory — particularly  in 
the  presence  of  the  crystal  structure — have  so  far 
resorted  to  making  an  adiabatic  ansatz,  such  as  the 
"time-dependent"  LDA  (or  TDLDA)  [38],  in  which 


A'  =  -  v(l  -  \  (6) 

where  we  have  introduced  a  general  many-body 
local-field  factor  G(q,  oj).  We  stress  that  Eq.  (6) 
may  also  be  viewed  as  giving  the  exact  density 
response  function — if  we  agree  that  all  the  effects 
of  correlation  are  to  be  included  inG(q,  oj).  Of 
course,  this  is  just  a  definition;  in  actual  calcula¬ 
tions,  the  local-field  factor  is  obtained  only  approx¬ 
imately  [35].  Note  also  that  the  propagators  which 
define  are  bare  [40]  (i.e.,  they  contain  no  self¬ 
energy  or  exchange-correlation,  effects),  and  this 
translates  into  the  result  that,  although  the  spectral 
representation  for  x^^^  the  same  form  as  Eq. 
(1),  the  energy  eigenvalues  and  matrix  elements 
are  to  be  obtained  for  the  Hartree  one-electron 
Hamiltonian. 

As  just  indicated,  the  two  "pictures" — i.e.,  the 
response  methods  based  on  either  Eq.  (5)  or  Eq.  (6) 
— formally  yield  the  same  exact  density  response. 
However,  the  respectively  noninteracting  polariz¬ 
abilities,  and  are,  in  general,  different 

(an  exception  is  encountered  in  the  case  of  A1 
investigated  below);  thus,  the  corresponding 
many-body  local  field  factors  must  be  different  as 
well.  In  fact,  from  Eqs.  (5)  and  (6)  it  is  a  simple 
exercise  to  obtain  the  following  relation  between 
them  [41]: 

GKS-G=  (7) 

Traditionally,  the  many-body  aspects  of  the 
electronic  response  have  been  discussed  via  the 
diagrammatic  approach  [Eq.  (6)],  whose  imple¬ 
mentation  has  been  mostly  restricted  to  the  jellium 
model  [35].  However,  the  physics  discussed  in  the 
next  two  sections  is  outside  this  time-honored 
model;  one  must  then  tackle  the  spectrum  of  exci¬ 
tations  of  real  crystals.  As  is  well  known,  calcula¬ 
tions  for  crystals  are  the  established  domain  of  the 
Kohn-Sham  method — as  far  as  ground-state  ob¬ 
servables  are  concerned.  Equation  (7)  provides  a 
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convenient  link  between  both  approaches  to  the 
treatment  of  the  effects  of  correlation  in  the  den¬ 
sity  response  function  x- 


Probing  the  Kohn-Sham  States  in 
Alnminum  with  X-rays 

Larson  et  al.  [24]  have  very  recently  performed 
extensive  measurements  of  the  dynamical  struc¬ 
ture  factor  of  Al  at  the  Oak  Ridge  National  Labora¬ 
tory  x-ray  beam  line  at  Brookhaven's  National 
Synchrotron  Light  Source.  The  emphasis  was 
placed  on  wave  vectors  ~  Ikp,  as  calculations  by 
Fleszar  et  al.  [31]  had  raised  the  possibility  that 
such  measurements  could  be  used  to  prove  the 
details  of  the  short-range  electron-electron  correla¬ 
tions.  This  wave  vector  regime  is,  of  course,  central 
to  Hartree-Fock  theory,  which  predicts  a  spin  den¬ 
sity  wave  instability  in  the  electron  gas  [42].  Fur¬ 
thermore,  the  large  wave  vector  response  has  a 
long  history  of  interest  in  connection  with  the 
famous  double-peak  loss  structure  of  the  inelastic 
x-ray  spectrum  of  simple  systems;  see  Refs.  [31,  43, 
44],  and  references  therein. 

We  recall  that  the  cross  section  for  inelastic 
x-ray  scattering  is  proportional  to  the  dynamical 
structure  factor  S(q,  w),  given  by  the  equation 

S(q;  ft))  =  — 2ftnf^Im 

Thus,  the  cross  section  is  directly  obtained  from 
the  solution  of  Eq.  (5).  Inspection  of  the  experimen¬ 
tal  data  [24]  readily  shows  that  the  x-ray  spectrum 
of  Al  contains  a  distinct  "crossover"  from  a  regime 
iq  <  kp)  fox  which  mean-field  response  (the  RPA) 
works  well,  to  a  regime  fq  >  kp)  where  it  does  not. 
An  equivalent  way  to  phrase  this  is  to  state  that 
for  q  >  kp  the  response  becomes  predominantly 
incoherent — which  is  precisely  the  regime  that  we 
must  have  access  to,  if  we  are  to  isolate  the  re¬ 
sponse  of  the  one-electron  states.  Note  that  this 
regime  is  not  easily  accessible  with  other  probes, 
such  as  inelastic  electron  scattering. 

In  fact,  the  large  impact  of  correlation  for  q  >  kp 
and  energies  up  to  hw  ~  40  eV  made  possible  the 
first  experimental  determination  of  the  many-body 
local-field  factor  for  Al  for  large  wave  vectors  [24]. 
In  brief,  such  determination  proceeds  by  "invert¬ 
ing"  Eq.  (5)  (whose  left-hand  side  corresponds  to 
the  measured  data)  to  extract  the  local-field  factor 
[45]  on  the  basis  of  the  knowledge  of  the  noninter¬ 
acting  spectrum  S‘”'‘^®*(q,  to)— which  obtains  from 


Eq.  (8)  upon  replacing  the  true  response  x  by  its 
noninteracting  counterpart  given  by  Eq.  (1);  such 
replacement  is  equivalent  to  setting  v(q)  =  0  in 
Eq.  (5).  Remarkably,  the  measured  local-field  fac¬ 
tor,  Ge'^p(q,  ft)),  turns  out  to  be  much  stronger  than 
predicted  by  theory  for  q  ->  2kp  [46-50] — with 
one  exception  [49];  this  result  is  indicative  of  strong 
many-body  correlations. 

From  the  perspective  of  the  present  article,  the 
key  feature  of  the  experiments  of  Larson  et  al.  [24] 
which  we  want  to  focus  on  is  that  Gg^(q,  co)  equals 
unity  for  q  =  q^  ~  1.5A:j..  Most  importantly,  this 
result  holds  (approximately)  over  a  large  fre¬ 
quency  interval — the  data  of  Ref.  [24]  are  consis¬ 
tent  with  a  weak  (o  dependence  of  the  local-field 
factor  for  energies  less  than  ~  35  eV;  furthermore, 
Gg^(q,  ft))  turns  out  to  be  predominantly  real  in 
this  spectral  region.  Thus,  for  q  =  qo  conditions  are 
realized  such  that  from  Eq.  (5)  it  follows  that 
S(qo,ft>)  «  w)  (i.e.,  Im  ~  Iin  for 

q  =  qo)  over  a  rather  wide  frequency  inter¬ 
val — which  includes  the  double-peak  loss  struc¬ 
ture.  This  situation  is  striking,  since  it  means  that, 
for  q  =  qo>  the  x-rays  photons  probe  the  response 
of  noninteracting  (independent)  electron-hole  pairs. 
This  provides  us  with  the  opportunity  of  “taking  a 
look”  at  the  Kohn-Sham  eigenvalues  {band  structure). 

The  above  statement  is  borne  out  by  the  results 
of  Figure  1,  in  which  we  compare  explicitly  the 
x-ray  data  for  q  =  qo  with  the  Kohn-Sham  spec¬ 
trum  w),  obtained  in  the  LDA.  The 

agreement  between  both  spectra  is  extremely  good; 
note  that  we  are  comparing  absolute  intensities, 
and  that  the  calculation  contains  no  adjustable 
parameters.  Clearly,  then,  for  q  =  qo  the  measured 
spectrum  provides  a  direct  physical  realization  of 
the  Kohn-Sham  excited-state  one-electron  band 
structure.  We  are  not  aware  of  a  similar  example 
in  any  other  experimental  context. 

[The  exaggerated  feature  in  w)  at  ho) 

~  40  eV  is  probably  due  to  residual  effects  of  the 
Coulomb  interaction,  i.e,.  to  the  approximate  na¬ 
ture  of  the  equality  G^^fqo,  ft))  =  1  over  the  fre¬ 
quency  range  covered  by  the  data  shown  in  Figure 
1.  In  addition,  for  fift)~40  eV,  and  for  higher 
frequencies  as  well,  the  data  are  rather  insensitive 
to  the  effects  of  correlation — this  increases  the 
intrinsic  error  of  the  measured  value  of  local-field 
factor  for  such  frequencies  [24].] 

Now,  the  definition  of  the  "special"  wave  vec¬ 
tor  (Jq  was  made  on  the  basis  of  Eq.  (5),  i.e.,  in  the 
context  of  the  density  functional  approach  to  dy¬ 
namical  response.  We  noted  above  that  the  many- 
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FIGURE  1 .  Inelastic  x-ray  spectrum  of  Al  for  q  =  Qq  = 
1.5/Cp — the  wave  vector  for  which  Gg^^p(q,  w)  =  1,  as 
determined  in  Ref.  [24]  (see  text).  The  empty  circles  are 
the  measurements  of  Larson  et  al.  [24].  The  solid  curve 
corresponds  to  w),  which  is  the  loss  spectrum 

for  “bare”  Kohn-Sham  electrons,  obtained  by  replacing 
Eq.  (1)  into  Eq.  (8).  For  comparison,  the  corresponding 
noninteracting  spectrum  for  jellium  (Lindhard  function)  is 
also  shown. 


body  local-field  factor  introduced  in  the  diagram¬ 
matic  approach  differs,  in  general,  from  its  density 
functional  counterpart;  see  Eq.  (7).  Thus,  on  first 
sight  it  may  seem  that  the  preceding  analysis  may 
contain  an  element  of  ambiguity  in  the  assignment 
of  the  balance  between  the  band  structure  and 
many-body  effects.  However,  there  is  no  ambigu¬ 
ity;  explicit  calculation  shows  that,  for  Al,  the 
'Truly  noninteracting''  response  evaluated  in 
Hartree  (RPA)  theory  is  identical,  on  the  scale  of 
Figure  1,  with  the  Kohn-Sham  noninteracting  re¬ 
sponse  From  this  result  it  follows  that 

=  G,  at  least  for  the  domain  of  wave  vectors 
and  frequencies  relevant  to  the  experiments  of  Ref. 
[24].  Thus,  the  significance  of  the  wave  vector 
for  the  x-ray  spectum  of  Al  is  identical  in  both 
"pictures." 

Thus  far  we  have  argued  that,  for  a  special 
experimental  condition,  which  materializes  for 
q  =  qof  the  x-rays  excite  "bare"  Kohn-Sham 
electrons.  What  about  other  wave  vectors? — how 
different  is  the  measured  response  from  in 


the  general  case?  We  stress  that  this  argument  refers 
to  a  comparison  of  the  data  with  not  with  the 

full  (''dressed")  response  x- 

The  answer  to  the  question  just  posed  is  con¬ 
tained  in  Figure  2.  For  pedagogical  reasons,  we 
show  results  for  both  the  smaller  q  regime  (left 
panel),  in  which  the  response  is  dominated  by  the 
collective  mode  (plasmon),  and  for  a  range  of 
wave  vectors  about  qQ,  for  which  the  response  is 
manifestly  incoherent  (right  panel).  It  is  apparent 
that,  in  the  incoherent  regime,  the  response  of 
independent  Kohn-Sham  electrons,  (o), 

provides  a  rather  good  approximation  to  the  x-ray 
data — it  reproduces  the  fine  structure  of  the  mea¬ 
sured  spectra  quite  well,  and  the  absolute  intensi¬ 
ties  are  not  that  far  off.  (A  more  exhaustive  series 
of  measurements,  including  data  for  even  larger 
wave  vectors,  is  presented  in  Ref.  [24].)  We  inter¬ 
pret  this  result  as  a  strong  indication  that,  for  Al, 
the  Kohn-Sham  eigenvalues  and  wave  functions 
are  good  first  approximations  to  their  quasipartide 
counterparts — even  for  excited  states  which  are  as 
far  away  from  the  Fermi  surface  as  are  probed  by 
the  data  of  Figure  2. 

We  finish  this  discussion  with  an  overview  of 
the  method  of  analysis  presented  above.  One  may 
still  ask  what  response  scheme — based  on  Kohn- 
Sham  or  RPA  noninteracting  polarizabilities — is 
more  suitable,  in  general,  for  the  purposes  of  ana¬ 
lyzing  the  x-ray  data.  Both  approaches  are  entirely 
equivalent  within  the  jellium  model,  since  in  that 
case  the  Kohn-Sham  wave  functions  are  plane 
waves,  and  the  many-body  effect  contained  in 
amounts  to  a  trivial  shift  of  all  the  eigenval¬ 
ues.  However,  a  metal  would  not  even  bind  in  the 
absence  of  electron  correlations  (which  is  why  the 
calculation  of  the  noninteracting  RPA  response 
alluded  to  above  was  performed  by  clamping  the 
crystal  at  the  lattice  constant  for  which  the  total 
energy  minimizes  in  EDA),  and  this  favors  the 
procedure  adopted  above — i.e.,  the  extraction  of 
the  local-field  factor  based  on  Eq.  (5),  not  on  Eq. 
(6).  The  opposite  viewpoint  was  adopted  in  Ref. 
[24],  but,  as  noted  above,  for  Al  there  is  no  differ¬ 
ence.  [Incidentally,  on  first  sight,  the  fact  that,  for 
Al,  a.ppear  to  imply  that  the 

inelastic  x-ray  spectrum  of  this  metal  is  well  repro¬ 
duced  by  the  jellium  model;  we  emphasize  that 
this  is  not  the  case,  as  Figure  1  clearly  illustrates.] 
For  a  metal  with  localized  electrons  at  the  Fermi 
level,  such  as  Pd,  and  x^’^^  quite  differ¬ 

ent  [51]  (the  correlation  effect  contained  in  V^fn) 
leads  to  a  narrowing  of  the  occupied  d-hand  com- 
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FIGURE  2.  Comparison  of  measured  inelastic  x-ray  spectra  of  Al  [24]  for  a  variety  of  wave  vectors  with  the 
corresponding  loss  spectra  for  independent  Kohn-Sham  electrons,  (o).  See  text  for  details. 


plex),  and  thus  the  respective  many-body  local- 
field  factors  are  intrinsically  different.  Since  the 
LDA  provides  a  valuable  starting  point  for  the 
treatment  of  correlation  in  crystals,  we  anticipate 
that  forthcoming  experiments  on  the  transition 
metals  (which  will  make  use  of  new  sources  of 
hard  x-rays,  such  as  the  Advanced  Photon  Source 
at  Argonne)  will  be  analyzed  on  the  basis  of  Eq.  (5) 
— if  desired,  the  transformation  to  the  RPA-based 
picture  can  be  performed  via  Eq.  (7). 


Interplay  between  Kohn-Sham  Bands 
and  the  Dielectric  Response  of  Cesium 

The  dielectric  response  of  Cs  provides  us  with 
another  example  of  the  “empirical"  manifestation 
of  the  Kohn-Sham  band  structure  in  the  excitation 
spectrum  of  metals.  High-resolution  electron  en¬ 
ergy  loss  measurements  by  vom  Felde  et  al.  [26] 
have  shown  that  the  plasmon  dispersion  relation 
of  the  heavy  alkalis  differs  drastically  from  the 
picture  of  the  collective  response  of  the  simple 
metals  held  for  many  years — ever  since  the  devel¬ 
opment  of  the  RPA  for  the  interacting  electron  gas 
[39].  In  the  case  of  Cs,  instead  of  a  "conventional" 
(upward)  parabolic  dispersion  curve,  controlled  by 
the  compressibility,  the  energy  of  the  plasmon 


disperses  downward  for  cj  0;  furthermore,  the 
dispersion  curve  remains  quite  flat  for  large  wave 
vectors. 

In  Figure  3  (which  centers  on  the  small  wave 
vector  domain)  we  show  the  measured  plasmon 
dispersion  relation  [26],  together  with  our  calcu¬ 
lated  dispersion  curve  for  Cs  [25]  and  for  electrons 
in  jellium  with  the  bulk  density  of  Cs  (rs  =  5.6).  (A 
negative  plasmon  dispersion  for  a  realistic  model 
of  Cs  was  first  reported  in  Ref.  [27].)  The  theoreti¬ 
cal  dispersion  curve  was  obtained  from  the  posi¬ 
tion  of  the  main  peak  in  the  loss  spectrum  S(q,  (o) 
given  by  Eq.  (8).  Clearly,  there  is  excellent  agree¬ 
ment  with  experiment.  The  key  physical  ingredi¬ 
ents  behind  this  agreement  are  [25]:  the  effects  of 
the  band  structure,  the  matrix  nature  of  the  re¬ 
sponse,  which  reflects  the  contribution  to  the  po¬ 
larizability  from  the  spatially  localized  5p  semi¬ 
core  states,  and  the  related  enhancement  of  the 
importance  of  the  many-body  Coulomb  correla¬ 
tions  for  small  wave  vectors. 

In  keeping  with  the  theme  of  this  presentation, 
we  proceed  to  highlight  the  role  of  the  Kohn-Sham 
band  structure.  We  do  this  with  reference  to  Fig¬ 
ure  4,  in  which  the  central  issue  is  the  effect  of  the 
unoccupied  bands  on  the  position  of  the  root  of  the 
equation  Re  e^^o,  g  =o(‘1;  =  0— an  alternative 

definition  of  the  energy  of  the  plasmon  for  a  given 
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FIGURE  3.  Plasmon  dispersion  relation  in  Cs  for  small 
wave  vectors  along  the  (1 10)  direction.  The  figure  shows 
the  experimental  data  of  Ref.  [26]  (crosses)  and  the 
crystal  calculation  of  Ref.  [25]  (full  circles),  together  with 
the  RPA  dispersion  curve  for  jellium  (empty  squares).  (In 
the  crystal  calculation  the  5p  semicore  states  were 
treated  as  valence  states;  furthermore,  that  calculation 
includes  the  effects  of  the  crystal  local  fields  and  of  a 
many-body  local-field,  or  vertex  correction  [25].) 


q.  Now  the  exact  dielectric  function  is  given  by 
e  =  I  -  vx  [see  Eq.  (3)],  where,  in  general,  all 
quantities  are  matrices  in  reciprocal  space.  For 
independent  Kohn-Sham  electrons  we  have  = 
I  _  j,^(0;KS).  focusing  on  the  empty 

states,  in  Figure  4  we  have  not  included  the  contri¬ 
bution  from  the  semicore  states  [25],  in  which  case 
the  response  becomes  predominantly  diagonal,  and 
this  is  why  only  the  G  =  0,  G'  =  0  element  of  the 
dielectric  function  enters  the  present  argument. 

Figure  4  shows,  in  the  frequency  region  in  the 
vicinity  of  the  plasmon  energy  ( 3  eV),  Re  q 
g'^q(c[)o))  calculated  for  “real"  Cs  (solid  curve) 
and  its  counterpart  for  jellium  (long  dashes).  First, 
it  is  apparent  that  the  zero  of  the  (real  part  of  the) 
dielectric  function  of  the  crystal  is  shifted  down¬ 
ward  relative  to  its  position  for  the  jellium  model 
— the  location  of  the  respective  zero  of  Re 
=  is  indicated  by  a  pair  of  arrows.  The 

negative  plasmon  dispersion  (or  q  ^  0  is  traced  to 


this  shift  [32],  whose  magnitude  is  wave  vector 
dependent. 

Next,  we  draw  attention  to  the  curve  in  short 
dashes,  which  corresponds  to  the  dielectric  func¬ 
tion  of  the  crystal  obtained  upon  eliminating  from 
Eq.  (1)  the  contribution  from  final  states  corre¬ 
sponding  to  a  narrow  complex  of  Kohn-Sham 
bands  lying  just  above  the  plasmon  energy.  The 
inset  of  Figure  4  shows  the  calculated  density  of 
(Kohn-Sham)  states  (DOS);  the  lower  local  maxi¬ 
mum  in  the  DOS  (note  that  it  occurs  at  ^  3  eV!) 
due  to  the  bands  in  question  is  highlighted  by  an 
arrow.  These  bands,  which  are  predominantly  of 
d-like  symmetry  [52],  roughly  span  the  3-4  eV 
range.  As  seen  in  the  figure,  in  the  absence  of 
those  interband  transitions,  the  root  of  Re 
G'  =  o(q;  a;)  =  0  for  the  crystal  coincides  with  its 
frequency  position  for  jellium.  In  that  case  the 
anomaly  goes  away — the  plasmon  dispersion  is  then 
solely  determined  by  the  compressibility,  which  is 
of  course  positive,  since  the  system  is  stable,  and 
the  plasmon  disperses  upward  [53]. 

[Although  for  small  wave  vectors  the  spectral 
weight  of  the  one-electron  transitions  just  identi¬ 
fied  is  small  [32],  they  involve  final  states  which 
are  nearly  degenerate  with  the  plasmon;  further¬ 
more,  the  real  part  of  the  dielectric  function  of  the 
crystal  reaches  its  zero  quite  gradually,  as  seen  in 
Figure  4.  Both  features  of  the  excitation  spectrum 
of  Cs  contribute  to  the  sensitivity  of  the  energy  of 
the  plasmon  to  small  "perturbations,"  such  as 
introduced  by  the  unoccupied  Kohn-Sham  bands 
whose  effect  is  highlighted  in  the  figure.] 

To  reiterate:  If  we  remove  the  spiky  complex  in 
the  Kohn-Sham  DOS  which  lies  just  above  the 
dispersion  relation  of  the  plasmon  of  Cs  changes 
qualitatively — it  becomes  the  same  as  that  for  elec¬ 
trons  in  jellium  (cf.  Fig.  3).  Thus  the  plasmon  of  Cs 
embodies  a  remarkable  signature  of  the  Kohn- 
Sham  excited-state  band  structure.  The  impact  of  these 
one-electron  (Kohn-Sham)  transitions  on  the  di¬ 
electric  function  of  Cs  is  such  that,  in  conjunction 
with  other  physical  effects  which  prove  important 
in  the  present  problem — crystal  local  fields,  elec¬ 
tron-electron  correlations — excellent  agreement 
with  the  measured  plasmon  dispersion  relation  is 
obtained  for  small  wave  vectors  [25]. 

In  summary:  Although  there  is  no  general 
Koopmans-like  theorem  relating  the  Kohn-Sham 
one-electron  states  to  electronic  excitations,  the 
above  results  for  the  plasmon  of  Cs,  together  with 
the  direct  observation  of  oj)  on  which  we 

based  our  discussion  of  the  x-ray  spectrum  of  A1 
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FIGURE  4.  Re  eelo  g'  =  o(<1'  curve)  and  for  its  jellium  counterpart  (long  dashes),  in  the  vicinity  of  Wp 

(~  3  eV),  The  position  of  the  respective  zero  of  the  (real  part  of  the)  dielectric  function  is  indicated  by  an  arrow.  The 
wave  vector  q  is  along  the  (110)  direction.  The  short  dashes  represent  the  result  of  excluding  from  Eq.  (1)  the 
contribution  from  the  spiky  complex  in  the  DOS  (see  inset)  lying  just  above  Wp.  [In  the  present  case  (cf.  Fig.  3)  the 
crystal  calculation  does  not  include  the  effect  of  the  semicore  states.] 


for  large  wave  vectors,  provide  concrete  examples 
in  which  dynamical  polarizabilities  obtained  from 
Kohn-Sham  electrons  compare  with  experiment 
quite  well.  Ongoing  calculations  for  a  metal  with 
rf-like  orbitals  at  the  Fermi  surface,  namely  Pd, 
tend  to  agree  with  this  finding  [51]. 

Of  course,  by  its  very  nature,  an  "empirical" 
study  such  as  the  one  we  have  outlined  for  A1  and 
Cs  can  only  be  deemed  as  being  "suggestive." 
Nonetheless,  on  the  basis  of  the  above  presentation 
we  propose  that,  for  these  metals,  the  Kohn-Sham 
independent  electrons  provide  a  good  first  approx¬ 
imation  to  the  true  quasiparticle  states  (with  the 
caveat  that  the  quasiparticle  lifetimes  are  beyond 
the  realm  of  the  Kohn-Sham  scheme).  Clearly, 
more  fundamental  investigations,  based  on  the 


evaluation  of  quasiparticle  band  structures  (in¬ 
cluding  lifetimes),  must  be  undertaken.  Such  cal¬ 
culations  may  be  feasible;  the  above  discussion  gives 
us  the  hope  that  the  Kohn-Sham  band  structure  may  be 
such  a  good  initial  "guess"  that  (the  computationally 
expensive)  many-body  self-consistency  may  be  achieved 
in  a  few  iteration  steps. 


Quasiparticles  and  Kohn-Sham 
Electrons  at  a  Jellium  Surface 

The  spectrum  of  electronic  excitations  can  be 
obtained  from  the  solution  of  the  Dyson  equation 
for  the  one-electron  Green's  function,  or,  equiva¬ 
lently,  by  the  self-consistent  solution  of  the  quasi- 
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particle  equation  [54]: 

+  j  d^x'I,^^(x,x';  Eqp)'Vqp(x',  Egp) 

=  Eqp^qp(x,  Eqp),  (9) 

where  ^Qp(x,  Egp)  is  a  quasiparticle  amplitude  and 
Eqp  is  the  corresponding  eigenvalue — which,  we 
recall,  corresponds  to  a  physical  excitation  energy 
of  the  many-electron  system  in  a  metal,  as  probed, 
for  example,  by  an  electron  added  in  the  inverse 
photoemission  process.  In  Eq.  (9)  we  have  intro¬ 
duced  the  electron  self-energy,  E),  which 

plays  the  role  of  a  complex,  nonlocal,  energy- 
dependent  ''potential"  in  which  the  quasiparticle 
propagates — and  which  the  quasiparticle  actually 
sets  up  self-consistently.  In  the  calculations  on 
which  the  following  outline  is  based,  the  self¬ 
energy  was  evaluated  in  the  so-called  GW  approx¬ 
imation  [20,  21],  in  which 

5,,(x,  x';E)  =  f  cIE'  G(x,  x';  E  -h  E') 

277 

X  W(x,x';  EO,  (10) 

where  G(x,  x';  E)  is  the  one-electron  Green's  func¬ 
tion,  and  W(x,  x';  E)  is  the  dynamically  screened 
electron-electron  interaction,  given  by  the  equa¬ 
tion  W  =  V  where  x  is  the  time-ordered 

counterpart  of  the  density  response  function  intro¬ 
duced  in  Eq.  (2).  We  note  that  a  description  of  the 
screening  consistent  with  the  neglect  of  vertex 
corrections  in  Eq.  (10)  requires  that  the  density 
response  x  used  in  the  evaluation  of  W  be  the 
solution  to  Eq.  (2)  for  x  =  — which  is  the  RPA, 

obtained  also  from  Eq.  (6)  with  G(q,  co)  =  0.  [The 
exact  self-energy  is  formally  given  by  =  GWE, 
where  E  is  the  vertex  function;  E  =  1  in  Eq.  (10).] 
The  solutions  of  Eqs.  (9)  and  (10)  have  been 
compared  in  detail  [28]  with  the  self-consistent 
solutions  of  the  much  simpler  Kohn-Sham  equa¬ 
tion, 

1“  +  R^^(x)|</),Xx)  =  E,.(/>,Xx), 

(11) 

for  the  case  of  a  metal  surface  modeled  by  jellium 
— this  model  affords  a  tractable  realization  of  a 
strongly  inhomogeneous  interacting  electron  sys¬ 


tem.  The  emphasis  was  placed  on  states  which  are 
bound  by  the  image  tail  of  the  surface  barrier, 
namely  the  image  states  [29]  (which  are  reso¬ 
nances,  in  the  absence  of  a  band  gap).  The  physics 
of  these  states  is  governed  by  the  dual  features  of 
long-range  Coulomb  correlations  and  strong  charge 
localization  at  the  surface. 

Now,  in  the  EDA,  is  a  function  of  the  local 
value  of  the  density  n(x),  and  this  leads  to  a 
surface  barrier  which  decays  exponentially  into  the 
vacuum  outside  the  surface.  By  contrast,  the  cor¬ 
rect  surface  barrier  has  a  long-range  z~^  tail.  Thus, 
the  image  states  are  entirely  beyond  the  realm  of 
the  EDA,  which  is  the  approximation  for  correla¬ 
tion  for  which  Eq.  (11)  is  typically  solved. 

A  scheme  was  developed  for  going  beyond  the 
EDA  in  Eq.  (11)  in  a  nonperturbative  fashion  [55, 
56].  By  nonperturbative  we  mean  that  the  rapid 
rate  of  decay  of  the  electron  density  at  the  surface 
is  treated  exactly,  i.e.,  without  appeal  to  gradient 
expansions — which  is  a  must,  in  the  present  case. 
In  essence,  this  scheme  starts  out  from  the  obser¬ 
vation  that  long-range  correlations  in  the  presence 
of  the  self-consistent  electron  density  profile  at  the 
surface  are  naturally  included  into  the  self-energy 
given  by  Eq.  (10),  and  thus  into  the  quasiparticle 
states  given  by  Eq.  (9),  already  at  the  level  of  the 
RPA,  via  the  response  function  x^  The  idea  is  then 
to  feed  the  long-range  correlations  into 
through  their  insertion  into  X^fx.x';  E)  by  solving 
an  exact  integral  equation  [57]  which  relates  both 
"potentials."  From  the  solution  of  that  equation 
for  ^  nonlocal  relation  between  and  n 

was  subsequently  established  [56,  58];  this  relation 
allows  us  to  perform  electronic  surface  structure 
calculations  which  have  built  in  the  effects  of  the 
long-range  Coulomb  correlations,  and  thus  the  im¬ 
age  states,  absent  in  the  EDA. 

In  Figure  5  we  compare  the  quasiparticle  ampli¬ 
tude  '^^(qy,  Eqp;  z)  with  the  corresponding  Kohn- 
Sham  wave  function  ^y(q||,  z) — obtained  from  Eq. 
(11)  in  the  presence  of  the  "nonlocal"  obtained 
as  just  described.  The  figure  refers  to  the  n  =  1 
image  state  (resonance)  of  Al,  modeled  with 
jellium  with  =  2.07,  for  a  representative  value 
of  the  wave  vector  parallel  to  the  surface,  q^, 
and  energy  eigenvalue  Eqp/  for  the  quasiparticle 
state.  Clearly,  the  two  wave  functions  are  nearly 
identical. 

The  above  result  is  remarkable  in  view  of  the 
fact  that  the  physics  of  the  image  problem  is  con¬ 
trolled  by  correlation,  while  in  the  bulk  (of  a  semi¬ 
conductor),  for  which  a  similar  result  was  previ- 
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FIGURE  5.  Comparison  of  the  modulus  square  of  the 
quasiparticle  amplitude  ^Qp(qj|,  Eqr;  z)  and  the  density 
functional  wave  function  (/>^(q||,  z)  for  the  n  =  1  image 
state  (resonance)  of  Al,  modeled  by  a  jellium  surface 
with  =  2.07.  The  calculation  was  performed  for  Eqp  = 
0.3Ep,  and  Qjj  =  0.2/('p  [28]. 

ously  obtained  [59],  correlation  plays  a  lesser  role 
in  establishing  the  nature  of  the  one-electron  po¬ 
tential.  Since  in  Figure  5  the  Kohn-Sham  and 
quasiparticle  calculations  were  performed  at  the 
same  level  in  the  treatment  of  the  electron-elec¬ 
tron  interaction  (the  RPA),  we  feel  that  this  a 
posteriori  success  of  the  Kohn-Sham  wave  func¬ 
tions  may  be  general — i.e.,  it  will  hold  for  calcula¬ 
tions  which  derive  from  more  sophisticated  treat¬ 
ments  of  correlation  than  contained  in  Eq,  (10). 

Additionally,  the  quasiparticle  energy  shift,  that 
is,  the  difference  Egp  —  was  found  to  be  small. 
Specifically,  this  difference  was  found  to  be  ~  0.02 
eV  for  =  0,  which  should  be  compared  with  the 
binding  energy  of  the  state,  --  0.5  eV,  measured 
from  the  vacuum  level.  This  result  is  deemed  to  be 
typical  of  the  present  problem,  as  for  Al  the  quasi¬ 
particle  effective  mass  is  very  close  to  unity;  the 
fact  that  the  center  of  mass  of  the  image  state  is  far 
into  the  vacuum  (see  Fig.  5)  should  lead  to  an  even 
smaller  many-body  mass  enhancement. 

A  physical  interpretation  for  the  almost  perfect 
overlap  between  the  quasiparticle  amplitudes  and 
Kohn-Sham  wave  functions  for  the  image  states 
was  given  by  Deisz  et  al.  [28]  on  the  basis  of  an 
effective,  local  quasiparticle  potential, 
constructed  as  a  spatial  integral  over  the  electron 
self-energy,  weighted  by  the  quasiparticle  ampli¬ 


tude  [28].  It  was  found  that  the  relatively  weak 
nonlocality  of  Re  outside  the  surface  has  the 
consequence  that  is  well  approximated  by 

14^(2)  over  the  energy  range  corresponding  to  the 
image  states.  This  energy  domain  is  of  the  order  of 
4  eV  above  the  Fermi  surface. 

Of  course,  the  quasiparticle  spectrum  is  richer 
than  its  Kohn-Sham  counterpart,  since  the  former 
contains  the  physical  effect  of  the  damping  of  the 
excited  electronic  states.  Novel  features  of  the  en¬ 
ergy  dependence  of  the  quasiparticle  states  outside 
the  surface  have  been  found  [60];  in  particular,  the 
conventional  (E  —  Ep)^  scaling  of  Fermi  liquid 
theory  has  been  shown  to  hold  for  an  increasingly 
small  range  above  the  Fermi  level  as  the  quasipar¬ 
ticle  moves  out  into  the  vacuum.  In  other  words, 
when  the  quasiparticle  is  outside  the  surface,  large 
corrections  to  quadratic  scaling  are  obtained  for 
most  energies  for  which  such  scaling  holds  in  the 
bulk.  These  corrections  are  traced  to  the  suppres¬ 
sion  of  decay  channels  involving  one  electron  final 
states  above  the  Fermi  energy  [60]. 
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ABSTRACT _ _ _ 

We  study  the  intensity  and  polarization  of  light  emitted  during  slow  ion— atom  collisions, 
We  describe  the  nuclei  as  moving  along  classical  trajectories  while  the  electronic 
rearrangement  is  treated  using  time-dependent  molecular  orbitals.  The  intensity  of 
emitted  light  is  calculated  from  the  diatomic  time-dependent  dipole.  We  evaluate  the 
diatomic  dipole  matrix  elements  involving  Is,  2s,  and  2p  traveling  atomic  orbitals 
suitable  for  time-dependent  collision  studies.  We  calculate  the  intensity  and  the 
polarization  of  light  emitted  in  p  +  H(ls)  collisions  at  kinetic  energies  from  10  to  1000 
eV,  for  several  impact  parameters,  changing  over  time.  The  emitted  intensity  goes 
through  a  maximum  as  the  collision  energy  increases  and  lasts  between  10  and  1  fs;  the 
polarized  light  components  parallel  and  perpendicular  to  the  incoming  beam  direction 
show  pronoimced  dependences  on  impact  parameters  and  time.  ©  1996  John  Wiley  & 
Sons,  Inc. 


Introduction 

Several  contributions  have  shown  how  elec¬ 
trons  jump  between  two  nuclei  during  slow 
ion-atom  collisions  [1-7].  In  our  previous  work, 
we  developed  a  time-dependent  theory  of 
ion- atom  collisions  which  provides  details  of  the 
time  evolution  of  the  diatomic's  properties  [8-13]. 
Our  previously  calculated  Figure  1  [13]  presents 
the  target  H  Is-orbital  population  in  collisions 

*To  whom  correspondence  should  be  addressed. 


with  at  a  laboratory  projectile  kinetic  energy 
E  =  10  eV  and  impact  parameter  =  1.0  a.u.  One 
can  see  the  temporal  charge  rearrangement  that 
leads  to  an  oscillating  electric  dipole  for  the  colli¬ 
sion  complex,  which  does  therefore  emit  light. 

We  have  recently  presented  [14]  a  quantitative 
study  of  the  light  emission,  where  we  have  calcu¬ 
lated  its  time  evolution  and  spectral  distribution  in 
H'^+  H(ls)  collisions,  at  projectile  (R-")  energies 
of  10  and  100  eV  at  a  single  typical  impact  parame¬ 
ter,  and  we  have  analyzed  the  wavelength  X  =  c/v 
components  of  the  emitted  light  packet.  In  the 
present  work  we  calculate  the  intensity  and  polar¬ 
ization  of  emitted  light  integrated  over  the  light 
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detector  angles  for  varying  impact  parameters  and 
collision  energies,  and  as  functions  of  the  time 
lapsed  during  collisions.  The  calculations  involve 
the  evaluation  of  the  dipole  matrix  elements  for  Is, 
2  s,  and  2p  orbitals  over  time,  insofar  as  2  s  and  2p 
orbitals  are  important  for  collisions  occurring  at 
Lab  energies  of  the  order  of  100  eV  and  higher.  We 
consider  only  the  free-to-free  transition,  since  the 
radiative  recombination  leading  to  formation  of 
H2  has  been  found  to  have  small  cross  sections 
(below  about  10“^  a.u.,  compared  to  electron 
transfer  cross  sections  around  10^  a.u.)  due  to  the 
small  values  of  their  Franck-Condon  factors  [15, 
16]. 

We  adopt  a  semiclassical  treatment  with  the 
nuclei  moving  along  classical  trajectories,  and  we 
calculate  the  intensity  of  light  emitted  from  the 
classical  electromagnetic  flux.  The  expectation 
value  of  the  time-dependent  electric  dipole  is  ob¬ 
tained  from  the  electronic  density  matrix,  calcu¬ 
lated  in  a  basis  of  traveling  atomic  orbitals  previ¬ 
ously  introduced  in  the  calculation  of  collision 
cross  sections  [12,  13]. 

In  the  conventional  language  of  molecular  elec¬ 
tronic  spectra,  light  emission  leads  to  a  transition 
between  stationary  electronic  states  of  H  -I-  H^, 
which  are  coupled  to  the  nuclear  motions  and 
change  nuclear  trajectories.  In  our  treatment,  elec¬ 
tronic  and  nuclear  motions  are  coupled  over  time, 
and  we  can  describe  changes  of  trajectories  as  the 
electron  state  changes.  The  electronic  rearrange¬ 
ment  is  accurately  treated  by  a  quantal  (time-de¬ 
pendent  molecular  orbital)  calculation  which  has 


been  shown  to  give  correct  answers  for  cross  sec¬ 
tions  and  orbital  polarization  parameters  [12,  13, 
17].  Our  classical  description  of  the  electromag¬ 
netic  field  means  that  we  are  describing  photon 
emission  in  a  probabilistic  way,  with  the  absolute 
value  squared  of  the  field  giving  the  relative  prob¬ 
ability  of  emission  at  a  given  time. 

The  procedure  we  follow  here  may  be  com¬ 
pared  with  the  conventional  procedure  using  sta¬ 
tionary  molecular  states.  For  free-to-free  transi¬ 
tions,  these  states  are  first  generated  in  the  absence 
of  light  for  the  continuum  energies  and  the  elec¬ 
tronic  states  of  interest.  Then  the  transition  dipole 
is  calculated  between  stationary  states,  and  from 
this  one  obtains  emission  rates.  The  wavelengths 
and  intensities  of  emitted  light  vary  in  accordance 
with  the  distance  dependence  of  states  and  transi¬ 
tion  dipoles,  which  enter  the  rates  through 
Franck-Condon  factors.  For  the  H  -h  system, 
the  initial  state  has  the  electron  localized  at  one  of 
the  protons,  and  must  therefore  be  written  as  a 
combination  of  the  and  o;,  stationary  states  of 

.  In  our  procedure,  we  instead  generate  nonsta¬ 
tionary  (time-dependent)  states  chosen  initially  to 
localize  the  electron  at  one  of  the  protons.  The 
system  is  evolved  in  time,  again  in  the  absence  of 
light,  to  generate  a  time-dependent  dipole  which 
oscillates  due  to  the  electron  jumps  between  pro¬ 
tons.  This  takes  the  place  of  transitions  between 
the  ag  and  cr„  orbitals  of  HJ.  From  the  time-de- 
pendent  dipole,  it  is  possible  to  calculate  transient 
light  emission,  as  we  show  in  what  follows.  The 
Fourier  transform  of  the  emission  intensity  gives 


FIGURE  1.  Lowdin  population  of  the  target  1s  state  vs.  time  in  a.u.  in  a  10-eV  proton-hydrogen-atom  collision  at 
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the  distribution  of  emitted  frequencies  and  could 
be  rewritten  in  terms  of  stationary  electronic  states. 
The  present  time-dependent  calculation  provides 
instead  a  straightforward  procedure  for  transient 
emission.  The  energy  carried  away  by  an  emitted 
photon  must  be  small  compared  with  the  total 
energy  of  the  diatomic  system.  This  is  indeed  the 
case  for  H  4-  insofar  as  the  emitted  photon 
energies  are  below  about  5  eV,  as  found  from  the 
diatomic  potential  energy  differences  and  also  from 
Ref.  [14],  while  the  molecular  energies  we  consider 
go  from  1000  down  to  10  eV.  At  the  lower  value,  a 
more  accurate  treatment  would  include  the  elec¬ 
tromagnetic  energy  in  the  calculation  of  the  molec¬ 
ular  dynamics  from  the  outset,  but  this  would  not 
change  the  qualitative  description  at  our  low  ener¬ 
gies,  and  would  be  negligible  at  our  high  energies. 


INTEIVSITY  AND  POLARIZATION  OF  THE 

EiVlITTED  LIGHT 

Our  starting  point  is  the  eikonal/time-depen- 
dent  Hartree-Fock  (Eik/TDHF)  method  reported 
earlier  [12],  which  describes  the  time  evolution  of 
coupled  electronic  and  nuclear  motions  during  col¬ 
lisions.  We  are  interested  in  one-electron  collisions 
for  which  the  Fock  operator  reduces  to  the  one- 
electron  Hamiltonian  H  and  we  work  with  the 
eikonal/time-dependent  molecular  orbital 
(Eik/TDMO)  approach  [13].  This  formulation  is 
developed  in  terms  of  the  density  operator  and  its 
density-matrix  derived  in  a  basis  of  traveling 
atomic  orbitals.  In  the  next  step  the  electric  and 
magnetic  fields  of  the  oscillating  dipole  and  the 
Poynting  flux  vector  are  derived.  Integrating  the 
expression  for  the  intensity  of  light  emitted  into  a 
given  solid  angle  over  a  sphere  of  radius  (the 
distance  from  the  Lab  frame  region  to  the  light 
detector)  produces  the  intensity  4  of  light  emitted 
in  a  given  polarization  k.  Integration  of  4  over 
time  gives  the  polarization  energy  E^.. 


Eikonal  /  Time-Dependent  Molecular 
Orbital  Method 

We  express  the  electron  density  operator  as 

P  =  \ip}{ip\,  (1) 


atomic  orbitals 

t)  =  D^/r,  f)c^(f), 
f)  =  f)' 


(2) 


with  an  atomic  orbital  centered  at  proton 

position  R„,(t),  tn  =  a,  b,  for  the  electron  at  posi¬ 
tion  r,  and  with  the  electron  translation  factor 
T„,(r,f)  given  by 


T„,(r,  f)  =  explim 


vM)  ■  r 


fdt'vlU')/2 


(3) 


where  is  the  velocity  vector  of  proton  m,  is 
the  initial  time,  and  m,  is  the  electron  mass. 

Following  the  notation  used  for  molecular  or¬ 
bital  calculations  [18]  we  write  the  atomic  orbitals 
(AOs)  as  linear  combinations  of  Gaussian  func¬ 
tions 

/ 

y-1 

with  7  =  6,  where  is  the  distance  from  electron 
1  to  nucleus  m,  k  =  x,  y,  z  for  /  =  1.  The  Gaussian 
functions  we  have  used  are  (for  =  0), 


gsir,  a)  - 


\ 


3/4 

exp(  —  ar^), 

1/4 

^  exp(  — ar 


0, 


(5) 


with  x,y,  z,  and  the  exponential  coefficients 
I  and  expansion  coefficients  y  taken  from 
previous  work  [10,  13]. 

The  density  operator  in  the  basis  of  traveling 
atomic  orbitals  is 


p(f)  = 

fXV 

where  is  the  iixv)  element  of  the  density 
matrix  P,  which  is  given  by 

P^/f)  =  E  (7) 

occ  i 

For  one-electron  systems  the  TDHF  equation  for 
the  electronic  density  operator  becomes 


where  the  time-dependent  molecular  orbital  ifi(t)  Hp  -  pH  =  ih — .  (8) 

is  written  as  a  linear  combination  of  traveling 
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This  equation  is  coupled  to  the  equation  for  the 
proton  trajectories,  which  are  determined  by  the 
classical  Hamilton's  equations  for  the  system,  in 
terms  of  an  effective  Hamiltonian  average  over 
electronic  states  /  with  weight  rv,  [12], 

H  =  «H»  =  I2rc,tr(p,H) 

I 

dR  dH 


dt  <?R' 

Equations  (8)  and  (9)  are  solved  self-consistently 
to  generate  a  dynamically  screened  Coulomb  po¬ 
tential.  Since  we  take  the  collision  to  occur  in  the 
ix,  2)-plane,  the  internuclear  distance  is  given  by 


where  R,„  is  the  distance  from  the  nucleus  m  to 
the  origin.  Then,  the  Cartesian  components  of  tj 
are  Xj  =  X„,  +  x,„^,  =  Y„,  +  y„,j,  and  z,  = 

+  2,„i,  and  the  dipole  matrix  elements  are  as  in 
the  example  below: 

i  I 

+  E  E^^mls,;rf„ls,/  (15) 

2^a, 

where  use  was  made  of  the  property  of  the  Gauss¬ 
ian  functions 


R(t)  =  yiXpit)  -  XjiOf  -t  [Zp(f)  -  Z^U)]\ 


with  the  subscripts  P  and  T  referring  to  the  pro¬ 
jectile  and  target  nuclei,  respectively. 

Evaluation  of  the  Electric  Dipole 


^^s(“/)  =^p,(a;)/(2v^).  (16) 

The  other  matrix  elements  are  derived  in  a  similar 
way. 


Electric  and  Magnetic  Fields  and 
the  Poynting  Vector 


For  a  single  electron,  the  electronic  component 
of  the  electric  dipole  in  terms  of  the  electronic 
density  matrix  is 

D„(f)  =  <^(f)|D„|^(t)>  =  -  £ 

V 

(11) 

in  atomic  units. 

The  total  electric  dipole  D  is  given  by 

D  =  Dnud  +  Del  =  D(f;  (12) 

where  the  nuclear  component  of  the  electric  dipole 
is 

^nucl  ZjRjit)  +  ZpRp(t),  (13) 

with  Zj  and  Zp,  being  the  nuclear  charges  and 
Rj(t)  and  Rp(f)  being  the  trajectories. 

The  distance  from  an  electron  1  to  the  origin 
is  given  by 

l*!  ^  Rm  +  ^ntU  (14) 


We  assume  that  the  collision  occurs  in  the 
(x,  2)-plane,  with  the  initial  projectile  velocity  par¬ 
allel  to  the  +z-axis.  The  light  detector  (LD)  is 
located  at  vector  position  We  introduce  two 
sets  of  coordinates  to  describe  it:  (?'ld/ ®ld/ ^ld) 
in  a  coordinate  system  where  the  2 '  axis  is  kept  in 
the  direction  of  D(t^)  and  rotates  with  time  (©ld 
is  the  angle  between  the  dipole  and  the  light 
detector)  and  (rpo, ©^d/ ^ld)  the  Lab  frame 
[Fig.  2(a)].  In  Figure  2(b)  we  illustrate  the  dynam¬ 
ics  of  the  oscillating  dipole  with  the  dipole  orienta¬ 
tion  angle  0  being  defined  as  the  angle  between 
the  2 '  and  z-axes  in  the  Lab  frame. 

In  SI  units,  the  electric  E  and  magnetic  B  fields 
for  the  light  emitted  by  the  dipole  are  given  by 
[19] 

E(rLD/  ^)  ^  [^LD(i^r '  D(f^))n^  —  D(f^)|  (17) 
and 

B(rLD/  0  =  -  ["r  ^  D(f,)] ,  (18) 
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FIGURE  2.  (a)  Coordinate  systems  and  light  detector  positions  for  proton-hydrogen-atom  collisions:  (b)  similar  to  (a) 
including  the  nuclear  trajectories. 


where  (jlq  is  the  permeability  of  free  space,  rLo  is 
the  location  of  the  light  detector,  =  t  —  rLp/c 
and  n,  =  ttv/^LD- 

We  express  the  total  electric  dipole  as 

D  =  D,.n,.,  (19) 


where 


n,.  =  cos(0'LD)n,  -  sin(0LD)n0.,  (20) 

with  Hq.  the  unit  vector  in  the  direction  of  increas¬ 
ing  0'ld- 
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Using  Eqs.  (19)  and  (20)  we  can  rewrite  E  and  B 
as 


E(r 

B(r 


LD/ 


LD/ 


t)  =  ~Dit,) 
4tt 


t)  = 

Attc 


sin(0LD) 

^LD 

/  sinCQ'^p) 

\  ^LD 


Hq,,  (21) 

In^,,  (22) 


with  the  unit  vector  in  the  direction  of  increas¬ 
ing  ^LD- 

Then  the  Poynting  flux  vector  is  given  by 


1 

S(rLD,0  =  — (E  X  B) 
Mo 


Mo 


D(t,)] 


;  sin^  (©',„) 


‘LD- 


'  LD 


(23) 


We  need  S  in  the  Lab  frame,  where  we  use  the 
coordinate  system  (v,  y,  z).  We  find  the  detector 
angle  in  the  Lab  frame  in  terms  of  the  dipole 
orientation  angle  0(t;b,Vp)  =  arccos(D,/D)  (with 
+  D}),  and  the  detector  angles  ©^o  and 
Old,  by  recognizing  [from  Fig.  2(a)]  that 

U  ==  ^ldCOs(©ld). 

=  rLDsin(©LD)cos(OLD)/  (24) 

U'  ^  ^ld‘^C)s(©ld)  =  r^cosiO)  -h  r^sin(0), 

yields 

cos(©ld)  =  cos(©ld)cos(^) 

+  sin(©LD)cos(OLD)sin(0).  (25) 


Intensity  and  Polarization  of  Emitted 
Light 

In  this  work  we  are  interested  in  looking  at  the 
intensities  and  polarizations  of  light  emitted  by  an 
oscillating  dipole.  The  intensity  of  light  emitted 
into  solid  angle  by  a  dipole  component 

along  k  =  X,  z,  for  fixed  impact  parameter  h  and  a 
nearby  detector  (with  r^^y/c  ^  t)  is 

(26) 


Integration  over  with 


~  sin(©LD)  ^®LD 

yields 


4(t;  b,  Vp) 


1  2  Dj.it^b.Vp)]^ 

Attsq  3c^ 


(27) 


(28) 


We  obtain  the  energy  Ej^  emitted  per  collision, 
for  each  polarization,  by  integrating  over  time  the 
k  polarization  of  the  intensity  of  light  emitted  4‘ 


(29) 


Results  and  Conclusion 

We  consider  the  case  where  the  light  has  been 
collected  over  all  angles  and  calculate  the  inte¬ 
grated  intensity  given  by  Eq.  (29). 

Figures  3  and  4  present  results  for  4  and  L  vs. 
time  vs.  impact  parameter  for  protons  of  energy 
10.0  eV.  Figures  5  and  6  present  similar  results  for 
100.0  eV.  The  lowest  b  value  shown  is  =  0.1  a.u., 
and  the  others  are  multiples  of  0.5  a.u. 

From  Figures  3  and  4,  we  observe  that  the  x 
polarization  (perpendicular  to  the  velocity  of  the 
incoming  proton)  of  the  intensity  of  emitted  light 
integrated  over  time  goes  through  a  maximum  for 
impact  parameters  b  around  1.0  a.u.  and  has  a 
very  small  value  for  head-on  collisions  near  b  =  0.0 
a.u.  The  integrated  intensity  with  z  polarization 
(parallel  to  the  incoming  proton  velocity)  is  on  the 
other  hand  largest  at  b  =  0,  and  then  falls  with 
increasing  b  values.  The  light  emitted  over  time  is 
found  to  appear  only  during  the  collision  and  lasts 
about  400  a.u.  or  about  10  fs.  Figures  5  and  6  show 
similar  dependences  with  impact  parameter  and 
time,  but  the  duration  of  light  emission  is  shorter, 
as  we  would  expect  for  a  faster  collision,  and  show 
transient  emission  lasting  about  2.5  fs. 

To  obtain  the  magnitudes  of  the  transient  emis¬ 
sion,  we  calculated  the  average  total  light  energy 
emitted  per  collision  for  the  impact  parameters  of 
1.0  and  1.5  a.u.  by  integrating  over  time.  Results 
are  presented  in  Table  I,  for  the  x  and  z  polariza¬ 
tions  and  also  for  their  sum.  Our  calculations, 
done  in  atomic  units,  have  been  converted  to  Joules 
per  mole  to  give  an  estimate  of  the  amount  of 
energy  emitted  at  each  impact  parameter.  This 
corresponds  to  a  given  deflection  angle  into  the 
detector,  and  would  have  to  be  integrated  over 
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Intensity  [x  1(yX-8)]  (a.u.) 


FIGURE  3.  The  x  component  of  the  intensity  vs.  time  vs.  impact  parameter  for  H(1s)  collisions  for  a  collision 
energy  of  10  eV. 


angles  to  obtain  the  total  light  energy  emitted  over 
all  angles  for  all  times.  For  a  single  trajectory,  we 
find  that  the  magnitude  of  light  emission  appears 
to  go  through  a  maximum  as  collision  energy 
increases  from  10  to  1000  eV;  this  is  observed  for 
the  two  impact  parameters  we  include  in  Table  I 
and  appears  to  be  a  general  feature  of  the  transient 
emission.  The  values  in  Table  I  can  be  used  to 
estimate  the  number  of  emitted  photons  at  each 
energy.  For  typical  light  emission  energy  of  1  J /mol 
and  photon  energy  of  4  eV  =  6.40  X  10"^^  J,  the 
number  of  photons  emitted  by  1  mol  of  H  + 
would  be  1.56  X  10^®.  A  beam-gas  experiment  at 


one  of  the  given  energies  and  impact  parameters 
would  give  a  somewhat  lower  number  of  photons, 
while  in  a  crossed-beam  experiment  the  number 
would  be  much  lower;  but  in  any  case  it  appears 
to  us  that  there  would  be  enough  emitted  photons 
to  experimentally  detect  the  transient  emission. 

In  conclusion,  we  have  characterized  the  nature 
of  transient  light  emission  with  a  computational 
approach  which  proved  before  to  be  reliable  for 
collisional  properties.  Light  emission  following 
ion-atom  collisions  has  been  observed  and  mea¬ 
sured  before,  but  to  our  knowledge  it  has  not  been 
studied  at  our  low  collisions  energies,  for  which 


Intensity  [x  10^(“8)]  (a.u.) 
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Intensity  [x  10^(-8)]  (a.u.) 


FIGURES.  The  X  component  of  the  intensity  vs.  time  vs.  impact  parameter  H ■^+  H(1s)  collisions  for  a  collision  energy 
of  100  eV. 


Intensity  [x  10^(-8)]  (a.u.) 


TABLE  I 


Light  energy  emitted  per  collision  for  H^+  H(1s)  collisions  for  impact  parameters  b  = 
integrated  over  time. 

1.0  a.u.  and  b  = 

1.5  a.u., 

Impact 

parameter 

(a.u.) 

Emission 
energy 
(J  /  mol) 

Collision  energies 

(10  eV) 

100  eV 

500  eV 

700  eV 

1000  eV 

1.0 

fx 

0.460 

1.578 

0.402 

0.335 

0.393 

E, 

0.578 

0.890 

0.834 

0.758 

0.692 

E,  +  E, 

1.038 

2.468 

1.236 

1.093 

1.085 

1.5 

E. 

0.514 

0.889 

0.361 

0.298 

0.271 

E, 

0.227 

0.288 

0.309 

0.310 

0.306 

E,  +  E, 

0.741 

1.177 

0.670 

0.608 

0.577 
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repeated  electron  transfer  is  present.  Light  emis¬ 
sion  due  to  polarization  of  target  or  projectile 
atoms  in  slow  collisions  has  been  theoretically 
studied  before  and  has  been  labeled  atomic 
bremsstrahlung  [20],  Light  emission  in  our  case 
comes  instead  not  from  transient  atomic  states  but 
from  transient  molecular  states  resulting  from  the 
temporary  sharing  of  electrons,  and  therefore  has  a 
different  distribution  of  intensities  over  time  and 
wavelengths.  The  present  study  indicates  that 
transient  light  emission  should  appear  in  many 
collision  phenomena  involving  electronic  rear¬ 
rangement,  and  provides  an  estimate  of  the  prop¬ 
erties  of  the  emitted  light. 
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ABSTRACT _ - _ 

A  hybrid  quantum/classical  model  is  developed  for  the  photodissociation  and 
recombination  dynamics  of  an  1 2  molecule  in  low-temperature  rare-gas  (Rg)  matrices. 
The  simplified  model  consists  of  an  I2  molecule  embedded  in  a  linear  chain  of  Rg  atoms. 
The  aggregate  is  partitioned  into  a  quantum  system  and  a  classical  bath,  which  are 
self-consistently  coupled.  Two  partitioning  schemes  are  used.  The  first  treats  the  I-I 
coordinate  quantum  mechanically  and  the  Rg  coordinates  classically.  The  second  and 
more  reliable  scheme  includes  in  the  quantum  system  both  the  I-I  mode  and  the 
symmetric  motion  of  the  two  nearest  Rg  atoms.  Both  models  show  substantial  energy 
transfer  from  the  dissociating  1 2  to  the  solvent,  followed  by  coherent  vibrational  motion 
of  the  recombined  1 2-  It  is  found  that  the  one-dimensional  quantum/classical  scheme  is 
consistent  with  its  higher  dimensional  counterpart,  although  the  latter  shows  much  faster 
dephasing.  ©  1996  John  Wiley  &  Sons,  Inc. 


I.  Introduction 

In  many  extended  systems,  a  complete  quan¬ 
tum-mechanical  characterization  of  dynamics 
is  formidable  and  often  unnecessary.  However, 
there  is  ample  evidence  indicating  that  a  pure 

*  To  whom  correspondence  should  be  addressed. 


classical  treatment  is  not  always  sufficient.  To  ad¬ 
dress  this  dilemma,  it  is  common  to  partition  the 
aggregate  into  a  system  and  the  surrounding  bath 
[1].  The  system  is  usually  defined  as  a  collection  of 
several  modes  where  the  primary  dynamics  takes 
place,  while  the  bath  consists  of  the  less  involved 
modes  coupled  to  the  system.  Such  a  separation 
represents  a  good  approximation  when  the  system 
and  bath  are  weakly  coupled  and  when  their  fre¬ 
quencies  are  significantly  different. 
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A  commonly  used  formulation  of  the  system- 
bath  separation  is  the  so-called  hybrid  classical/ 
quantum  scheme  [2].  In  this  method,  the  dynamics 
of  the  system  is  described  by  a  quantum  wave 
packet  while  that  of  the  bath  is  by  classical  trajec¬ 
tories.  The  system  and  the  bath  are  coupled  self- 
consistently  via  the  Ehrenfest  theorem  [3].  Namely, 
the  system  is  subjected  to  an  instantaneous  mean- 
field  determined  by  the  bath  and  vice  versa.  En¬ 
ergy  exchange  between  the  system  and  bath  is 
allowed,  albeit  indirectly.  This  approach  focuses 
on  the  primary  dynamics  in  the  system  without 
neglecting  the  influence  of  the  bath.  It  is  numeri¬ 
cally  efficient  and  amenable  to  large  and  weakly 
coupled  systems. 

It  should  be  realized  that  the  aforementioned 
scheme  can  be  considered  as  the  quantum/classi- 
cal  version  of  the  so-called  time-dependent  Hartree 
(TDH)  or  self-consistent  field  (TDSCF)  approxima¬ 
tion  [2,  4-6],  in  which  the  total  wave  function  is 
assumed  to  be  separable.  Thus,  the  hybrid  method 
inherits  the  deficiencies  of  the  TDH  approxima¬ 
tion,  in  addition  to  the  classical  approximation.  A 
major  concern  [7]  of  the  hybrid  approach  is  the 
separability  of  the  quantum-classical  subsystems. 
In  this  work,  we  attempt  to  address  this  issue  in  a 
well-defined  simple  system,  i.e.,  the  photodissocia¬ 
tion  and  recombination  of  1 2  in  a  linear  rare-gas 
(Rg)  matrix. 


tion  of  the  1 2  moiety  is  very  slow  [14].  On  the 
other  hand,  the  neglect  of  solvent  atoms  in  three 
dimensions  renders  the  LC  model  a  qualitative 
one.  Thus,  no  direct  comparison  with  experiments 
is  attempted.  The  I-I  interaction  potentials  for  the 
X  and  A  states  are  given  in  the  Morse  form  while 
the  I-Rg  and  Rg-Rg  interactions  are  represented 
by  Lennard-Jones  6-12  potentials.  Only  the  nearest 
interactions  are  included  in  the  model.  The  param¬ 
eters  of  the  potentials  are  given  elsewhere  [13]. 
The  potentials  of  the  1 2  system  are  displayed  in 
Figure  1. 


III.  Hybrid  Oiiantiim/Classical 
Methods 

In  order  to  facilitate  a  direct  comparison  be¬ 
tween  the  two  quantum/classical  models  used 
here,  we  restrict  ourselves  to  a  matrix  temperature 
of  0  K.  Under  such  idealized  conditions,  the  I2 
center-of-mass  motion  can  be  ignored  and  the  rare- 
gas  atoms  oscillate  symmetrically  relative  to  the 
center  of  the  chain.  It  has  been  shown  previously 
that  the  dynamics  at  a  finite  but  low  temperature 
does  not  change  qualitatively  [12]. 

The  hybrid  quantum/classical  approach  consid¬ 
ers  the  total  Hamiltonian  as  the  sum  of  terms 
representing  the  system,  the  bath,  and  the  interac¬ 
tion  between  them: 


II.  Linear  Chain  Model 


H  =  H,  -h  H,  +  H,,  (1) 


In  this  work,  we  are  primarily  concerned  with 
the  photodissociation  and  recombination  of  I2  em¬ 
bedded  in  a  cryogenic  rare-gas  (Kr)  matrix.  This 
system  (1 2-solvent)  has  been  extensively  investi¬ 
gated  and  served  as  a  prototype  for  condensed 
phase  bond  breaking  processes  [8].  With  a  fem¬ 
tosecond  pump/probe  scheme,  coherent  vibra¬ 
tional  motion  of  the  recombined  I2  has  been  ob¬ 
served  for  a  few  picoseconds  after  the  initial  exci¬ 
tation  to  the  A  state  [9,  10].  Experimental  evidence 
also  indicated  that  the  caging  of  I2  in  its  A  state  is 
complete.  Our  simplified  model  [11-13]  used  here 
is  loosely  based  on  the  l2“Rg  system.  In  this  so- 
called  linear  chain  (LC)  model,  an  1 2  molecule  is 
embedded  in  a  one-dimensional  aggregate  of  100 
rare-gas  atoms,  as  shown  in  Figure  1.  The  LC 
model  is  expected  to  be  a  reasonable  approxima¬ 
tion  for  the  short-time  dynamics  because  the  rota- 


The  quantum  wave  packet  of  the  system  is  the 
solution  of  the  time-dependent  Schrodinger  equa¬ 
tion,  which  can  be  written  for  a  two-state  problem 
as  follows  (h  =  1): 


^1 

<Pl 

H21 

H^2_ 

(Pz 

where  (/  =  !,  2)  represent  the  nuclear  wave 
functions  on  the  two  adiabatic  electronic  surfaces 
corresponding  to  the  X  and  A  states,  respectively. 
The  off-diagonal  terms  in  the  Hamiltonian  matrix 
denote  radiative  couplings  responsible  for  the  exci¬ 
tation,  which  are  modeled  in  this  work  by  a  short 
temporal  function  with  a  central  frequency: 

Hi2  =  H*,  =  (3) 
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FIGURE  1.  Upper  panel:  one-dimensional  potential  energy  curves  for  the  I -I  coordinate.  The  pump  scheme  is 
indicated  by  a  vertical  bold  arrow.  The  presence  of  the  solvent  wall  (dashed  line)  prevents  complete  dissociation.  Lower 
panel:  geometry  of  the  linear  chain  model  for  I2  embedded  in  rare-gas  (Rg)  atoms.  The  two  boxes  indicate  the  quanturn 
system  in  the  two  hybrid  quantum /classical  schemes  used  in  this  work.  The  upper  labels  are  for  the  1 DQ/C  model  and 
lower  ones  are  for  the  2DQ/C  model. 


The  equations  of  motion  for  the  classical  bath  are 
given  below  in  Hamilton's  form: 

dt  dpi  ’  dt  dqj 

The  system  is  initially  on  the  ground  state  and 
the  excitation  occurs  in  the  time  span  of  the  excita¬ 
tion  pulse.  The  quantum  wave  packet  of  the  sys¬ 
tem  is  propagated  using  a  short  iterative  Lanczos 
method  [15].  The  fast  Fourier  transform  method 
[15]  is  used  to  evaluate  the  action  of  the  Hamilto¬ 
nian.  The  classical  equations  are  solved  using  the 


velocity  Verlet  method  [16].  The  quantum  and 
classical  equations  of  motion  are  integrated  simul¬ 
taneously.  The  self-consistent  propagation  allows 
energy  flow  between  the  quantum  and  classical 
modes. 

A.  ONE-DIMENSIOINAL  OEANTLM  /  CLASSICAI- 

(IDO/C)  MODEL 

In  this  model,  only  the  dynamics  along  the  I-I 
internuclear  distance  is  treated  quantum  mechani¬ 
cally  and  the  rare-gas  atoms  are  described  classi¬ 
cally.  The  quantum  system  is  illustrated  in  the 
lower  panel  of  Figure  1  by  the  inner  box.  The 
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Hamiltonians  for  the  system,  the  bath,  and  the 
system/bath  interaction  are  given  below: 

p2 

Hs  =  -  I-  Vi.jfX),  (5a) 

^Ai-r 

100  „2 

H;,  =  Z!  f  (5b) 

/=i  2 in 

H.,  =  21/,.Kg(X,^),  (5c) 

where  m  and  are  the  mass  of  Rg  and  the 
reduced  mass  of  I2,  respectively.  The  quantum 
degree  of  freedom  X  is  the  I-I  internuclear  dis¬ 
tance  while  the  collective  bath  coordinate  q  de¬ 
notes  the  classical  displacements  of  the  100  Rg 
atoms  in  the  matrix.  The  effective  Hamiltonians  for 
the  system  and  the  bath  are  given  below: 

p2 


Hf  =  L  ^  +  V,iq)  +  2<V,_^^(X,  q)}^.  (6b) 
/.m 

For  the  quantum  system,  the  mean-field  potential 
is  determined  by  the  instantaneous  positions  of  the 
classical  particles  in  the  bath  {q).  For  the  bath 
particles,  however,  the  effective  potential  is  aver¬ 
aged  over  the  quantum  mode  according  to  the 
Ehrenfest  theorem  [3]: 


98  p2 

Hfc  =  E  +  VM),  (8b) 

,=  1  2  m 

~  2yi^g_j^g/(  Y,  ^'),  (8c) 

where  Y  =  q^  —  qi  (q^  and  q^  are  the  displace¬ 
ments  of  the  two  rare-gas  atoms  nearest  to  the  I2 
moiety)  and  P-Rg-Rg  is  the  corresponding  reduced 
mass.  Note  that  in  this  model  the  bath  has  98 
atoms  and  the  system-bath  interaction  is  between 
two  solvent  atoms  (Rg-Rg').  Similarly,  the  effec¬ 
tive  Hamiltonians  for  the  system  (Ij  •  Rg2)  and  the 
bath  (98  Rg')  are 


Hf 


2p,[_i  2p,,jg_Rg 


+  v,_,(x)  +  2y,_Kg(x,y) 

+  2<V"Rg-Rg'(Y,  ^')>y  (9a) 


H 


eff 


98 

E 

)=i 


IL 

2  m 


+  Vb^q') 


+  2<V'j^g_jjg.(Y,  (j'))x,y .  (9b) 

Note  that  the  I-Rg  interaction  is  explicitly  taken 
into  account  in  the  system  Hamiltonian.  The  effec¬ 
tive  potentials  are  treated  in  a  similar  manner  to 
the  IDQ/C  system.  For  example,  the  mean-field 
potential  for  the  classical  bath  is  as  follows: 


^^Rg-Rg'(^'  q'^')x,Y 


<l/,_Rg(X,^)>x 

=  f^dX<p*iX,t)V,_^^iX,q)<p(X,t).  (7) 

The  mean-field  potential  is  only  calculated  for  the 
excited  state. 


B.  TWO-DIME^ISIOMAL  OUANTUM/ 

CLASSICAL  (2DO/C)  MODEL 

In  this  more  elaborate  model,  the  symmetric 
motion  of  the  two  rare-gas  atoms  nearest  to  the  I2 
moiety  is  included  in  the  quantum  system,  as 
shown  by  the  outer  box  in  the  lower  panel  of 
Figure  1.  The  classical  bath  includes  all  the  remain¬ 
ing  rare-gas  atoms.  The  corresponding  Hamiltoni¬ 
ans  are 


2ai-i  2p.Rg_Rg 

+  R,_,(X) +  2y,_Rg(X,y),  (8a) 


TdX  r dY<p*(X,Y,t) 

•'0  ■'0 

x^Rg-Rg'(^.^)<p(x,y,t),  (10) 


where  Rg  denotes  a  rare  gas  atom  in  the  system 
and  Rg'  its  nearest  neighbor  in  the  bath. 

The  idea  of  explicitly  including  in  the  system 
some  strongly  coupled  bath  modes  is  not  entirely 
new  [17,  18].  The  2DQ/C  model  proposed  here 
removes  some  crucial  deficiencies  in  the  IDQ/C 
model.  It  treats  the  dynamics  between  the  I2  and 
its  adjacent  rare-gas  atoms  exactly  and  quantum 
mechanically.  The  2DQ/C  method  is  thus  ex¬ 
pected  to  yield  much  more  reliable  results  and 
serves  here  as  a  benchmark  to  assess  the  validity 
of  the  IDQ/C  model. 


IV.  Results 

We  concentrate  here  on  the  comparison  between 
the  IDQ/C  and  2DQ/C  models  for  the  l2-Krjoo 
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system.  A  734-nm  laser  pulse  with  a  width  of  100 
fs  is  used  to  excite  the  system  from  its  groxmd 
state  to  the  excited  A  state.  Figure  2  displays  the 
averaged  I-I  distance  (a)  and  the  energy  (b)  of  the 
system  calculated  from  the  two  models.  Starting^at 
the  grormd-state  equilibrium  geometry  (~  2.8  A), 
the  I-I  distance  increases  rapidly  during  the  initial 
phase  of  dissociation,  owing  to  the  strong  repul¬ 
sive  potential  of  the  A  state.  It  quickly  reaches  its 
maximum  at  ~  4.2  A  in  a  few  hundred  femtosec¬ 
onds.  Because  of  the  "cage  effect,"  however,  the 
dissociation  of  1 2  is  blocked  by  the  solvent.  Subse¬ 
quently,  the  1 2  moiety  oscillates  near  a  new  equi¬ 
librium  geometry.  T^e  agreement  between  the 
IDQ/C  and  2DQ/C  models,  as  shown  by  Figure 
2(a),  is  generally  satisfactory.  Quantitatively,  the 
2DQ/C  model  (dotted  line)  shows  stronger  and 
more  regular  oscillations  in  the  first  1  ps  and  more 
pronounced  dephasing  at  long  times  (  >  2  ps).  The 
dephasing  can  be  attributed  to  the  diffusive  wave 
packet  in  the  two-dimensional  phase  space  [13]. 

The  caging  of  the  highly  energetic  fragments  is 
accompanied  by  drastic  energy  transfer  from  the 


0  1000  2000  3000  4000 

Time  (fs) 


fragments  to  the  solvent  atoms.  It  can  be  seen  from 
Figure  2b  that  the  energy  loss  in  the  IDQ/C  model 
occurs  in  the  first  ~  300  fs.  It  takes  a  little  longer 
to  achieve  similar  energy  loss  in  the  2DQ/C  model 
because  the  system  now  includes  two  neighboring 
Kr  atoms.  Quantitatively,  the  amount  of  energy 
transferred  to  the  bath  is  somewhat  larger  in  the 
2DQ/C  model  (dotted  line).  This  difference  may 
be  attributed  to  the  inclusion  of  the  I-Rg  interac¬ 
tion  in  the  quantum  system,  which  provides  a 
better  description  of  the  energy  transfer  from  1 2  to 
its  adjacent  Rg  atoms.  Concurrent  with  the  1 2  en¬ 
ergy  loss,  the  energy  of  the  rare-gas  matrix  in¬ 
creases  (not  shown  here)  and  the  total  energy 
(system  plus  bath)  is  conserved.  The  kmetic  energy 
of  the  rare-gas  atoms  is  propagated  outward  in  the 
solvent  chain  as  a  shock  wave  [11—13].  After  the 
initial  energy  loss  due  to  the  fragment-solvent 
collision,  further  vibrational  relaxation  becomes 
less  efficient  due  to  the  oversimplification  of  the 
LC  model. 

Similar  observations  have  been  obtained  in 
three-dimensional  classical  molecular  dynamics 
studies  of  the  same  system  by  Martens  and  co¬ 
workers  [9,  10].  However,  it  should  be  pointed  out 
that  the  quantum  coherence,  which  stems  from  the 
interference  between  coherently  populated  vibra¬ 
tional  states,  is  fundamentally  different  from  the 
classical  vibrational  oscillation.  The  retention  of 
the  vibrational  coherence  after  significant  energy 
transfer  is  due  to  the  impulsive  nature  of  the 
collision  and  to  the  low  temperature  of  the  matrix. 

The  wave  packet  motion  in  the  I-I  degree  of 
freedom  is  presented  for  the  two  models  in  Figure 
3.  In  the  2DQ/C  model,  the  one-dimensional  re¬ 
duced  density  is  defined  as  follows: 


Time  (fs) 


=  j  dY(p*(X,Y,t)(p(X,r,0.  (11) 


Overall,  the  two  methods  give  qualitatively  simi¬ 
lar  results,  even  at  very  long  times  (~  2  ps).  In 
order  to  quantitatively  compare  the  two  models, 
we  calculate  the  overlap  between  the  moduli  of 
the  two  wave  packets: 


<1^1DQ/C||^2DQ/C|> 

C  =  — — =========^  / 

y/(  I  ^IDQ/C  )<  ^2DQ/C  |  ^2DQ/C  ) 


FIGURE  2.  Averaged  I-I  distance  (a)  and  averaged  where  the  modulus  is  defined  as 

energy  of  the  system  (b)  as  a  function  of  time.  The  solid 

and  dotted  lines  are  the  IDQ/C  and  2DQ/C  models,  (13) 

respectively. 
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The  denominator  in  Eq.  (12)  is  used  to  normalize 
the  wave  functions  in  both  models.  This  is  neces¬ 
sary  because  the  population  on  the  A  state  is  not 
normalized  and  is  a  function  of  time  during  pump)- 
ing.  Since  we  are  primarily  interested  in  the  dy¬ 
namics  in  coordinate  space,  rather  than  in  energy 
space,  the  phase  factors  of  the  wave  packets  are 


ignored.  The  overlap  is  displayed  in  Figure  4.  At 
the  center  of  the  pump  pulse  (f  =  0),  the  overlap 
between  the  two  models  is  very  close  to  unity. 
Subsequently,  the  overlap  deteriorates  as  the  colli¬ 
sion  between  the  iodine  fragments  and  the  nearby 
rare-gas  atoms  takes  place.  It  recovers  near  1300  fs 
where  the  wave  packet  again  reaches  the  outer 


FIGURE  3.  Snapshots  of  the  I -I  wave  packet  motion  of  the  la-Kr^jo  system  for  the  1DQ/C  and  2DQ/C  models. 


1484 


QUANTUM  CHEMISTRY  SYMPOSIUM  NO.  30 


PHOTODISSOCIATION  AND  RECOMBINATION  OF  12(A) 


FIGURE  4.  Overlap  between  the  moduli  of  the  wave 
packets  for  the  two  models  (see  text). 


turning  point.  Beyond  2000  fs,  the  overlap  between 
the  IDQ/C  and  2DQ/C  models  becomes  stabi¬ 
lized  near  ~  80%. 

The  above  comparison  of  the  two  models  clearly 
illustrates  that  the  validity  of  the  IDQ/C  model 
depends  very  much  on  the  type  of  quantities  to  be 
extracted  from  the  dynamics.  The  IDQ/C  model  is 
certainly  capable  of  capturing  some  major  dynami¬ 
cal  features  of  the  dissociation-recombination  pro¬ 
cess,  such  as  the  impulsive  energy  transfer  be¬ 
tween  the  dissociating  fragments  and  the  solvent 
atoms  and  the  retention  of  the  vibrational  coher¬ 
ence  after  the  collision.  However,  it  fails  to  de¬ 
scribe  the  destruction  of  the  coherence  on  a  rela¬ 
tively  long  time  scale.  This  failure  of  the  IDQ/C 
model  in  this  aspect  can  be  directly  related  to  the 
mean-field  approximation  that  restricts  the  I-I 
wave  packet  to  one  dimension. 


V,  Conclusions 

In  this  work,  we  have  examined  the  validity  of 
the  hybrid  quantum/classical  methodology  in 
studying  the  photodissociation  and  recombination 
dynamics  of  I2  in  rare-gas  matrices.  The  validity  of 
the  system-bath  separation  is  explored  by  two 
models  within  the  linear  chain  approximation.  The 
simpler  model  (IDQ/C)  treats  the  I-I  coordinate 
quantum  mechanically  and  the  remaining  bath 
modes  classically.  This  method  is  often  used  in 
condensed  phase  systems.  The  more  sophisticated 
model  (2DQ/C)  includes  in  the  system  both  the 
I-l  mode  and  the  symmetric  mode  of  the  two 


adjacent  rare-gas  atoms.  It  provides  a  better  repre¬ 
sentation  of  the  d3mamics  because  of  the  exact 
treatment  of  the  I-Rg  interaction. 

The  IDQ/C  model  is  shown  to  reproduce  the 
significant  energy  transfer  between  the  dissociat¬ 
ing  fragments  and  the  solvent  atoms.  It  also  shows 
coherent  wave  packet  motion  after  the  energy 
transfer.  However,  this  model  gives  a  relatively 
poor  representation  of  the  dephasing  process. 
Overall,  the  IDQ/C  model  appears  to  provide  a 
reasonably  accurate  representation  of  the  short- 
time  dynamics. 

The  hybrid  quantum / classical  method  based  on 
the  time-dependent  Hartree  approximation  com¬ 
bines  a  quantum  treatment  of  the  system  with  an 
approximated  classical  characterization  of  the  bath. 
When  applied  prudently,  it  represents  an  efficient 
and  useful  tool  in  studying  chemical  processes 
with  large  dimensionalities.  It  should,  however,  be 
borne  in  mind  that  this  method  involves  a  number 
of  potentially  fatal  approximations.  It  works  best 
when  the  system— bath  coupling  is  weak  and  the 
mass  of  the  bath  atoms  is  heavy.  Its  validity  deteri¬ 
orates  with  time  so  that  the  long-time  dynamics  is 
unreliable.  Highly  averaged  attributes  are  usually 
more  trustworthy  than  state-resolved  quantities. 
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ABSTRACT _ _ _ _ _ _ 

Minimal  basis-set  (STO-3G)  direct  and  Fourier  space  restricted  Hartree-Fock  (RHF) 
calculations  on  the  infinite  regular  (metallic)  and  alternating  chains  of  lithium  molecules, 
(-Lij-);,,  are  reported  to  illustrate  two  advantages  of  the  Fourier  representation  method: 
accurate  calculation  of  all  lattice  summations  and  faithful  reproduction  of  the  genuine 
features  of  the  RHF  approach  for  metallic  cases.  ©  1996  John  Wiley  &  Sons,  Inc. 


Introduction 

Restricted  Hartree-Fock  (RHF)  linear  combi¬ 
nation  of  atomic  orbitals  (LCAO)  calcula¬ 
tions  on  extended  chains  with  one-dimensional 
periodicity  [1]  have  become  routine.  Most  of  the 
numerical  and  algorithmic  difficulties  associated 
with  the  study  of  these  systems  are  imderstood 
now  (basis-set  extension  and  possible  linear  de¬ 
pendencies,  lattice  summations,  self-consistent 
field  (SCF)  convergence,  computational  effort,  etc.) 
and,  if  not  yet  fully  controllable,  the  possible  nega¬ 
tive  effects  on  the  quality  of  the  results  have  been 

*To  whom  correspondence  should  be  addressed. 


pointed  out.  It  is  interesting  to  note  that  essentially 
all  practical  calculations  in  this  area  are  carried  out 
in  the  configuration  or  direct  space  (DS)  approach. 

In  the  DS  approach,  the  Coulomb  and  exchange 
contributions  are  treated  on  a  different  footing  [2]. 
The  Coulomb  (Madelung)  sums  are  computed 
generally  using  a  multipole  expansion  of  the  two- 
electron  integrals  in  the  asymptotic  regime  of  small 
overlap  of  the  interacting  charge  distributions.  The 
convergence  of  the  sununations  of  the  exchange 
contributions  is  fixed  by  the  decay  of  the  LCAO- 
density  matrix  elements  making  the  implementa¬ 
tion  of  a  multipole  expansion-based  algorithm 
more  difficult.  At  present,  the  exchange  lattice 
summations  are  still  evaluated  on  a  trial-and-error 
basis  by  repeated  calculations  with  varying  num¬ 
bers  of  interacting  cells  to  assess  the  convergence. 
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If  acceptable  in  the  case  of  insulators  and  semicon¬ 
ductors,  the  procedure  cannot  be  applied  realisti¬ 
cally  to  systems  characterized  by  partial  band  oc¬ 
cupancy.  Indeed,  to  reproduce  the  genuine  fea¬ 
tures  of  the  HF  description  of  the  metallic  situation 
[3]  (notwithstanding  its  physically  undesirable 
character)  the  number  of  interacting  cells  to  be 
taken  into  account  would  be  prohibitively  expen¬ 
sive.  This  surely  explains  that  the  vast  majority  of 
the  reported  applications  are  on  insulators  and 
semiconductors. 

The  Fourier  space  (FS)  representation  [4]  is  a 
promising  direction  to  investigate  for  the  evalua¬ 
tion  of  both  the  Coulomb  and  exchange  lattice 
sums  on  a  consistent  basis.  Very  recently,  it  has 
been  shown  that  combining  the  Poisson  summa¬ 
tion  formula  and  the  Ewald  method  in  the  context 
of  gaussian  basis  sets  [5]  can  be  used  to  compute 
accurately  these  summations.  Practical  implemen¬ 
tation,  however,  requires  the  development  of  effi¬ 
cient  algorithms  for  the  two-electron  integrals.  The 
purpose  of  this  work  is  to  compare  the  results 
obtained  for  the  linear  chain  of  lithium  molecules 
treated  in  a  basis  of  s-type  gaussian  functions  by 
the  FS  and  DS  approaches. 

The  article  is  planned  as  follows:  In  the  theoreti¬ 
cal  section  the  model  system  is  described  briefly. 
Also  introduced  in  this  section  are  the  basic  RHF 
equations  that  will  be  needed  and  some  numerical 
aspects  to  help  in  appraising  the  difference  be¬ 
tween  the  DS  and  FS  methods.  In  the  result  section 
we  compare  numerical  results  on  the  lithium  chains 
obtained  with  both  approaches.  The  work  ends 
with  a  few  remarks  on  future  developments. 


Theoretical  Considerations 

MODEL  SYSTEM 

The  model  system  used  for  the  present  compari¬ 
son  is  the  linear  chain  of  lithium  molecules  with 
=  6)  electrons  per  unit  cell  distributed  along 
the  2  axis,  which  is  chosen  as  the  direction  of 
periodicity  (Fig.  1).  The  interatomic  distance  in  the 
unit  cell  is  kept  fixed  and  is  equal  to  5.5  a.u., 
which  is  close  to  the  ab  initio  HF  value  (5.52  a.u.) 
obtained  for  the  Li  2  molecule  with  the  STO-3G 
basis  set  using  the  Gaussian  92  program  [6].  In  this 
work  three  values  of  the  unit  cell  length  have 
been  considered,  11.0,  12.0  and  13.0  a.u.,  to  cover 
situations  ranging  from  an  even  distribution  of 
atoms  to  cases  of  increasing  alternancy.  In  a  RHF 
description  it  must  be  pointed  out  that  to  a  (-Li2-)x 
chain  with  equidistant  atoms  does  not  correspond 
the  metallic  situation,  but  the  symmetry  broken 
solution.  Accordingly,  the  band  structures  for  the 
three  values  of  Aq  will  all  exhibit  a  gap  at  the  edge 
of  the  Brillouin  zone. 

The  DS  calculations  have  been  performed  with 
the  PLH  program  [2]  and  the  calculations  in  FS 
using  a  new  prototype  program,  FTCHAIN.  In  all 
cases,  the  criterion  for  convergence  of  the  iterative 
SCF  procedure  on  the  density  matrix  elements  is 
set  equal  to  10“*.  Calculations  have  been  carried 
out  using  two  atomic  s  functions  on  each  atom, 
each  function  described  by  three  gaussians  to 
mimic  Slater  functions  of  exponents  =  2.69  and 
^2s  —  0.65  [7].  It  must  be  pointed  out  that  this 
fairly  localized  minimal  basis  set  has  been  selected 
on  purpose  to  guarantee  everywhere  reliability 


d=5.5  a.u. 

^ - ► 


0 


a 


0 


z 

FIGURE  1.  Schematic  picture  of  the  infinite  chains  of  lithium  molecules,  (-Lig-lx,  considered  in  this  work.  The  values 
of  ag,  the  unit  cell  length,  are  11.0,  12.0,  and  13.0  a.u.  The  interatomic  distance  in  the  unit  cell,  d,  is  equal  to  5.5  a.u. 
The  z  axis  is  the  direction  of  periodicity. 
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and  accuracy  of  the  multipole  expansion  used  in 
the  PLH  program  to  evaluate  the  Madelung  terms. 

RELEVANT  RHF  EXPRESSIONS 

The  RHF  Bloch  states  0„(/c,r)  are  doubly  occu¬ 
pied  up  to  the  Fermi  energy  Ep  and  orthonormal- 
ized  as  shown  in  the  following  equation: 

j  dr(l)*,ik',i)(f>„(k,r)  =  (1) 

N(N  ^  oo)  is  the  total  number  of  cells  in  the  sys¬ 
tem,  the  lattice  sites  are  identified  by  the  integers 
m,  m',  and  m"(=  0,  +  1,  +  2, . . . ),  and  n  and  n' 
are  the  band  indices.  In  our  notation,  the 
wavenumber  k  is  expressed  in  units  of  Itt/Uq  and 
is  defined  in  the  Brillouin  zone  (BZ),  i.e.,  fc  e  [  —  ^, 
|].  The  Bloch  states 


+ 


X 


f  dk'Y^oAk') 

•'bz 


(^*(/c',r2)0,^(fc',r2) 


r,  -  to 


-/  dk'j:eAk') 

Z  *'BZ 


f,  (f>*Xk',r2)(l>„(k,r2) 

X  j  ‘^J'2 - — rn - 


r,  -  r. 


=  s„(k)(l>„(k,i^), 


(6) 


where  F(r^)  is  the  Fock  operator  in  the  restricted 
form  for  double  occupancy  of  the  Bloch  states; 
and  vector  define  the  charge  and  the  position 
of  atom  fl,  respectively,  and  M  is  the  number  of 
atoms  (M  =  2  in  this  work).  Equation  (6)  can  be 
expressed  in  terms  of  the  atomic  functions  as 


0„(/c,r)  =  i:C,,(fc)^^,(/c,r)  (2) 

p 


i:F„(fc)c,„(fc)  =  s„(k)T,s^^a)c^„(k),  (7) 


are  expressed  in  terms  of  Bloch  sums  b^ik^x) 

cc 

bp(k,r)  =  Y.  -  ma^e^) 

m  - 

oc 

=  E 

-  cc 


where  F^^ik)  and  S^^ik)  are  the  elements  of  the 
LCAO  Fock  ¥(k)  and  overlap  S(k)  matrices,  de¬ 
fined  as 


F^^(k)  =  N  ^  j  dx  ^7*(/c,r)f(r)bg(/:,r)  (8) 


where  p  and  vector  A^  represent  the  index  and 
the  center  in  the  unit  cell  of  the  atomic  function, 
Xp,  respectively. 

The  charge  neutrality  condition  of  the  chain  is 
given  by 


(  dkj^e„(k)  =  n^ 

■'bz  „ 

with  the  occupation  function  d„(k)  defined  as 


(4) 


d„ik) 


2  if  e„ik)  <  Ep 
0  if  s„(k)  >  Ep. 


(5) 


The  RHF  equations  of  the  Bloch  states  and  the 
energy  bands  s„(k)  are 


F(ri)<f.„(fc,ri) 

V^fr^) 

2 


M  7 

E - ^ - 

«  =  i  Ih  -  A„  -  mfloe. 


and 


Sp^(k)  =  N^j'drb*(k,T)b^(k,r).  (9) 

The  Fock  matrix  elements  are  composed  of  three 
terms: 


F^k)  =  T^k)  +  V^k)  +  (10) 

where  T^^ik)  is  the  kinetic  energy,  Vp^(k)  the  clas¬ 
sical  electrostatic  energy,  which  includes  the  elec¬ 
tron-nuclear  attraction  and  the  electron- electron 
repulsion  contributions,  and  X^^ik)  the  exchange 
energy.  Analysis  of  the  problem  in  these  terms 
with  respect  to  the  convergence  of  the  lattice  sums 
which  enter  in  their  definition,  and  the  analytic 
properties  of  the  LCAO  density  matrix  elements 


Pp,(k)  =  zcp„a)c*„a)e„(k)  (11) 
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in  the  BZ,  as  well  as  the  consequences  on  the  RHF 
results  are  well  documented  [8]. 

COMPUTATIOIVAL  ASPECTS 

In  this  section,  we  briefly  contrast  the  way  the 
matrix  elements  are  computed  in  the  DS  and  FS 
approaches. 


Direct  Space 

In  the  configuration-space  approach,  the  differ¬ 
ent  components  are  expressed  as 


S,,(fc)  = 

^iTrikmcin 

m=-N^ 

(12) 

= 

N, 

Y''  ^iTrikni’-rm 

(13) 

m=-N^ 


=  E 


=  E 


E  E  {LPrt 


m”  =  - 2c  m'  ^  - N,.  \  rs 


0  m  ni"  m'  +  m" 

P  q  r  s 


y  m"  =  -N^.  ni'  -  -Ny.  rs 


+  long  range  correction. 


E  E  P/: 

=  -  Ny  rs  \  Til' ,  ni"  =  -  Ny. 


where  is  the  limit  used  in  the  lattice  summa¬ 
tions  appearing  in  the  exchange  and  in  the 
Coulomb  terms.  In  the  Coulomb  terms,  however, 
the  summation  over  m"  decays  like  Hence, 

the  summation  over  m"  is  partitioned  in  a  short- 
range  region  (-N,  <  m"  <  +N^)  where  all  inte¬ 
grals  are  calculated  in  a  standard  way.  Outside  of 
this  region  a  multipole  expansion  is  applied  and 
the  infinite  summations  are  carried  out  explicitly 
using  the  properties  of  Rieman  zeta  functions.  S'” , 


0  m'  m  nf  m 
p  s  r  cj 


JVl 

v;‘im")=  E  fdTx°(r-A^,) 


r  —  —  tn"  ae. 


x:'(r)  (18) 


0  mm'  m" 
p  q  r  s 

=  j  dx^j  dt 


A'p(ri);\r“(ri)A-;'’(r2 )  xf  (12 ) 


fo 

m  m' 

m"  \ 

[P 

q  r 

^  1 

respectively,  are  the  overlap,  kinetic,  nuclear  at¬ 
traction  and  two-electron  integrals  between  the 
atomic  functions  used  to  represent  the  Bloch  state 
d>„(k,r): 

S;\  =  j  drx°iT)x;"(T),  (16) 

=  /  dxx^ir)  ^  x^ir),  (17) 


where  P/”  is  the  mth  Fourier  coefficient  of  the 
complex  Fourier  series  representing  P,//:), 


In  this  work,  only  s  type  atomic  orbitals  located 
on  the  2  axis  are  used: 
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and  the  above  integrals  take  the  particular  forms: 


4a  a 

cm  _  f  r  ‘I 

f"'  '  77 


77 


OLp  +  <X^ 


3/2 


a  a  2  i 

X  expl - "  Ap  +  ma^ej  |, 


(22) 


ym  _  _ 

I  a„  +  a 


9  / 


X  <  3a„a^  —  2- 


-|A„-A, 


'P“7  “  4-  '  7  P 


+  mfloe,nS”  , 


(23) 


2(a  +  a  ) 

— S” 


PH  ' 


M 

-m  ^ 
PH  ^ 


X 


Fo((a,  +  u.XG;,  -  A„  + 

(24) 


0  m 
P  ^ 


M"  m"  +  m 


') 


(a„  +  a„)(a„  +  a^) 
r>  /  P _ H  ^ cm  cm 

^1/  ^  I  I  I  ^  ^vq'^rs 

7r(ct^,  +  +  a^J 

(ttp  +  a,)(a,  +  a/ 


X  Fr 


«p  +  “,  +  “r  +  “s 


(?n"flo 


e.  +  G" 


■g;;) 

(25) 


Fourier  Space 

In  the  Fourier  transform  approach,  the  expres¬ 
sions  for  the  Coulomb  and  the  exchange  terms  are: 


Vfc) 


=  (77«o)'’  E  f  dk'Pjk')Z  /  ^^^S,,(fc,q^) 

rc  •'BZ  u  ^0  M 


1  /  ^ 

Srsik',  -qp)  -  -Sjk',0)  E 

\.=1 


■'BZ 

dqo 


(27) 

X,/fc)  =  -(277flo)''  E 

(/r  +  ^:-A:')^ 

^  Sp5(fc', q^+^_j(.-)S,^(fc,  “qp,+t-j:')/ 

(28) 

where  q  <=>  {q^,  q^,  qj  and  q^  «  (q^,  qy,  fi). 

In  these  equations,  the  generalized  overlap 
Spyik,  q)  integral  is  defined  by: 

Sp,ik,q)=  (29) 

m 

In  the  case  of  s-atomic  orbitals,  the  generalized 
overlap  integral  has,  after  factorization  in  the 
q^,qy,  and  q^  components,  the  following  analyti¬ 
cal  expression: 

S,,(fc,q)  =  S/'^'>S/>(I:,(?,)e'^’^‘'-^- 


X  exp 


al  (ttp  +  ay) 


(30) 


where  and  G"J’  are  the  centers  of  the  gauss- 
ians  that  result  of  the  product  of  two  gaussians 
X°(r)  and  XqHr),  and  y°(r)  and  y/'Xr), 


g^A^  +  ay(Ay  +  ma^ej  ^ 

P'?  +  a^ 


Fn(t)  is  related  to  the  error  function. 


Fo(t)  =  pexpi-tu^)du  =  —  i/^erf()/F) 

•'o 


2  V  t 


(t  >  0). 


with  P’  defined  as: 


. 4a„a. 

g(ar,  y)  ^1  P  7 
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77 


ttp  -h  ay 


X  exp<  - 


“p“7 


Otp  +  CLc, 


+  (A 


y-p 


[(A,,p- A,,,)" 

01) 


The  2-component  involves  a  lattice  summa¬ 
tion  that  can  always  be  made  to  converge  rapidly 
by  choosing  one  of  the  two  equivalent  forms  (32a) 
or  (32b)  linked  by  the  Poisson  formula  according 
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to  the  particular  values  of  the  gaussian  exponents, 
'  4a„a„  ^  ^ 


SilKk,qJ  = 


or 


p  ^1 


77 


X  Lexp 


iTrinn  k  + 


^2 


a„o(„  , 

- 7 - Onag  +  A^_  “  A,  ) 

a„  +  a,  '  ^ 


(32a) 


S\,^Kk,qJ 


TT  I  4a^,a^ ''  I 


,  1/2 


27  J  \ 


X  s^,;\k,-iix  +  k-k')) 

^  g(2i7/flo)<(M  +  *:-J:'XC5,p,-G"  ,,) 

X  Ei(77(m  +  fc  -  fc')").  (34) 

In  these  expressions,  y,  5,  and  77  are  respectively 
defined  as: 


+  ttg 


/77\^/  1  1  \ 

^  =  - - ^ - 

\  ^7o  /  \  +  a,  a,  +  a  J 


X  exp 
X  XI  exp 


277  a. 
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(32b) 


Using  these  relations,  the  Coulomb  and  exchange 
terms  then  become: 


V,,(k) 


E  dk'Pjk’) 

_  1  /T 


ri/2 
'-1/2 


X 


(33) 


The  rate  of  convergence  of  these  lattice  summa¬ 
tions  can  be  slow  if  the  y,  8,  or  77  values  are  small, 
or  alternatively  when  (a^  4-  a^)  or  (a^  +  a^)  are 
large.  Rapidly  convergent  series  are  obtained  us¬ 
ing  a  combination  of  the  Ewald  technique  [9]  and 
the  Poisson  formula  [10],  the  latter  being  applica¬ 
ble  thanks  to  the  gaussian  expansion  of  the  atomic 
functions. 

The  numerical  integration  over  k'  is  carried  out 
with  the  same  quadrature  scheme  and  integration 
points  for  both  the  Coulomb  and  exchange  contri¬ 
butions.  The  method  is  based  on  a  grid  of  equidis¬ 
tant  points  (in  our  case  33  k'  points  in  the  first 
Brillouin  zone)  and  Newton-Cotes  formulas  [11]. 
The  numerical  integration  procedure  has  specifi¬ 
cally  been  designed  to  handle  analytically  the  sin¬ 
gularity  in  the  exchange  part.  The  integration 
weights  needed  to  treat  the  singular  integrand 
occurring  in  the  exchange  term  in  a  suitable  way 
have  been  constructed  from  the  Lagrange  interpo¬ 
lation  formulas  [11].  Reference  5  contains  more 
details  about  this  newly  proposed  computational 
technique. 


Results 


and 


X,,(k) 


~  E  d/:'P,,(U)S<jUs(x'/) 

2rto  rs  -^-1/2 


First,  we  analyze  comparatively  the  results  ob¬ 
tained  with  the  DS  and  FS  approaches  for  the  three 
values  of  the  unit  cell  length  (^q  =  11.0,  12.0,  and 
13.0  a.u.).  Then,  to  further  illustrate  the  power  of 
the  FS  approach,  we  report  results  on  the  symme¬ 
try  constrained  metallic  situation. 
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TABLE  I _ _ ^ - 

Energy  values  e{k)  of  the  occupied  and  unoccupied  valence  bands  at  selected  k  points  and  the  band  gaps, 
W,  for  the  infinite  chain  of  lithium  molecules,  (-Ll2-)xj  with  (a)  Sq  =  11.0,  (b)  12.0,  and  (c)  13.0  a.u. 
computed  with  the  DS  and  FS  approaches.  (Energies  are  in  atomic  units.)  _ 


(a) 

(b) 

(0 

s{k) 

DS 

FS 

DS 

FS 

DS 

FS 

occ. 

ff(O.O) 

-0.228535 

-0.228533 

-0.217075 

-0.217076 

-0.207408 

-0.207410 

£  (0.1 25) 

-0.223255 

-0.223254 

-0.212417 

-0.212417 

-0.203517 

-0.203519 

£(0.25) 

-0.206738 

-0.206730 

-0.198218 

-0.198217 

-0.192080 

-0.192079 

e  (0.375) 

-0.177521 

-0.177518 

-0.175256 

-0.175256 

-0.175418 

-0.175416 

s(0.5) 

-0.149940 

-0.149992 

-0.158195 

-0.158195 

-0.165188 

-0.165188 

unocc. 

s(0.0) 

0.152080 

0.152152 

0.119592 

0.119599 

0.093216 

0.093201 

e  (0.1 25) 

0.111178 

0.111210 

0.089544 

0.089549 

0.071268 

0.071260 

£(0.25) 

0.040393 

0.040367 

0.032494 

0.032492 

0.026775 

0.026771 

£(0.375) 

-0.021191 

-0.021217 

-0.018549 

-0.018553 

-0.013423 

-0.013424 

£(0.5) 

-0.058807 

-0.058761 

-0.044671 

-0.044673 

-0.031566 

-0.0311561 

w 

0.091133 

0.091231 

0.113524 

0.113522 

0.133622 

0.133627 

Hit 

57 

38 

10 

22 

8 

18 

LITHIUM  CHAINS  WITH  ao  =11.0,  12.0,  AND 
13.0  a.u. 

The  valence  band  structures  obtained  by  the  DS 
and  FS  approaches  for  (-~Li2-)x  with  Uq  =  11.0, 
12.0,  and  13.0  a.u.  are  shown  in  Figure  2  where 


FIGURE  2.  Valence  bands  obtained  by  the  DS  and  FS 
approaches  for  the  infinite  chain  of  lithium  molecules, 
(-Li2-)x,  with  (a)  ao  =  11.0,  (b)  12.0,  and  (c)  13.0  a.u. 
Energies  are  in  atomic  units. 


they  are  denoted  by  (a),  (b),  and  (c),  respectively. 
The  DS  results  have  been  obtained  with  =  10, 
the  maximum  number  of  interacting  cells  that  can 
presently  be  taken  into  account  in  the  PLH  pro¬ 
gram  to  compute  the  exchange  contributions  to  the 
Fock  matrix  elements,  Eq.  (15).  Selected  energy 
values  s(k)  of  the  occupied  and  unoccupied  va¬ 
lence  bands,  and  the  band  gaps  W  obtained  by  the 
two  approaches  are  listed  in  Table  I.  The  number 
of  iterations  needed  to  achieve  convergence  of 
the  density  matrices  within  the  preassigned 
threshold  is  also  indicated.  Overall,  the  DS  and  FS 
results  compare  fairly  well  and  their  graphs  in 
Figure  2  are  indeed  indistinguishable.  Neverthe¬ 
less  from  data  in  Table  I,  one  can  notice  that  the 
more  alternant  the  situation  is,  the  better  the 
agreement  between  the  two  approaches.  To  trace 
the  origin  of  this  discrepancy,  it  is  useful  to  ana¬ 
lyze  the  changes  during  the  iterations  of  the  en¬ 
ergy  band  values  obtained  by  the  DS  approach  for 
the  chain  of  equidistant  atoms  (a). 

The  first  iteration  starts  with  a  guess  density 
matrix  based  on  the  diagonal  elements  of  the  over¬ 
lap  matrix  S(k): 

pf^ik)  =  (35) 

where  O^^ik)  =  2  for  k  £[  -  \,\]  and  1  for  g 
U  [j,  5]),  p  =  Is  and  2s.  The  valence 
band  structures  corresponding  to  the  first  (dashed 
line)  and  last  (continuous  line)  iterations  of  the 
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FIGURE  3.  Energy  valence  band  obtained  by  the  DS 
approach  for  the  infinite  chain  of  lithium  molecules, 
(-Li2-)x,  at  the  first  (dashed  line)  and  last  (continuous 
line)  iterations  of  the  SCF  procedure.  Energies  are  in 
atomic  units. 


SCF  procedure  are  shown  in  Figure  3.  At  the  first 
iteration  the  band  structure  is  degenerate  (to  within 
accuracy  of  the  calculations)  at  the  edge  of  the 
Brillouin  zone  {k  =  0.5).  As  the  iterations  proceed, 
the  degeneracy  is  progressively  lifted  and  the  solu¬ 
tion  converges  toward  the  energetically  more  sta¬ 
ble  symmetry  broken  solution.  Figures  4al  and  4a2 
compare  the  graph  of  the  density  matrix  element 
Pzl-is  with  respect  to  m  at  the  first  and  last 
iterations.  The  corresponding  values  are  given  in 
Table  II.  It  is  observed  that  the  decay  of  these 
elements  is  rather  slow  [8]  at  the  first 

iteration  since  it  corresponds  to  a  '"numerically 
transient"  metallic  situation.  At  the  end  of  the  SCF 
process  a  gap  has  opened  up  (Fig.  3)  and  the  decay 
is  now  of  the  exponential  type  as  theoretically 
predicted  for  the  nondegenerate  cases  [8].  The  ac¬ 
tual  rate  of  the  exponential  decay  is  governed  by 
the  analytical  features  of  the  one-electron  states 
and  energies  with  respect  to  k  (see  appendix  in 
Ref.  8c).  To  a  first  approximation,  the  convergence 
rate  can  be  related  to  the  energy  band  gap  [8b]. 
This  can  be  verified  with  case  (b)  (qq  =  12.0  a.u.) 
for  which  the  gap  is  slightly  larger  than  for  (a) 
(Fig.  2);  the  corresponding  density  matrix  ele¬ 
ments,  also  given  in  Table  II  and  plotted  in  Figure 


FIGURE  4.  Graphs  of  the  evolution  with  respect  to  m 
of  the  density  matrix  element  P^_2s  "fof  ihe  Infinite 
chain  of  lithium  molecules  with  ao  =  11 .0  a.u.  at  the  (a1) 
first  and  (a2)  last  iterations  of  the  SCF  procedure  and 
with  (b)  ag  =  12.0  a.u.  at  the  last  iteration. 


4b,  indeed  decay  faster  than  for  (a).  Thus,  the 
larger  discrepancy  between  the  direct  space  and 
Fourier  space  results  (Table  I)  noted  for  the  chain 
of  evenly  distributed  lithium  atoms  (a)  indicate 
that  the  number  of  interacting  cells,  |N^|  =  10, 
used  to  evaluate  the  exchange  contributions  is  not 
enough  to  reach  5  decimal  digits  accuracy. 
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Valuh^witn  respect  to  m  of  the  density  matrix  element  P™  _  for  the  infinite  chain  of  lithium  molecules  with 
ao  =  11.0  a.u.  at  the  (a1)  first  and  (a2)  last  iterations  of  the  SCF  procedure  and  with  (b)  Sq  =  12.0  a.u.  at  the 
last  iteration.  _ _ 


(a1) 


(a2) 


(b) 


0 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 


0.685031 
-0.151910 
0.546705  X  10^' 
-0.266155  X  10“' 
0.155983  X  10“' 
-0.102099  X  10“' 
0.721343  X  10“^ 
-0.537895  X  10“^ 
0.417780  X  10“^ 
-0.334927  X  10“^ 
0.275415  X  10“^ 


0.720026 
-0.172434 
0.601288  X  10  ' 
-0.261654  X  10“' 
0.128804  X  10“' 
-0.672170  X  10  ^ 
0.365573  X  10“^ 
-0.203782  X  10“^ 
0.115738  X  10“^ 
-0.666340  X  10"® 
0.390831  X  10“® 


0.699199 
-0.127620 
0.352045  X  10"' 
-0.121790  X  10"' 
0.472761  X  10“' 
-0.194125  X  10 
0.829047  X  10  ® 
-0.362483  X  10“® 
0.161329  X  10“® 
-0.727299  X  10"'* 
0.332536  X  10"'* 


To  further  assess  the  importance  of  the  number 
of  interacting  cells  taken  into  account  to  evaluate 
the  exchange  terms  in  DS  calculations,  we  have 
computed  the  band  structure  (the  occupied  and 
the  unoccupied  valence  band)  for  the  chain  of 
evenly  distributed  lithium  atoms  (a)  with  N^.  = 
2,3,5,  and  10.  Energies  at  selected  k  points  and 
band  gaps  are  listed  in  Table  111,  and  the  valence 
band  structures  are  plotted  in  Figure  5.  For  the 
sake  of  clarity,  =  5  is  omitted  in  Figure  5.  It 
appears  that  even  if  the  graphs  are  essentially 
stabilized  for  =  3,  the  values  in  Table  III  show 
that  significant  variations  in  the  energies  still  per¬ 
sist. 


SITVIIVIETRY-CONSTRAINED  METALLIC  CHAIN 

As  we  have  just  seen,  even  in  nondegenerate 
situations  with  gaps  as  large  as  0.094  a.u.  or  2.5  eV, 
DS-RHF  band  structures  can  sttll  be  sensitive  to 
the  number  of  interacting  cells  through  their  ex¬ 
change  contributions.  Here  we  wish  to  show  that 
the  FS  approach  applied  to  compute  the  band 
structure  of  the  symmetry-constrained  metallic  so¬ 
lution  of  (-Li-)^  (one  Li  atom  per  unit  cell)  leads 
to  the  genuine  features  of  the  RHF  approach, 
namely  the  vanishing  of  the  density  of  states  at  the 
Fermi  level.  The  results  given  in  Table  IV  and 
Figures  6  and  7,  reveal  that,  except  in  the  vicinity 


Energy  values  e(k)  of  the  occupied  and  unoccupied  valence  bands  at  selected  k  points  and  the  band  gaps, 

W,  for  the  infinite  chain  of  evenly  distributed  Li  atoms  and  for  the  DS  and  FS  approaches.  In  the  DS  calculations, 
different  numbers  of  interacting  celis  (A/„)  have  been  considered.  (Energies  are  in  atomic  units.) _ 


DS 


Hk) 


N=2 


N=3 


N  =5 


A/  =  10 


FS 


occ. 

s(O.O) 

-0.229288 

£(0.125) 

-0.223232 

£(0.25) 

-0.205896 

£(0.375) 

-0.177509 

£(0,5) 

-0.151701 

unocc. 

£(0,0) 

0.164063 

£(0,125) 

0.104394 

£(0,25) 

0.039743 

£(0,375) 

-0.017581 

£(0.5) 

-0.061858 

W 

0.089843 

-0.228249 

-0.228475 

-0.223475 

-0.223290 

-0.206686 

-0.206758 

-0.177305 

-0.177470 

-0.150501 

-0.150016 

0.147813 

0.151261 

0.114752 

0.111203 

0.037917 

0.040868 

-0.019508 

-0.021235 

-0.059989 

-0.059054 

0.090512 

0.090962 

-0.228535 

-0.228533 

-0.223255 

-0.223254 

-0.206738 

-0.206730 

-0.177521 

-0.177518 

-0.149940 

-0.149992 

0.152080 

0.152152 

0.111178 

0.111210 

0.040393 

0.040367 

-0.021191 

-0.021217 

-0.058807 

-0.058761 

0.091133 

0.091231 
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FIGURE  5.  Valence  bands  obtained  by  the  DS 
approach  for  the  infinite  chain  of  evenly  distributed  Li 
atoms  using  different  numbers  of  interacting  cells  (N^). 
Energy  values  are  In  atomic  units. 

of  k  =  kp,  the  band  structures  for  the  metallic  and 
symmetry-broken  solutions  are  almost  identical. 
Thus,  it  seems  from  this  application  that  the  gen¬ 
eral  features  of  the  electronic  structures,  in  particu¬ 
lar  the  band  widths,  are  not  better  nor  less  well 

TABLE  IV _ 

Energy  values  e{k)  of  the  occupied  and 
unoccupied  valence  bands  at  selected  k  points 
and  the  band  gaps,  W,  obtained  for  the  symmetry- 
constrained  metallic  (dotted  line)  and  the  symmetry- 
broken  (continuous  line)  solutions.  (Energies  are  in 
atomic  units.) 


e(k) 

DS 

FS 

occ. 

£(0.0) 

-0.228535 

-0.229977 

£(0.125) 

-0.223255 

-0.224647 

£(0.25) 

-0.206738 

-0.207620 

£(0.375) 

-0.177521 

-0.174672 

£(0.5) 

-0.149940 

-0.104701 

unocc. 

£(0.0) 

0.152080 

0.152798 

£(0.125) 

0.111178 

0.111863 

£(0.25) 

0.040393 

0.040592 

£(0.375) 

-0.021191 

-0.024702 

£(0.5) 

-0.058807 

-0.104701 

W 

0.091133 

0.0 

k 

FIGURE  6.  Valence  bands  obtained  for  the 
symmetry-constrained  metallic  (dotted  line)  and 
symmetry-broken  (continuous  line)  solutions.  To  facilitate 
comparison,  the  DS  valence  band  has  been  unfolded. 
Energy  values  are  in  atomic  units. 


E 


FIGURE  7.  Valence  density  of  states  D(E)  obtained  for 
the  symmetry-constrained  metallic  (dotted  line)  and 
symmetry-broken  (continuous  line)  solutions.  Energy 
values  are  in  atomic  units  and  the  density  of  states 
values  are  In  arbitrary  units. 
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described  in  metals  than  in  insulators  at  the  RHF 
level. 

The  graphs  of  the  valence  density  of  states  ob¬ 
tained  for  the  metallic  (dotted  line)  and 
symmetry-broken  (continuous  line)  solutions  are 
shown  in  Figure  7.  In  the  case  of  the  metallic 
solution  and  within  the  resolution  of  the  density  of 
states  calculations  reported  here,  the  special  be¬ 
havior  of  RHF  metallic  cases  is  apparent.  A  more 
detailed  analysis  of  the  valence  and  core  bands  of 
the  symmetry-constrained  metallic  case  can  be 
found  in  Ref.  12. 


Concluding  Remarks 

From  this  work,  the  following  concluding  re¬ 
marks  can  be  made: 

1.  Even  for  systems  with  gaps  of  the  order  of  2.5 
eV,  typically  case  (a),  the  importance  of  the  num¬ 
ber  of  interacting  cells  contributing  to  the  ex¬ 
change  terms  is  still  felt  beyond  10  neighbors.  It 
must  be  further  stressed  that  these  observations 
are  made  for  a  localized  minimal  basis  which  is  a 
rather  favorable  situation  from  a  numerical  point 
of  view.  Even  if  it  could  be  argued  that  the  net 
effect  does  not  modify  qualitatively  the  final  pic¬ 
ture  in  the  case  studied  here,  a  similar  analysis  on 
complex  systems  where  more  diffuse  bases  are 
generally  used  is  thus  necessary. 

2.  The  ES  approach  constitutes  a  suitable  frame¬ 
work  for  accurate  evaluations  of  all  lattice  sums 
for  localized  as  well  as  delocalized  bases.  As  such, 
it  appears  as  a  way  to  avoid  the  difficult  problem 
of  selecting  the  number  of  interacting  cells  needed 
in  a  particular  application. 

3.  To  our  toowledge,  the  FS  approach  is 
presently  the  only  approach  capable  of  a  proper 
description  of  RHF  symmetry-constrained  metallic 
solutions  of  realistic  systems. 

4.  On  a  more  pragmatic  ground,  these  compar¬ 
isons  constitute  checks  on  the  reliability  of  our  FS 
and  DS  codes. 

Our  Fourier  transform  code  (FTCHAIN)  is  still 
in  a  prototype  phase,  but  it  has  already  con¬ 
tributed  to  illustrate  the  merits  of  the  Fourier  rep¬ 
resentation  in  the  context  of  calculations  of  the 
electronic  structure  of  extended  chains.  Ongoing 
work  to  bring  it  to  a  more  applicable  stage  in¬ 
cludes  the  rather  demanding  task  of  obtaining 
working  forms  for  new  types  of  integrals  and  the 
corresponding  computer  routines. 


ACKNOWLEDGMEOTS 

All  calculations  have  been  performed  on  the 
IBM  RS6000  Models  730  and  560  of  the  Namur 
Scientific  Computing  Facility  (Namur-SCF).  The 
authors  gratefully  acknowledge  the  financial  sup¬ 
port  of  the  FNRS-FRFC,  the  'Toterie  Nationale" 
for  the  convention  No.  9.4593.92,  and  the  FNRS 
within  the  framework  of  the  "Action  ddmpulsion 
a  la  recherche  fondamentale"  of  the  Belgian  Min¬ 
istry  of  Science  under  the  convention  0,4511.93. 


References 

1.  (a)  T.  E.  Peacock  and  R.  McWeeny,  Proc.  Phys.  Soc.  London 
74,  385  (1959).  (b)  S.  Yomosa,  J.  Phys.  Soc.  Jpn.  19,  1718 
(1964).  (c)  G,  Del  Re,  J.  ladik,  and  G.  Biezo,  Phys.  Rev.  155, 
977  (1967).  (d)  J.  M.  Andre,  L.  Gouvemeur,  and  G.  Leroy, 
Int.  J.  Quantum  Chem.  1,  427  (1967). 

2.  J.  M.  Andre,  D.  H.  Mosley,  B.  Champagne,  J.  Delhalle,  J.  G. 
Fripiat,  J.  L.  Bredas,  D.  J.  Vanderveken,  and  D.  P.  Ver- 
cauteren.  Methods  and  Techniques  in  Computational  Chemistry 
(METECC-94X  E  dementi,  Ed,  (Stef,  Cagliari,  1993),  Vol.  B. 
J.  G.  Fripiat,  D.  H.  Mosley,  B.  Champagne,  and  J.  M.  Andre, 
PLH-93  from  METECC-94. 

3.  (a)  F.  Seitz,  The  Modern  Theory  of  Solids  (McGraw-Hill,  New 
York,  1940).  (b)  P.  Fulde,  Electron  Correlation  in  Molecules 
and  Solids  (Springer- Verlag,  Berlin,  1991). 

4.  (a)  F,  E.  Harris  and  J.  H.  Monkhorst,  Phys.  Rev.  B2,  4400 
(1970).  (b)  F.  E.  Harris,  J.  Chem,  Phys.  56,  4422  (1972).  (c) }. 
Delhalle  and  F.  E.  Harris,  Phys.  Rev.  B31,  6755  (1985). 

5.  J.  Delhalle,  J.  Cizek,  1.  Flamant,  J.  L.  Calais,  and  J.  G.  Fripiat, 
J.  Chem.  Phys.  101,  10717  (1994). 

6.  Gaussian-92 /DFT,  Revision  F.2,  M.  J.  Frisch,  G.  W.  Trucks, 
H.  B.  Schlegel,  P.  M.  W.  Gill,  B.  G.  Johnson,  M.  W.  Wong,  J. 
B.  Foresman,  M.  A.  Robb,  M.  Head-Gordon,  E.  S.  Replogle, 
R.  Gomperts,  J.  L.  Andres,  K.  Raghavachari,  J.  S.  Binkley,  C. 
Gonzalez,  R.  L.  Martin,  D.  J.  Fox,  D.  J.  Defrees,  J.  Baker,  J.  J. 
P.  Stewart,  and  J.  A.  Pople,  Gaussian,  Inc.,  Pittsburgh  PA, 
1993. 

7.  R.  Poirier,  R.  Kari,  and  1.  G.  Csizmadia,  Handbook  of  Gauss¬ 
ian  Basis  Sets  (Elsevier,  New  York,  1985). 

8.  (a)  H.  J.  Monkhorst,  Phys.  Rev.  B  20,  1504  (1979).  (b)  H.  J. 
Monkhorst  and  M.  Kertesz,  Phys.  Rev.  B  24,  3025  (1981).  (c) 
J.  Delhalle  and  J.  L.  Calais,  J.  Chem.  Phys.  85,  5286  (1986). 

9.  P.  P.  Ewald,  Ann.  Phys.  64,  253  (1921). 

10.  P.  Henrici,  Applied  and  Computational  Complex  Analysis  (Wi¬ 
ley,  New  York,  1977),  Vol.  2. 

11.  M.  Abramowitz  and  L  A.  Stegun,  Handbook  of  Mathematical 
Functions  (Dover,  New  York,  1968). 

12.  1.  Flamant,  J.  Delhalle,  and  J.  G.  Fripiat,  submitted  for 
publication. 


INTERNATIONAL  JOURNAL  OF  QUANTUM  CHEMISTRY 


1497 


Iodine-Doped  Transpolyacetylene 


G  P  DAS* 

Systran  Corporation,  4126  Linden  Ave,  Dayton,  Ohio  45432:  e-mail:  dasgp@picard.ml.mp^M-mil 

A.  T.  YEATES  AND  D.  S.  DUDIS 

WL  /  MLBP,  WPAFB,  Ohio  45433 

Received  February  28,  1996;  accepted  April  2,  1996 


ABSTRACT 


With  a  view  to  elucidate  and  develop  a  model  for  the  mechanism  of  conduchon  m  doped 
transpolyacetylene  (t-PA),  we  have  carried  out  an  ab  initio  study  of  ground  and 
excited  state  of  transoctatetraene  in  the  presence  of  an  lodme  atom.  While  the  groimd 
state  is  calculated  as  characterized  by  a  slightly  distorted  alternating  bond  geomehy, 
with  iodine  remaining  largely  neutral  on  top  of  the  plane  of  the  polyene  near  rougWy  the 
midpoint  of  the  chain  axis,  the  excited  state  has  the  geometry  characterized  ^  a  solitomc 
distortion  with  the  octatetraene  carrying  a  net  positive  charge  close  bandgap 

[highest  occupied  to  lowest  unoccupied  molecular  orbital  (HOMO  LUMO)  gap]  is 
computed  to  be  nearly  0.8  eV.  Moreover,  the  excited-state  energy  appears  very  msensitive 
to  the  position  of  iodine  along  the  axis  of  the  octatetraene  chain.  As  an  extension  of  an 
earlier  calculation  on  the  transport  of  charge  by  a  charged  soliton,  a  similar  calculahon  is 
performed  on  a  CgHJo  "kink”  traveling  along  the  backbone  of  CjjHu,  yieldmg  very 
similar  results.  A  computer  code  has  been  developed  to  compute  the  transport  velocity  of 
charge  based  on  the  above  model  and  can  be  used  in  conjunction  with  the  GAMESb 
program  package.  ©  1996  John  Wiley  &  Sons,  Inc. 


Introduction 


When  doped  even  at  a  very  low  level,  trans- 
polyacetylene  (t-PA)  exhibits  very  high 
conductivity  [1,  2].  In  understanding  the  way 
charge  is  transported  in  organic  systems  under  the 
influence  of  an  electric  field,  one  has  to  take  into 
account  many  details  of  the  system  beyond  those 

*  To  whom  correspondence  should  be  addressed. 


required  in  understanding  conduction  in  metals 
and  inorganic  semiconductors.  While  the  latter  is 
amenable  to  interpretation  in  terms  of  largely  free 
electrons  or  holes,  the  conductivity  in  organic  sys¬ 
tems  appears  to  be  a  truly  many-body  phe¬ 
nomenon  where  not  only  is  such  reduction  to  free 
electrons  inappropriate,  but  also  the 
electron— nuclear  coupling  seems  to  be  too  strong 
to  be  treated  as  perturbation.  First  proposed  by  Su, 
Schrieffer,  and  Heegar  (SSH)  [3],  the  traveling  soli¬ 
ton  waves  are  now  generally  accepted  as  repre- 
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senting  the  predominant  mode  of  electrical  con¬ 
duction  in  doped  transacetylene  and  perhaps  other 
similar  organic  polymeric  systems.  In  this  soliton 
model  of  electrical  conduction  the  electrical  energy 
is  carried  by  positively  or  negatively  charged  soli- 
tons  which  are,  in  fact,  "kinks"  in  an  otherwise 
periodic  Peierls  bond-alternated  geometry. 

The  solitonic  transport  theory  developed  so  far 
as  relied  almost  exclusively  on  the  SSH  model 
Hamiltonian  or  its  variants  [3].  Because  of  the  very 
approximate  nature  of  these  models,  however,  even 
though  the  reality  of  charged  solitons  as  carriers  of 
electrical  energy  is  now  accepted  universally,  the 
predictions  of  their  mobility  [4-6]  based  on  these 
models  remain  unconvincing.  This  is  notwith¬ 
standing  the  fact  that  several  all-electron  semiem- 
pirical  [7]  and  ab  initio  [8]  studies  on  the  structure 
of  charged  solitons  reveal  roughly  the  same  fea¬ 
tures  as  obtained  from  the  SSH  models.  We  have 
recently  carried  out  an  ab  initio  evaluation  of  the 
transport  velocity  of  a  positively  charged  soliton 
and  made  an  estimate  of  the  conductivity  based  on 
these  calculations  [2].  In  this  analysis  several  as¬ 
sumptions  have  been  made.  First,  the  positively 
charged  solitons  are  assumed  to  be  the  dominant 
carriers.  Moreover,  the  excitonic  energy  (i.e.,  the 
energy  of  these  carriers  relative  to  the  ground 
state)  is  taken  to  be  of  the  order  of  1  eV.  Also  the 
solitonic  charge  is  assumed  to  be  not  pinned  by 
the  dopant  counterions  and  moving  almost  freely 
along  the  direction  of  the  chain  axis.  In  the  present 
work  we  shall  try  to  justify  these  assumptions 
again  in  the  ab  initio  framework.  For  this,  we 
restrict  ourselves  to  the  case  of  a  single  iodine 
atom  interacting  with  a  frans-octatetraene  mole¬ 
cule.  This  represents  perhaps  the  simplest  proto¬ 
type  (computationally  speaking)  of  the  real  doped 
t-PA  system.  Upon  satisfying  ourselves  that  the 
underlying  assumptions  of  the  model  are  borne 
out,  we  reevaluate  the  transport  velocity  of  a  posi¬ 
tively  charged  octatetraene  kink  as  it  moves  along 
the  chain  axis  of  the  oligomer  Ci2Hj-„  under  the 
influence  of  an  external  voltage. 


Characterization  of  the  Iodine-doped 
Polyene  System 

In  carrying  out  the  investigation  as  outlined 
above  we  must  describe  adequately  both  the 
ground  and  excited  states  of  the  total  system  as 
the  charge  moves  along  its  length.  We  can  achieve 


this  by  using  the  complete  active  space  self-con¬ 
sistent  field  (CASSCF)  wavefunctions  built  out  of 
the  total  TT-orbital  space  treated  as  the  active  space. 
However,  the  CASSCF  process  runs  into  difficul¬ 
ties  here  because  of  the  weak  mixing  of  the  a-  and 
rr-orbitals  due  to  the  breakdown  of  the  "planar" 
symmetry  of  the  polyene  in  the  presence  of  the 
iodine  atom.  An  unrestrained  CASSCF  will  scram¬ 
ble  the  cr-  and  rr-orbitals  among  each  other  beyond 
recognition,  leading  to  undesirable  inclusion  of  the 
cr-correlation.  Moreover,  this  will  also  cause  the 
MCSCF  convergence,  particularly  for  the  excited 
states,  to  be  poor  since  the  partially  occupied  rr- 
orbitals  have  orbital  energies  that  are  very  similar 
to  the  higher  lying  or-orbitals.  In  this  work  we 
solve  this  problem  as  follows:  We  first  obtain  the 
occupied  cr-orbital  and  the  occupied  and  virtual 
TT-orbital  from  a  restricted  Hartree-Fock  (RHF) 
calculation  on  the  polyene  (without  the  iodine). 
We  also  get  the  occupied  cr-  and  Tr-orbitals  for 
iodine.  These  orbitals  are  then  combined,  ordered 
in  accordance  with  the  basis  set  of  the  combined 
system,  ICgHj^,  and  then  orthonormalized.  Let  us 
denote  this  set  as  {cr,°}  and  {77,°}.  We  now  carry  out 
an  open-shell  RHF  on  the  combined  system.  Let 
the  corresponding  occupied  and  unoccupied  or¬ 
bitals  be  denoted  as  {0^}.  We  now  transform  these 
orbitals  into  the  a-  and  Tr-like  orbitals  {<t,}  and  {tt,}, 
respectively,  as  follows: 

y  coc 

;■  c  oc  ^  ^ 

/c  Lin 

where  the  symbols  oc  and  un  denote  respectively 
the  occupied  and  virtual  orbitals.  These  orbitals 
{<T,}  and  {tt,}  are  then  used  as  the  starting  orbital 
sets  for  the  CASSCF  runs  where  we  impose  the 
additional  constraint  that  the  cr-like  orbitals  mix 
only  within  the  space  that  is  orthogonal  to  the 
TT-like  orbitals,  while  the  tt's  mix  only  among 
themselves. 

Since  an  analytic  approach  of  geometry  opti¬ 
mization  is  ruled  out  because  of  the  constraints  on 
CASSCF,  in  order  to  simplify  the  geometry  search 
process,  we  make  the  following  further  approxi¬ 
mations:  First,  we  truncate  the  rr-space  to  consist 
only  of  the  lowest  N  (the  number  of  "heavy" 
atoms)  TT-orbitals,  thus  obviating  the  need  of  the 
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rotation  of  the  7r-orbitals  during  the  CASSCF  opti¬ 
mization.  The  error  due  to  this  constraint,  we  be¬ 
lieve,  is  minimal  as  far  as  the  geometry  optimiza¬ 
tion  is  concerned  since  the  changes  of  geometry  in 
our  case  bring  about  mostly  bond  breaking  and 
forming  without  much  of  a  participation  of  the 
nonvalence  7r-space  (which,  in  the  case  of  a  mini¬ 
mal  basis,  as  we  choose  for  the  first  stage  of  our 
calculation,  is  absent  anyway).  This  approximation 
clearly  reduces  the  orbital  optimization  problem  to 
simply  the  optimization  of  the  closed-shell  a- 
orbitals.  Also,  in  the  search  for  the  minimum  in  the 
potential  energy  surface,  we  avoid  a  complete  ge¬ 
ometry  optimization  by  keeping  frozen  all  the 
lengths  of  the  truly  (r-bonds  such  as  CH  through¬ 
out  at  their  values  obtained  for  the  polyene  in  its 
ground  state.  Moreover,  we  assume  that  the 
polyene  molecule  remains  strictly  planar.  This  is 
presumably  a  good  approximation  since  in  both 
the  ground  and  excited  states  iodine  is  quite  far 
away  from  the  plane  of  the  polyene  molecule, 
making  it  highly  unlikely  for  the  latter  to  be  signif¬ 
icantly  nonplanar. 


Calculations  and  Results  on  the 
Polyene  Plus  Iodine  System 

For  studying  the  influence  of  a  counterion,  in 
this  case  1",  we  adopt  the  following  basis:  We  treat 
iodine  in  the  effective  core  potential  (ECP)  frame¬ 
work  using  the  Hay-Wadt  ECP  basis.  For  octate- 
traene  we  use  the  STO-3G  basis,  which  is  known 
to  lead  to  a  reasonably  good  geometry  for  such 
systems. 

In  Figure  1  we  plot  the  bond  lengths  of  the 
system  I  -h  CsH^q  in  its  ground  and  first  excited 
states.  It  is  evident  tha  the  ground-state  geometry 
is  very  close  to  that  of  the  undoped  Peierls  alter- 


FIGURE  1.  Bond  distances  (in  bohrs)  for  the  ground 
(a)  and  excited  (b)  states  of  CgH^ol. 


nating-bond  polyene,  while  the  excited  state  corre¬ 
sponds  to  a  solitonic  kink  (a  soliton-antisoliton 
pair).  The  Mulliken  charges  carried  by  iodine  in 
these  states  are  respectively  -0.033  and  -0.694 
and  the  excitonic  energy  is  0.76  eV.  Table  I  shows 
the  excited-state  energy  and  the  Mulliken  charge 
on  iodine  as  functions  of  the  distance  of  iodine 
from  one  end  of  the  octatetraene  (all  other  geome¬ 
try  parameters  being  optimized  within  the  restric¬ 
tions  described  above).  It  is  evident  that  the  ex¬ 
cited-state  energy  is  quite  insensitive  to  where  the 
iodine  is  located.  It  is,  however,  interesting  to  note 
that  the  charges  on  iodine  vary  significantly  as  the 
solitonic  kink  moves  away  from  iodine. 

We  conclude  from  the  above  calculations  that 
the  assumptions  underlying  the  modeling  of  charge 
transport  in  terms  of  a  traveling  charged  soliton 
are  justified,  at  least  at  the  level  of  calculations 
described  above.  Once  the  geometry  is  optimized 
for  both  the  ground  and  excited  states,  we  recom¬ 
pute  the  excitation  energy  at  the  level  of  the  3-21G 
basis  for  all  the  atoms.  The  difference  of  0.07  eV  in 
the  excitation  energies  going  from  STO-G  to  3-21G 
(shown  in  parentheses  in  Table  I)  is  sufficiently 
small  such  that  the  general  conclusions  drawn 
here  based  on  STO-3G  are  very  likely  sound. 


Evaluation  of  Charge 
Transport  Velocity 

Calculations  on  the  charge  transport  are  now 
carried  out  by  assuming  a  positively  charged  larger 
polyene,  in  our  case,  Ci2H^4.  The  model  of  charge 
transport  that  we  pursue  here  is  similar  to  the  one 
used  in  [2].  This  is  diagrammatically  sketched  in 


TABLE  I _ 

CASSCF  excitonic  energy  of  the  first  excited  state 
and  Mulliken  charge  on  iodine  as  functions  of  iodine 


distance  measured  from  the  “left” 

-most  carbon.® 

Excitonic  energy 

Mulliken 

x{A) 

(hartrees) 

charge  on  1 

0.44 

0.03321 

-0.616 

0.94 

0.0331 1 

-0.611 

1.44 

0.03069 

-0.651 

1.94 

0.02954 

-0.706 

2.44 

0.02854  (0.03111) 

-0.694 

2.94 

0.03302 

-0.612 

^The  ground  state  CASSCF  energy  =  -316.21123  hartrees 
at  the  ground-state  optimized  geometry. 
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Figure  2.  We  review  below  the  steps  involved  in 
calculating  the  velocity  of  transport  of  charge,  (i) 
First  we  obtain  the  ground-state  wavefunction  of 
the  total  neutral  system,  in  this  case,  C  12^14-  This 
is  done  at  the  RHF  level,  (ii)  The  77-orbitals  {77,}  are 
localized  in  the  following  marmer:  We  consider 
two  sets  of  elementary  localized  77-orbitals,  one 
consisting  of  all  the  bonding  orbitals  and  the  other 
of  antibonding  ones  belonging  to  each  double  bond. 
These  are  symmetrically  orthonormalized.  We  de¬ 
note  the  resulting  orbitals  as  {77^^}  and  {77^''}  respec¬ 
tively.  The  77-orbitals  obtained  from  step  (i)  are 
now  projected  on  to  these  orbitals: 

’  (2) 

^;(un)= 

i 

(iii)  A  part  of  the  oligomer  is  now  selected  (to  be 
called  the  ''charged  unit")  as  carrying  the  charge 
and  a  restricted  open-shell  Hartree-Fock  (ROHF) 
calculation  is  carried  out  where  only  the  77-orbitals 
belonging  to  the  charged  unit  are  optimized  with 
the  constraint  that  they  be  orthogonal  to  the  rest  of 
the  77-orbitals.  The  cr-orbitals  are,  however,  opti¬ 
mized  for  all  geometries.  This  is  followed  by  a 
relocalization  of  the  77-orbitals  on  the  charged  unit 
(along  the  lines  indicated  above).  A  geometry  opti¬ 
mization  of  the  total  system  is  now  carried  out 
using  a  singles-and-doubles  configuration  interac¬ 
tion  (Cl)  where  only  the  77-electrons  of  the  charged 
unit  are  promoted.  This  will  give  us  both  the 
equilibrium  geometry  and  the  normal  modes  with 
the  charge  located  within  the  charge  unit  selected. 

(iv)  The  total  wavefunction  is  now  constructed  as  a 
linear  combination  of  the  global  Born-Oppen- 


FIGURE  2.  Scheme  for  modeling  transport  of  charge 
along  the  backbone  of  a  0^2^14  radical  cation. 


heimer  (GBO)  products  G^,  each  defining  a 
"charge  imit"  [9].  The  matrix  elements  and  over¬ 
laps  are  given  by: 

s;.^.  =  <<D,|<Dp<G,|Gp,  ^ 


where  and  are  respectively  the  electronic 
Hamiltonian  and  electronic  energy  of  each  charge 
unit  and  the  electron  integrals  are  computed  at  the 
equilibrium  geometry  of  either  the  /th  or  the  jth 
charge  unit.  In  our  singles-and-doubles  representa¬ 
tion  of  the  electronic  wavefunctions  (see  above), 
assuming  that  and  Oy  are  the  same  except  for  a 
translation  that  transforms  O,  into  or  vice  versa, 
it  is  easy  to  show  that,  to  a  good  approximation 
(neglecting  virtual  space  contribution), 

a 

<<I),|  He,  -  f’ell'fy)  ^  E  -  ^e,)  (4) 

a 


a 

n„  =  ttIttI  ■■■  ••• 

g;  =  <77jHe||7r„> 

(v)  The  general  solution  of  the  time-dependent 
problem  is 


iIi(r,R,  t)  = 

A 

If  we  assume  that  at  f  =  0  the  system  is  in  the 
nonsteady  localized  state  that  is,  the  charge  is 
localized  in  the  first  charge  unit,  we  have 

EQA</>  =  i, 

(7) 

EQ/i<.^)  =  o, 


The  probability  that  at  a  later  time  the  charge 
resides  on  the  charge  unit  number  2  is  given  by 


^2  = 


4  E  QC,,  7l</M(/>sin2  f  (8) 


A,  A' 


Ih 
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assuming  both  and  to  be  real.  This  attains 
its  maximum  at  a  time  t  given  by 


T  =  0. 


(9) 


In  the  present  case  Eq.  (3)  involves  three  charge 
units,  leading  to  three  eigenvalues  of  which  the 
higher  two  are  nearly  degenerate.  Thus  Eq.  (9) 
leads  to 

'"■ft 

sin - 7 - T  =  0  or  ^  ^  (10) 

ft 


whence  the  transport  velocity  is  given  by 

V  =  0.256  X  (fl  in  bohrs)  X  (A £  in  eV)  X  10^  cm/s, 

(11) 

where  a  is  the  approximate  repetition  length  in 
polyacetylene  and  A  E  is  the  width  of  the  excitonic 
energy  band. 

We  shall  now  describe  our  calculations  on  the 
transport  of  charge  in  CjjHj,.  We  have  found  [9] 
the  choice  of  the  STO-3G  basis  for  these  calcula¬ 
tions  to  be  adequate  for  preliminary  estimates.  As 
shown  in  Figure  3,  the  general  features  of  the 
geometry  of  C  1211^4  optimized  at  the  semiempiri- 
cal  and  ab  initio  SCF  levels  with  STO-3G  and 
3-21G  basis  sets  are  very  similar.  For  the  case  at 
hand  the  values  of  the  various  overlaps  and  matrix 
elements  [Eq.  (3)]  are  shown  in  Table  II.  The  solu¬ 
tion  of  the  corresponding  secular  equation  yields 
the  following  energies:  Ej  =  —0.030,  £2  =  —0.002, 
and  £3  =  0.001  hartrees.  Hence  the  transport  ve¬ 
locity  is  calculated  to  be  approximately  0.9  X  10^ 
cm/s.  This  is  very  similar  to  the  value  obtained  in 


II  _ 

Values  of  S,'y 

and  H'n  [Eq.  3].® 

Parameters 

Values 

S12 

-0.78676 

5^3 

0.39086 

H\2 

- 1 .7222  ( X 1 0  ^  ^  hartrees) 

H’,3 

1.0981  ( X 1 0  "  ^  hartrees) 

^^23  and  H23  are  assumed  equal  to  5^2  and  H\2,  respec¬ 
tively. 

[9]  with  a  shorter  chain  with  a  pure  charged  soli- 
ton.  Thus  we  arrive  at  the  important  conclusion 
that  the  size  of  the  "charged  tmit"  does  not  seri¬ 
ously  affect  the  calculated  transport  velocity.  Such 
high  velocities  have  been  shown  [9]  to  be  crucial 
for  the  occurrence  of  high  conductivity  in  t-PA. 

As  we  have  noted  in  [9],  however,  the  charge 
transport  velocity  is  substantially  reduced  (by 
nearly  40%)  when  3-21G  basis  is  used  instead  of 
STO-3G  in  the  transport  calculation.  The  resulting 
value  is  then  in  approximate  agreement  with  the 
value  =  0.12  X  10^  cm/s  reported  by  Su  [6] 
based  on  the  SSH  Hamiltonian. 


Code  for  Computing  Charge  Transport 

We  have  automated  the  above  steps  by  devel¬ 
oping  a  code  and  installing  it  in  the  GAMESS 
environment.  A  flow  diagram  is  given  in  Figure  4. 
The  code  requires  as  input  the  occupied  and  vir¬ 
tual  orbitals  resulting  from  an  RHF  calculation  on 
the  neutral  backbone  system  (restricted  here  to  be 


FIGURE  3.  Geometry  of  C12H14  with  bond  distances  shown  in  bohrs  optimized  at  (a)  the  semiempirical  (AM1)  level 
and  the  ab  initio  SCF  level  with  (b)  STO-3G  and  (c)  3-21 G  bases. 
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planar).  The  code  then  localizes  the  7r-orbitals,  per¬ 
forms  new  ROHF  at  each  of  a  set  of  grid  points  as 
defined  by  the  input,  yielding  new  cr-orbitals,  and 
then  performs  singles-and-doubles  Cl  over  the 
localized  7r-orbitals.  The  single  and  double  ex¬ 
citations  are  chosen  so  as  to  represent  the  bond 
breaking  and  bond  forming  that  lead  to  the  input- 
specified  location  of  the  charged  kink.  These  Cl 
energies  are  then  used  to  perform  the  vibrational 
analysis.  If  any  of  the  frequencies  is  imaginary,  the 
system  is  displaced  in  the  direction  of  the  normal 
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FIGURE  4.  Flow  chart  for  global  solution  of  nnolecular 
Schrodinger  equation. 


mode  of  imaginary  frequency  with  the  largest  ab¬ 
solute  value  until  the  energy  attains  a  rough  mini¬ 
mum,  whereupon  the  above  steps  are  repeated 
again.  On  the  other  hand,  if  the  vibrational  fre¬ 
quencies  are  all  real,  the  new  equilibrium  geome¬ 
try  is  obtained  by  solving  for  the  displacements 
that  lead  to  zero  gradients  (in  the  quadratic  ap¬ 
proximation)  on  the  potential  surface.  If  the  devia¬ 
tion  between  the  new  and  old  geometries  is  larger 
than  a  given  threshold,  the  above  steps  are  re¬ 
peated.  Otherwise  the  geometry  is  assumed  con¬ 
verged.  The  corresponding  normal  modes  and  fre¬ 
quencies  are  then  used  to  obtain  the  overlaps  of 
vibrational  functions  according  to  Eq.  (4).  Also  the 
overlaps  between  the  electronic  wavefunctions  and 
the  corresponding  Hamiltonian  matrix  elements 
are  calculated  according  to  Eq.  (4).  The  global 
secular  equation  is  then  solved,  leading  to  the 
parameter  AE  of  Eq.  (11). 
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ABSTRACT _ — 

The  dependence  of  X-ray  photoionization  spectra  on  structural  characteristics  is  assessed 
by  one-particle  Green's  function  calculations  on  the  nearly  isomeric  adamantane, 
1,4-dimethyl  bicyclo  [2,2,2]  octane  and  1,1,4,4-tetramethyl  cyclohexane  compounds.  These 
calculations  are  carried  out  using  the  quasi-particle  approximation  and  the  diagonal 
2ph-TDA  renormalization  scheme  for  the  self-energy.  The  simulated  spectra  fingerprint 
remarkably  the  atomic  connectivity  from  their  inner  valence  part,  through  the  interplay 
of  energy  degeneracies  and  the  influence  of  torsional  strains  on  electron  binding  energies. 
The  boundary  region  between  the  inner  and  outer  valence  bands  provides  also  specific 
signatures  for  molecular  characteristics  pertaining  to  conformational  questions,  namely 
the  enforced  conversion  from  the  chair  to  the  boat  form  of  the  cyclohexane  units 
generating  these  compounds.  ©  1996  John  Wiley  &  Sons,  Inc. 


Introduction 

Photoelectron  spectroscopy  has  been  success¬ 
fully  used  for  more  than  two  decades  to 
investigate  the  electronic  structure  and  the  chemi¬ 
cal  bonding  characteristics  of  molecules  in  the  gas, 
liquid,  and  solid  phases.  As  is  well  known,  this 
spectroscopy  gives  a  direct  access  from  core  level 
shifts  to  the  chemical  composition  and  inhomo¬ 
geneity  in  the  top  layers  of  organic  and  inorganic 

*To  whom  correspondence  should  be  addressed. 


solids.  It  is  as  such  one  of  the  most  currently  used 
analytical  devices.  On  the  other  hand,  the  valence 
bands  have  received  so  far  much  less  attention.  By 
comparison  with  the  results  of  reliable  quantum 
mechanical  calculations,  however,  valence  pho¬ 
toionization  spectra  can  also  provide  [1]  invaluable 
information  on  the  molecular  architecture  of  the 
investigated  compounds.  This  comprises  not  only 
the  overall  atomic  connectivity  (primary  structure 
or  configuration)  but  also  the  possibility  [2-7]  of 
tracing  more  subtle  changes  of  the  molecular  con¬ 
formation  (or  secondary  structure).  Owing  to  its 
high  sensitivity  to  the  mixing  of  orbitals  through 
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the  interplay  of  cross-section  effects.  X-ray  pho¬ 
toionization  spectroscopy  (XPS)  is  particularly 
well-suited  to  fingerprint  molecular  conforma¬ 
tions. 

Recent  theoretical  investigations  on  the  elec¬ 
tronic  structure  of  large  saturated  hydrocarbons 
have  disclosed  evidence  for  a  mixing  of  the  €2^ 
and  C2P  +  Hjj  states  at  the  border  of  the  iimer  and 
outer  valence  bands,  due  to  the  development  of 
long-range  methylenic  hyperconjugation  effects 
over  zig-zag  planar  chains  [6]  or  related  cyclic 
structures  [7].  These  interactions  yield  highly  spe¬ 
cific  XPS  conformational  signatures  in  a  binding 
energy  interval  comprised  between  14  and  22  eV, 
which  could  be  for  instance  exploited  [8]  in  a 
definite  identification  [9]  of  the  leading  fold  stack¬ 
ing  sequence  in  the  [llOj's  growth  sector  of 
polyethylene.  These  studies  have  generated  fur¬ 
ther  investigations  on  polypropylene  [10]  or  self- 
assembled  layers  of  alkane-thiol  compounds  [11]. 
Long-range  methylenic  hyperconjugation  have  also 
provided  a  consistent  explanation  [12]  for  the  ori¬ 
gin  of  valence  XP  spectra  differences  of  M-tride- 
cane  in  solid  and  gas  phases,  considering  the  ther¬ 
mal  agitation  prevailing  in  the  gas  phase. 

Previous  works  [13]  have  shown  the  value  of 
studying  gas-phase  model  molecules  to  provide  a 
detailed  analysis  of  the  core  level  structure  of 
polymers.  Their  virtue  resides  not  only  in  the 
higher  resolution  achieved  in  gas  phase  but  also  in 
the  separation  of  intramolecular  characteristics 
(chemical  bonding,  vibrational  influences  on  the 
line  profiles,  origin  of  the  nonstoichiometric  ratios 
of  peak  areas)  from  intermolecular  interactions 
(polarization  of  neighboring  molecules,  phonon 
broadening,  structural  disorder,  etc.)  and  related 
solid-state  effects  limiting  the  resolution  (surface 
charging).  One  limitation  with  n-alkanes  samples 
is  that  the  chains  adopt  different  conformations  in 
the  gas  phase,  the  proportion  of  which  is  hard  to 
define.  As  a  result  of  the  ring  closure  constraints, 
the  cycloalkane  compounds,  at  least  the  smallest 
ones,  exhibit  in  comparison  a  much  better  defined 
molecular  architecture.  The  rigid  chair  form  pre¬ 
vailing  for  cyclohexane  is  in  particular  very  well- 
suited  to  investigate  the  effects  of  long-range 
methylenic  hyperconjugation.  As  shown  in  [7,  9, 
14],  a  cyclic  conversion  from  the  chair  to  the  boat 
or  twisted-boat  forms  of  cyclohexane  could  be 
easily  followed  from  XPS  measurements,  if  the 
latter  conformer,  lying  4.2  kcal  above  the  global 
energy  minimum,  could  be  stabilized  and  isolated. 


which  is,  to  our  knowledge,  impossible  in  practice 
nowadays.  Nevertheless,  this  conversion  can  be 
followed  indirectly,  by  enforcing  the  chair  and  the 
boat  forms  of  cyclohexane  in  polycycloalkane 
structures. 

Thus,  in  the  present  study,  we  have  chosen  to 
consider  the  adamantane,  the  1,4-dimethyl  bicyclo 
[2,2,2]  octane  and  the  1,1,4,4,-tetramethyl  cyclohex¬ 
ane  compoimds,  which  are  of  very  close  chemical 
nature  and  nearly  isomeric  pairwise,  but  strongly 
differ  in  their  overall  connectivity.  A  few  ultravio¬ 
let  photoelectron  spectra  are  available  [15]  for 
adamantane  and  hetero  derivatives,  in  studies 
which  essentially  focus  on  the  outer  valence  re¬ 
gion.  Except  for  a  recent  work  [16]  based  on  outer 
valence  Green's  function  (OVGF  [17])  calculations, 
their  interpretation  has  been  advanced  by  compar¬ 
ison  with  semi-empirical  theoretical  data.  No  mea¬ 
surement  of  the  inner  valence  structure  of 
adamantane  from  X-ray  photoionization  spec¬ 
troscopy  has  been  reported  to  date.  There  is  also, 
to  our  knowledge,  no  experimental  record  avail¬ 
able  for  the  1,4-dimethyl  bicyclo  [2,2,2]  octane  and 
the  1,1,4,4-tetramethyl  cyclohexane  compounds. 

In  order  to  evaluate  on  correlated  grounds  XPS 
fingerprints  for  the  molecular  structure  of  polycy¬ 
cloalkanes,  the  valence  ionization  spectra  of  these 
compounds  have  been,  in  this  preliminary  study, 
evaluated  using  simple  one-particle  Green's  func¬ 
tion  schemes  in  the  quasi-particle  approximation 
[6,  7]. 


Model  Systems  and 
Computational  Aspects 

Among  the  many  saturated  hydrocarbon  com¬ 
pounds  available  today,  adamantane  has  the 
unique  cage  structure  of  a  tetrahedron  of  CH 
groups  embedded  in  an  octahedron  of  CHj  groups. 
The  basic  pattern  found  in  this  molecule  is  the 
chair  form  of  cyclohexane,  which  is  consistently 
reproduced  along  each  of  the  3-fold  axis  of  its 
symmetry  group.  The  1,4-dimethyl  bicyclo  [2,2,2] 
octane  compound  has  been  selected  in  this  study, 
in  order  to  build  with  the  same  number  of  carbons 
as  adamantane  a  rigid  cage  structure  of  Dj,,  sym¬ 
metry  based  on  the  boat  form  of  cyclohexane.  The 
chair  form  of  cyclohexane  is  most  conveniently 
recovered  from  the  latter  structure  by  considering 
the  closely  related  1,1,4,4-tetramethyl  cyclohexane 
compound  of  symmetry. 
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The  photoelectron  spectra  of  these  compounds 
have  been  computed  on  the  basis  of  RHF/STO-3G 
optimized  geometries.  The  molecular  structures 
studied  are  shown  in  Figure  1.  Previous  compar¬ 
isons  carried  out  on  linear  alkanes  [6,  7,  9]  or 
cycloalkanes  [9]  have  pointed  the  adequacy  of  sim¬ 
ple  Green's  fimction  calculations  carried  out  on  the 
assumption  of  a  quasi-particle  picture  of  ioniza¬ 
tion,  when  has  been  recently  assessed  [14]  by  a 
detailed  series  of  Green's  function  calculations  ac- 
coimting  for  multistate  interactions  in  an  algebraic 
diagrammatic  construction  (ADC)  scheme  [18]  be¬ 
ing  exact  through  third  order  in  correlation.  As 
shown  in  this  study,  the  idea  of  a  one-particle 
process  for  the  ionization  of  saturated  hydrocar¬ 
bons  can  be  retained  nearly  over  the  whole  valence 
region,  up  to  a  general  threshold  of  about  23  eV. 
Beyond  that  threshold,  the  innermost  ionization 
lines  in  the  region  tend  to  break  into  many 
shake-up  lines  of  comparable  strength,  as  a  result 
of  severe  hole-mixing  effects  [19]  in  the  molecular 
cation.  Particularly  relevant  for  the  present  pur¬ 
pose  is  that  the  main  spectral  features  fingerprint¬ 
ing  the  development  of  long-range  methylenic  hy¬ 
perconjugation  remain  free  [18]  of  satellites.  At 
this  point,  it  is  useful  to  recall  that  configurational 
and  conformational  signatures  in  the  XPS  spectra 
of  saturated  hydrocarbons  find  essentially  their 


origin  in  the  nodal  characteristics  of  the  molecular 
orbitals.  These  signatures  can  therefore  be  consis¬ 
tently  analyzed  [6a,  7]  in  terms  of  the  relative 
distribution  of  main  lines,  which  appears  to  be 
mostly  unaffected  [18]  by  basis-set  effects. 

As  the  Green's  function  is  in  this  case  domi¬ 
nantly  diagonal,  one  can  legitimately  consider  re¬ 
lated  forms  of  the  second-order  self-energy  de¬ 
rived  from  a  renormalization  [20]  of  excitation 
energies  through  first-order  in  correlation  in  the 
so-called  diagonal  2ph-TDA  (two-particle-hole 
Tamm-Dankoff  Approximation)  scheme  [21].  In 
spite  of  inherent  drawbacks  such  as  a  loss  of  size 
consistency  [22],  this  scheme  has  enabled  [6a,  7,  9] 
overall  a  satisfactory  reproduction  of  the  photoion¬ 
ization  spectra  of  saturated  hydrocarbons,  in  re¬ 
gards  with  the  relative  positions  of  peaks  and  their 
intensity  or  the  width  and  shape  of  bands.  Accu¬ 
rate  absolute  values  are  beyond  reach  with  this 
approximation,  as  a  result  of  the  neglect  of  elec¬ 
tron  correlation  in  the  reference  ground  state, 
yielding  typically  an  underestimation  of  the  elec¬ 
tron  binding  energies  by  about  1  eV.  This  ap¬ 
proach  has  been  preferred  to  other  schemes  of 
decoupling  of  the  many-body  problem,  like  the 
second-order  (MBGF2)  or  shifted  Born  collision 
(SBC)  approximations  [21]  for  the  self-energy.  The 
reason  of  this  choice  is  that  for  linear  alkanes,  it 


FIGURE  1.  Selected  structures:  (a)  adamantane,  (b)  1 ,4-dinnethyl  bicycle  [2,2,2]  octane,  and  (c)  1,1,4,4-tetramethyl 
cyclohexane. 
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has  enabled  [6b,  9]  a  calculation  of  the  relative 
positions  of  inner  valence  ionization  lines  within 
less  than  4%  of  error  with  respect  to  experiment. 

Photoionization  intensities  have  been  evaluated 
using  the  parametric  Gelius  model  [23]  for  molecu¬ 
lar  orbital  cross  sections;  the  relative  atomic  pho¬ 
toionization  cross  sections  used  for  C2,,  and 
being  100.0,  7.69,  and  0.00,  respectively  (core 
atomic  functions  do  not  significantly  participate  in 
the  molecular  orbitals  of  the  valence  region).  Al¬ 
though  this  simple  model  implies  the  use  of  a 
minimal  basis  set,  it  reflects  nicely  [9,  14]  the 
influence  of  the  mixing  of  states  at  the 

border  of  the  inner  and  outer  valence  regions  aris¬ 
ing  with  the  development  of  methylenic  hypercon¬ 
jugation  effects  [6a]  on  the  calculated  photoioniza¬ 
tion  intensities.  For  a  better  account  of  many-body 
effects,  the  Gelius  cross  sections,  obtained  from  an 
uncorrelated  model,  are  multiplied  by  the  corre¬ 
sponding  pole  strength  factors  F^,  as  obtained  from 
a  one-electron  Green's  function  calculation.  More 
details  on  the  technical  aspects  of  these  calcula¬ 
tions  can  be  found  in  [6b]. 

The  simulated  XPS  spectra  shown  in  this  work 
are  constructed  from  a  superposition  of  peaks  cen¬ 
tered  at  the  Koopmans  and  2ph-TDA/QP  values 
for  electron  binding  energies,  respectively.  In  these 
spectra,  the  ionization  lines  have  been  convoluted 
using  a  spread  function  based  on  a  linear  combina¬ 
tion  of  one  gaussian  and  one  lorenzian  curve  with 
equal  weight  and  a  full  width  at  half  the  maxi¬ 
mum  of  1.1  eV,  in  order  to  account  roughly  for 
vibrational  broadening,  as  well  as  the  experimen¬ 
tal  resolution  characteristics  allowed  with  the  most 
recent  gas-phase  X-ray  spectrometers  [24]. 


Resells  and  Diseiissioii 

GEXERAf.  OBSERVATSOAS 

The  convoluted  spectra  of  the  selected  com¬ 
pounds  are  displayed  in  Figure  2  at  the  Koopmans 
and  diagonal  2ph-TDA/QP  levels  of  approxima¬ 
tion.  In  the  latter  case,  the  computed  main  lines 
are  also  given  as  a  spike  spectrum.  The  detailed 
molecular  orbital  cross  sections  and  Green's  func¬ 
tion  data  used  to  construct  these  convolutions  are 
given  in  Table  I,  together  with  results  obtained 
using  other  decoupling  schemes  of  the  many-body 
problem.  Except  for  a  shift  and  a  contraction  of 
bands  toward  lower  binding  energies,  the  Koop¬ 


mans  and  diagonal  2ph-TDA/QP  spectra  are  over¬ 
all  essentially  similar,  a  result  of  a  balance  in  the 
convolution  procedure  between  the  relaxation  ef¬ 
fects  on  binding  energies  and  the  dispersion  of 
photoionization  intensity  into  secondary  processes 
[1,  6a]. 

Two  bands  of  rather  different  intensity  can  be 
distinguished  on  the  selected  energy  scale,  justify¬ 
ing  a  somewhat  arbitrary  [6a]  classification  of  the 
valence  molecular  orbitals  of  saturated  hydrocar¬ 
bon  compounds  in  terms  of  inner  valence  levels 
dominated  by  €2^  atomic  contributions  and  of 
outer  valence  levels  of  leading  (€2^,  +  charac¬ 
ter.  Owing  to  their  large  number  and  to  their 
diffuse  character,  the  latter  molecular  orbitals  yield 
poorly  structured  bands  of  low  photoionization 
intensity,  from  which  it  is  usually  difficult  to  ex¬ 
tract  specific  information  on  the  molecular  struc¬ 
ture.  Therefore,  in  this  contribution,  attention  is 
mainly  focused  on  the  inner  valence  levels. 

As  shown  (Table  I)  by  the  values  obtained  for 
pole  strengths,  the  molecular  orbital  picture  is  well 
preserved  nearly  over  the  whole  range  of  binding 
energies,  justifying  the  use  of  diagonal  approxima¬ 
tions  for  the  self-energy  and  its  kernel.  Great  care 
has,  however,  to  be  exercised  for  a  definite  conclu¬ 
sion,  as  a  minimal  STO-3G  basis  set  yields  shake-up 
states  of  artificially  too  high  energy  and  tends 
therefore  to  limit  the  fragmentation  of  ionization 
lines  into  satellites.  In  direct  relationships  with  the 
results  given  in  [6b,  9]  and  [14],  the  pole  strengths 
obtained  for  the  innermost  levels  of  the  investi¬ 
gated  compounds  drop  significantly  at  the  2ph- 
TDA  level  of  approximation,  an  observation  which 
suggests  a  complete  breakdown  of  the  molecular 
orbital  picture  of  ionization  of  an  electron  out  of 
those  levels  in  Green's  function  calculations  of  the 
ADC  type,  based  on  larger  basis  sets.  Since  these 
levels  fall  far  apart  from  each  other,  their  fragmen¬ 
tation  tail  should  simply  yield  a  smoothing  and 
broadening  of  the  convoluted  band  profile  at  the 
bottom  of  the  inner  valence  region.  As  in  [9]  and 
[14],  the  idea  of  a  one-particle  process  of  ionization 
can  be  on  the  other  hand  reasonably  retained  be¬ 
low  an  approximate  threshold  of  23  eV  on  electron 
binding  energies. 

In  Figure  3,  we  compare  our  theoretical  results 
to  the  imier  valence  binding  energy  data  obtained 
by  R.  Boschi  et  al.  [15e]  from  He(II)  photoelectron 
spectroscopy  measurements  on  adamantane.  As 
clearly  shown  by  the  slope  of  linear  correlations 
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TABLE  I  _ _ 

Theoretical  electron  binding  energies  (in  eV)  and  Gelius  XPS  molecular  orbital  cross  sections  (pole  strengths 
in  brackets). 


Molecule 

level 

Koopmans 

QP2 

SBC 

Diagonal  2ph-TDA 

cr 

A(damantane  (7^) 

la. 

30.32 

27.10(0.880) 

26.16(0.804) 

25.40  (0.551) 

77.39 

1^2 

26.98 

24.21  (0.897) 

23.45  (0.840) 

22.90  (0.789) 

88.52 

1e 

23.58 

21.40(0.915) 

20.78  (0.869) 

20.34  (0,830) 

88.89 

2f2 

20.57 

18.56  (0.921) 

18.00  (0.880) 

17.62  (0.847) 

73.69 

2a, 

18.18 

16.46  (0.929) 

15.99(0.894) 

15.67  (0.867) 

25.72 

3a, 

16.40 

14.92(0.938) 

14.53  (0.907) 

14.26  (0.885) 

51.26 

3^2 

15.79 

14.27  (0.936) 

13.87  (0,906) 

13,60  (0.884) 

11.45 

% 

14.21 

12.59  (0.933) 

12.18(0.904) 

11,91  (0.883) 

7,01 

4^2 

13.69 

12.09(0.935) 

11.69(0.907) 

1 1 .43  (0.887) 

15.13 

2f, 

11.44 

9.82  (0.934) 

9.44  (0.908) 

9.10(0.889) 

9.34 

2e 

11.42 

9.73  (0.932) 

9.34  (0.905) 

9.10(0.887) 

16.44 

5^2 

9.76 

8.22  (0.937) 

7.89(0.913) 

7.68  (0.896) 

12.45 

1,4-Dinnethyl  bicycio 
[2,2,2]  octane  (D3/,) 

la'. 

30.09 

26.87  (0.880) 

25.93  (0.800) 

25.31  (0.135) 

79.21 

la'2 

27.64 

24.81  (0.894) 

24.02  (0.833) 

23.44  (0.761) 

87.05 

1e' 

26.00 

23.52  (0.904) 

22.81  (0.850) 

22.29  (0.796) 

87.67 

2a', 

24.84 

22.79(0.918) 

22.23  (0.873) 

21.81  (0.832) 

82.59 

2  82 

22.99 

21.12(0.925) 

20.62  (0.887) 

20.25  (0.852) 

90.83 

1e" 

21.70 

19.72(0.921) 

19.16(0.879) 

18.77(0.844) 

77.59 

3  a'-, 

19.87 

17.89(0.922) 

17.36  (0.883) 

16.99  (0.851) 

69.10 

3  a" 

16.97 

15.35(0.935) 

14.94(0.904) 

14.65  (0.880) 

74.14 

2e' 

16.75 

15.16(0.932) 

14.72(0.900) 

14.44  (0.876) 

6.59 

4  a'-, 

16.02 

14.23(0.926) 

13.78  (0.894) 

13.48(0.870) 

15.18 

2e" 

14.54 

13.32  (0.945) 

13.00  (0.922) 

12.80  (0.905) 

9.34 

la'2 

14.47 

13.12(0.942) 

12.76  (0.915) 

12.53  (0.897) 

5.15 

3e' 

14.47 

13.19(0.944) 

12.86(0.920) 

12.66  (0,903) 

6.32 

2e" 

12.93 

1 1 .70  (0.946) 

1 1 .38  (0.923) 

11.18(0.907) 

5.61 

4e' 

12.85 

11.12(0.933) 

10.79(0.905) 

10.54  (0.885) 

16.61 

4a'2 

12.13 

10.29(0.926) 

9.88  (0.899) 

9.62  (0.879) 

17.51 

la',' 

11.61 

10.41  (0.948) 

10.11  (0.926) 

9.93  (0.911) 

5.39 

5e' 

11.57 

9.90  (0.932) 

9.52  (0.906) 

9.28  (0.887) 

10.10 

Sa'-i 

9.90 

8.22  (0.931) 

7.86  (0.907) 

7.64  (0.889) 

19.60 

3e" 

9.76 

8.17(0.935) 

7.80  (0.910) 

7.59  (0.893) 

14.71 

1,1,4,4-Tetramethyl 
cyclohexane  (C2/,) 

lag 

29.61 

26.58  (0.887) 

25.77  (0.822) 

25.14(0.742) 

80.97 

28.05 

25.27  (0.895) 

24.55  (0.839) 

24.01  (0.778) 

85.42 

1a„ 

26.09 

23.61  (0.906) 

22.96  (0.856) 

22.48  (0.812) 

87.35 

25.15 

22.97  (0.915) 

22.39  (0.870) 

21 .98  (0.830) 

84.93 

2b, 

24.31 

22.50  (0.926) 

22.02  (0.888) 

21.68  (0.856) 

80.09 

Sag 

24.18 

22.39  (0.927) 

21.92  (0.891) 

21.60  (0.860) 

83.15 

3b, 

22.59 

20.69  (0.925) 

20.20  (0.888) 

19.86  (0.857) 

87.51 

21.77 

19.79  (0.921) 

19.27(0.882) 

18.90  (0.850) 

76.24 

4ag 

19.45 

17.56  (0.925) 

17.10(0.892) 

16.78  (0.865) 

71.85 

^b. 

18.06 

16.44(55.18) 

16.04(0.905) 

15.77  (0.881) 

59.09 

Sag 

16.35 

14.80  (0.934) 

14.42(0.907) 

14.17(0.885) 

10.13 
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TABLE  I  _ 
(Continued). 


Molecule 

level 

Koopmans 

QP2 

SBC 

Diagonal  2ph-TDA 

a 

2ba 

16.07 

14.62(0.939) 

14.25(0.912) 

14.01  (0.892) 

7.04 

9 

5by 

15.55 

14.14(0.941) 

13.79(0.914) 

13.57(0.895) 

20.24 

2a, 

15.51 

14.05  (0.938) 

13.70(0.912) 

13.47(0.893) 

7.74 

displayed  in  this  figure,  the  It 2,  1^,  2^2/  2fl^  HF 
one-electron  levels  of  adamantane  fall  in  an  energy 
range  that  is  too  large  by  about  36%  as  compared 
with  experiment,  a  defect  which  reduces  to  an 
error  of  20,  16,  and  12%  with  the  diagonal  second- 
order,  SBC,  and  2ph-TDA  results,  respectively. 
These  slopes  are  significantly  larger  than  the  ones 
obtained  in  the  same  conditions  [6b]  for  gas-phase 
samples  of  n-alkane  chains  (respectively,  HF:  1,27; 
QP2:  1.13;  SBC:  1.07;  2ph-TDA:  1.00),  reflecting  the 
enhanced  role  played  by  electronic  relaxation  ef¬ 
fects  in  a  polycyclic  structure.  This  relates  to  a 


lowering  of  the  fundamental  gap  as  a  result  of 
torsional  strains. 

INFLUEIVCE  OF  THE  ATOMIC  CONNECTIVITY 

Striking  differences  can  be  observed  in  these 
bands  from  one  compound  to  the  other,  confirm¬ 
ing  the  proposition  that  the  valence  region  of  XP 
spectra  can  be  successfully  used  to  extract  infor¬ 
mation  on  the  molecular  structure.  In  this  case,  the 
XPS  fingerprints  relate  to  combined  variations  on 


Experimental  Binding  Energy  (eV) 

FIGURE  3.  Correlation  between  theoretical  and  experinnental  [1 5e]  inner  valence  binding  energies  of  adamantane. 
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the  connectivity,  torsional  strains  in  polycyclic 
structures,  and  transannular  or  long-range  interac¬ 
tions,  an  observation  which  is  worth  further  analy¬ 
sis. 

The  inner  and  outer  valence  bands  fall  by  1.6  eV 
apart  in  the  ionization  spectrum  of  1, 1,4,4- 
tetramethyl  cyclohexane,  whereas  the  topmost  C2S 
level  of  1,4-dimethyl  bicyclo  [2,2,2]  octane  com¬ 
pound  lies  only  0.2  eV  above  the  first  outer  va¬ 
lence  level.  From  the  intensity  of  the  individual 
ionization  lines,  the  C2,  and  €2^  +  bands 
clearly  interpenetrate  in  the  XP  spectrum  of 
adamantane.  This  overall  behavior  relates  to  an 
enhancement  of  torsional  strains  with  the  closure 
and  fusion  of  one,  three,  and  four  six-membered 
rings,  respectively,  yielding  [9]  a  stronger  energy 
dispersion  of  the  €2^  ^nd  €2^,  +  one-electron 
levels. 

From  their  overall  aspect,  the  outer  valence 
bands  of  the  selected  compounds  present  some 
similarity,  with  the  emergence  of  four  rather  poorly 
resolved  substructures.  On  the  other  hand,  the 
atomic  connectivity  affects  heavily  the  overall  as¬ 
pect  of  the  convoluted  inner  valence  bands, 
through  the  interplay  of  energy  degeneracies  and 
line  positions.  Four  peaks  can  be  clearly  distin¬ 
guished  in  the  C2,  band  of  adamantane,  the  inten¬ 
sity  of  which  directly  reflects  the  degree  of  degen¬ 
eracy  of  the  corresponding  levels,  It 2,  le,  It 2, 

successively.  As  shown  in  Figure  2(a),  2a^  level 
falling  in  the  border  energy  range  with  the  outer 
valence  band  should  most  likely  remain  hidden  in 
the  convoluted  XPS  background.  The  1,4-dimethyl 
bicyclo  [2,2,2]  octane  compound  yields  six  peaks  in 
the  convoluted  C2,  band,  which  can  be  assigned  to 
the  following  sets  of  orbitals,  successively: 

(In;.  le\  and  2a\);  (24);  (le");  (3^);  (3^).  Only 
four  well-resolved  structures  emerge  from  the  in¬ 
ner  valence  band  of  the  1,1,4,4-tetramethyl  cyclo¬ 
hexane  compound,  which  from  their  energy  loca¬ 
tion  and  relative  intensity  remind  the  C2^  energy 
structure  of  the  1,4-dimethyl  bicyclo  [2,2,2]  octane 
compound. 

IXFLPENCE  OF 

IXTER4€;nOI\8 

The  boundary  energy  region  between  the  inner 
and  outer  valence  bands  exhibit  also  specific  dif¬ 
ferences  from  one  compound  to  the  other,  which 
cannot  be  easily  rationalized  considering  atomic 
connectivity  only.  These  changes  relate  to  a  modu¬ 
lation  of  long-range  interactions  with  variations  on 


the  molecular  architecture,  and  can  be  as  such 
regarded  as  probes  of  conformational  characteris¬ 
tics. 

From  the  computed  photoionization  cross  sec¬ 
tions  (Table  I),  the  2a^  orbital  of  adamantane  can 
be  shown  to  be  dominantly  of  (€2^  +  charac¬ 
ter,  whereas  the  3^^  orbital  is  clearly  dominated  by 
C25  atomic  contributions.  In  the  convoluted  XP 
spectrum  [Fig.  2(a)],  this  feature  is  fingerprinted 
by  the  disappearance  of  the  2a^  level  in  the  com¬ 
puted  background  and  by  the  emergence  of  a 
relatively  intense  peak  at  14.3  eV,  in  an  energy 
region  normally  assigned  to  the  outer  valence  band. 
As  for  the  chair  form  of  cyclohexane  [6b,  9]  or 
zig-zag  planar  chains  of  polyethylene  [6a,  9],  this 
particularly  strong  mixing  of  the  C2s/(C2^  + 
orbitals  at  the  border  of  the  inner  and  outer  va¬ 
lence  bands  typically  relates  to  the  development  of 
through-space  interactions  between  the  methylenic 
C~H  bond  (also  referred  as  cr-conjugation  effects 
[25]).  It  is  in  this  case  very  strongly  favored  by  the 
overlap  of  the  inner  and  outer  valence  energy 
ranges  due  to  the  ring  constraints  in  a  cage  struc¬ 
ture  on  the  one  hand,  and  on  the  other  hand  by  the 
topology  of  the  involved  orbitals,  2a^  and  3^^, 
enabling  a  hyperconjugation  pattern  fully  consis¬ 
tent  with  the  tetrahedral  symmetry  group.  In 
comparison,  when  considering  the  chair  form  of 
cyclohexane,  this  mixing  is  limited  by  an  energy 
interval  of  about  2.0  eV  between  the  inner  and 
outer  valence  regions,  and  by  the  presence  of  a 
nodal  surface  in  orbitals  belonging  to  the  fl2« 
ducible  representation  of  the  symmetry  group. 
Similar  considerations  prevail  with  zig-zag  planar 
forms  of  large  /^-alkane  chains,  in  which  the 
boundary  orbitals  of  the  inner  and  outer  valence 
regions  exhibit  also  a  nodal  surface  following 
roughly  the  carbon  backbone. 

Long-range  methylenic  hyperconjugation,  on  the 
other  hand,  is  forbidden  with  the  1,4-dimethyl 
bicyclo  [2,2,2]  octane  compound,  since  it  can  be 
regarded  as  a  fusion  of  boat  forms  [6b]  of  cyclo¬ 
hexane.  The  computed  photoionization  cross  sec¬ 
tions  differ  markedly  from  the  inner  and  outer 
valence  bands  (Table  I),  confirming  the  absence  of 
a  mixing  of  the  C2y(C2^  +  H^,)  character  of  the 
boundary  orbitals,  in  spite  of  their  very  weak 
energy  separation.  Accordingly,  the  3^2  orbital 
emerges  intensively  and  distinctively  at  the  top  of 
the  inner  valence  region,  2.35  eV  above  the  nearest 
inner  valence  energy  level,  corresponding  to  the 

orbital. 
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Methylenic  hyperconjugation  interactions  are 
recovered  with  the  1,1,4,4-tetramethyl  cyclohexane 
compound,  through  the  interplay  of  a  mixing  of 
the  C2,/(C2,,  Hi,)  character  of  the  4b„  and  5b„ 
orbitals,  as  reflected  by  their  XPS  photoionization 
cross  sections.  Besides  the  emergence  of  a  rather 
intense  peak  at  13.6  eV,  relating  directly  to  the  5b„ 
orbital,  this  mixing  contributes  to  a  net  stabiliza¬ 
tion  of  the  topmost  €2^  level  by  1.1  eV  in  compari¬ 
son  with  the  1,4-dimethyl  bicyclo  [2,2,2]  octane 
compound.  In  this  case,  the  highest  C2S  level  can¬ 
not  be  resolved  from  the  energy  level  below. 


Conclusions 

The  goal  of  this  work  was  an  assessment  of 
specific  fingerprints  of  the  molecular  architecture 
in  the  X-ray  photoionization  spectra  (XPS)  of  poly¬ 
cyclic  hydrocarbons,  the  adamantane  and  the  1,4- 
dimethyl  bicyclo  [2,2,2]  octane  compounds  and  of 
the  1,1,4,4-tetramethyl  derivative  of  cyclohexane. 
The  inner  valence  XP  spectra  of  these  molecules 
exhibit  striking  signatures  for  their  molecular  pri¬ 
mary  structure  (atomic  connectivity),  through  the 
interplay  of  electronic  degeneracies  and  the  influ¬ 
ence  of  cyclic  strains  on  binding  energies.  Owing 
to  the  sensitivity  of  X-ray  photoionization  cross 
sections  to  the  development  of  transannular  meth¬ 
ylenic  hyperconjugation,  these  spectra  can  also  be 
successfully  used  to  follow  the  conversion  from 
the  chair  to  the  boat  form  of  the  cyclohexane  unit 
on  which  these  compounds  are  based.  As  con¬ 
firmed  in  this  study,  the  boundary  region  between 
the  inner  and  outer  valence  bands  generate  spe¬ 
cific  signatures  for  molecular  characteristics  per¬ 
taining  to  conformational  questions. 

The  results  presented  in  this  study  have  been 
obtained  using  the  quasi-particle  approximation 
for  a  one-electron  Green's  function  scheme  based 
on  the  diagonal  2ph-TDA  renormalization  of  the 
self-energy.  Although  electronic  configuration  mix¬ 
ing  effects  in  the  cation  are  not  explicitly  consid¬ 
ered  in  this  approximation,  one  can  reasonably 
retain,  by  comparison  with  previous  works  on 
related  compounds,  the  idea  of  one-particle  picture 
of  ionization  nearly  over  the  whole  range  of  va¬ 
lence  binding  energies,  except  above  an  approxi¬ 
mate  threshold  of  23  eV  where  ionization  lines 
should  most  likely  split  into  dense  structures  of 
shake-up  lines.  To  cope  with  multistate  interac¬ 
tions,  as  well  as  with  electronic  correlation  in  the 


reference  ground  state,  one  should  consider  spe¬ 
cific  high-order  approximations  of  the  one-electron 
Green's  function  such  as  the  third-order  algebraic 
diagrammatic  construction  [ADC  (3)]  scheme. 

Another  limitation  in  our  simulations  result 
from  the  use  of  a  parametric  model  based  on  a 
noncorrelated  approach  for  the  evaluation  of  pho¬ 
toionization  cross  sections.  In  the  long-run,  pho¬ 
toionization  intensities  should  be  computed  using 
nonparametric  techniques  implemented  [26]  in 
the  framework  of  the  Green's  function  approach. 
Vibronic  coupling  effects  [27]  in  systems  with 
numerous  electronic  degeneracies  should  also  be 
considered. 
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ABSTRACT  - - 

Many  applications  involve  the  use  of  macromolecules  at  interfaces,  including  adhesion, 
lubrication,  and  the  manufacture  of  surfaces  resistant  to  wear.  Modeling  these  systems 
requires  an  understanding  of  how  the  dominant  conformational  features  of  a 
macromolecule  are  modified  by  the  presence  of  a  boundary.  In  this  work,  we  are 
interested  in  monitoring  the  shape  changes  that  accompany  the  configurational 
rearrangements  of  a  single  polymer  chain  near  a  hard  plane.  We  provide  a  detailed 
characterization  of  essential  molecular  shape  features  of  compressed  grafted  chains. 
These  chains  are  modeled  by  three-dimensional  (off-lattice)  walks  with  excluded- volume 
interaction,  anchored  to  a  flat  surface  by  a  terminal  point.  Compression  is  induced  by 
confining  the  chains  between  two  parallel  walls.  For  these  systems,  we  have  studied  how 
shape  descriptors  depend  on  polymer  size  and  the  separation  between  walls.  We  show 
that  compression  causes  a  number  of  nontrivial  changes  in  shape  (shape  transitions).  In 
particular,  we  show  that  the  most  compact  chains  found  during  compression  are  neither 
spheroidal  nor  maximally  self-entangled.  Maximal  entanglements  are  found  in  chains 
that  are  already  mostly  ^^flattened^'  toward  the  surface.  The  present  results  provide  new 
insights  that  can  be  useful  in  the  design  of  interfaces  with  targeted  properties.  ©  1996 
John  Wiley  &  Sons,  Inc. 


Introduction 

There  is  a  considerable  interest  in  modeling 
the  behavior  of  macromolecules  at  solid- 
liquid  interfaces  [1-3].  These  systems  find  many 
applications,  including  surface  wetting,  crystal 
growth,  adhesion,  film  spreading,  lubrication,  and 


oil  recovery.  The  design  of  size-exclusion  chro¬ 
matographic  phases,  filters,  and  surfaces  resistant 
to  wear  is  also  based  on  the  properties  of  confined 
polymers.  The  properties  of  macromolecules  at  thin 
interphases  are  also  central  for  understanding  the 
functions  of  biological  membranes. 

Several  models  have  been  proposed  for  study¬ 
ing  the  effect  of  boundary  conditions  on  chain 
molecules.  Two  typical  approaches  are  usually  fol- 
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lowed,  where:  (a)  macromolecules  are  anchored  at 
the  solid-liquid  interface  and  (b)  macromolecules 
form  a  liquid  interphase,  confined  between  tw'o 
solid  phases.  Some  specific  examples  have  consid¬ 
ered:  (i)  grafting  of  a  chain  to  a  surface  (i.e.,  termi¬ 
nal-point  chemisorption)  to  form  a  "brush"  [4-10], 
(ii)  adsorption  of  the  entire  chain  [5,  11-13],  (hi) 
macromolecules  geometrically  confined  within 
planar  impenetrable  surfaces,  e.g.,  slabs  [14,  15] 
and  prisms  [15],  (iv)  confinement  in  realistic  chan¬ 
nels  [16],  nanotubes  [17, 18],  liquid  layers  [19],  and 
(v)  adsorption  in  the  presence  of  an  irregular 
boundary  (e.g.,  a  pore)  [20].  All  these  models 
include  usually  chain-chain  and  chain-surface 
interactions  in  various  forms,  from  the  simple 
excluded-volume  interaction  [5]  to  detailed  hydro¬ 
gen-bonding  and  solvation  [21]. 

Recent  results  confirm  that  polymer  size  is 
greatly  affected  by  the  presence  of  a  surface  [22], 
its  texture  [23],  and  interphase  thickness  [24]. 
Computer  simulations  complement  these  observa¬ 
tions.  They  provide  us  with  a  microscopic  picture 
of  how  the  shape  and  size  of  the  dominant  chain 
configurations  are  modified  by  a  boundary. 

Computer  simulations  of  confined  chains  usu¬ 
ally  monitor  some  ad  hoc  molecular  size  features, 
from  the  components  of  the  radius  of  gyration  in 
preferential  directions  [10,  14,  19]  to  the  distribu¬ 
tion  of  end-to-end  distances  [25]  and  persistence 
length  [26].  Topological  properties  of  cyclic  poly¬ 
mers  (e.g.,  knot  probabilities)  have  also  been  stud¬ 
ied  in  terms  of  geometrical  confinement  [13,  15, 
27],  Yet,  a  systematic  study  of  how  distinct  shape 
features  are  affected  by  boundary  conditions  is 
lacking.  Here,  we  present  a  detailed  approach 
where  independent  aspects  of  geometrical  shape 
are  monitored  simultaneously. 

We  have  studied  how  distinct  molecular  shape 
features  of  grafted  chains  are  affected  by  compres¬ 
sion.  Only  linear  chains  are  considered  here.  Chains 
are  considered  in  "swollen"  (excluded-volume) 
configurations  and  they  are  modeled  by  off-lattice 
(self-avoiding)  walks,  grafted  to  a  plane.  Compres¬ 
sion  is  introduced  by  placing  the  chains  between 
tv\^o  "hard"  (impenetrable)  walls  with  variable 
separation.  (For  general  scaling  properties  of  ran¬ 
dom  walks  in  the  presence  of  a  surface,  see  Ref. 

[28]  and  others  therein.  Some  effects  of  confine¬ 
ment  on  branched  polymers  are  discussed  in  Ref 

[29] .) 

We  use  grafting  as  a  device  to  control  the  de- 
formability  of  the  chains  and  produce  reasonable 
cut-offs  on  the  "pressure"  that  can  be  applied  on 


them.  The  effect  of  grafting  on  the  configurational 
properties  of  long  chains  is  minor.  In  the  limit  of 
short  chains,  however,  grafting  and  excluded  vol¬ 
ume  allow  us  to  reproduce  the  usual  experimental 
observation  that  small  polymers  are  fairly  rigid 
and  difficult  to  deform  by  compression.  Our  pres¬ 
ent  goal  is  to  study  the  behavior  of  all  shape 
features  of  a  single  chain,  confined  to  an  inter¬ 
phase  with  variable  thickness.  Entanglements  with 
other  chains  and  the  possibility  of  full  chain  ad¬ 
sorption  are  not  considered  here.  Similarly,  we  will 
neglect  attractive  interactions  and  thus  eliminate 
the  possibility  of  forming  compact  globules.  Com¬ 
pactness  will  only  be  controlled  by  varying  the 
confinement  betw’^een  walls. 

For  these  model  chains,  we  have  computed  con¬ 
figurationally  averaged  descriptors  associated  with 
three  distinct  shape  features:  size,  anisometry,  and 
entanglement  complexity.  We  have  also  character¬ 
ized  the  occurrence  of  shape  transitions  due  to 
compression.  The  relation  between  self-entangle¬ 
ments,  anisometry,  and  polymer  size  has  recently 
been  studied  for  various  homopolymers  [30,  31] 
and  protein  backbones  [31-33].  In  this  work,  we 
show  that  the  interrelation  between  the  above 
shape  features  is  greatly  modified  when  a  bound¬ 
ary  is  taken  into  account.  In  contrast  with  "free" 
polymers,  compression  can  lead  to  structures  that 
are  compact  according  to  size,  but  whose  configu¬ 
rations  are  neither  spheroidal  nor  very  entangled. 

The  next  sections  deal  with  the  construction  of 
the  model  chains,  their  shape  characterization,  and 
the  onset  of  shape  transitions  by  compression  in¬ 
side  a  slab  with  impenetrable  walls.  We  show  that 
the  use  of  several  independent  shape  features, 
rather  than  a  single  descriptor,  can  provide  a  de¬ 
tailed  characterization  of  structural  changes. 


Grafted  Polymer  Mode!  with 
Confinement 

We  study  conformational  transitions  in  poly¬ 
mers  with  small  to  intermediate  size.  These  poly¬ 
mers  (with  typically  less  than  100  monomers)  com¬ 
prise  a  large  number  of  cases  with  practical  inter¬ 
est.  These  chains  are  well  represented  by  off-lattice 
models  [34].  Here,  we  use  a  "necklace"  model  (a 
Pearson  random  walk),  consisting  of  a  string  of  n 
identical  "beads,"  with  a  constant  bonded-bead 
distance  /  (Kuhn's  length).  We  simulate  the  effect 
of  a  "good  solvent"  [5,  35]  by  including  an  ex- 
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eluded-volume  interaction.  This  is  a  purely  repul¬ 
sive  interaction  characterized  by  the  condition  that 
all  nonbonded  monomers  must  be  at  a  distance 
larger  than  or  equal  to  the  radius  of  excluded 
volume.  [This  imposes  a  restriction  on  the  polymer 
shape:  in  the  limit  of  2/  the  chain  can  only 

be  a  rigid  rod.] 

Our  goal  is  to  explore  only  the  effect  of  confine¬ 
ment  on  grafted  chains  with  variable  length.  The 
parameters  /  and  r^^^are  kept  constant.  Our  pres¬ 
ent  choice  (/=  3.8  A  and  =  6  A)  generates 
chains  that  mimic  unfolded  (solvated)  proteins,  in 
absence  of  confinement.  A  smaller  value  is  not 
convenient  since  it  produces  compact  chains  that 
instead  resemble  polymers  in  poor  solvents.  Our 
main  results  depend,  however,  little  on  / . 

Grafting  and  confinement  are  simulated  as  fol¬ 
lows:  A  terminal  point  of  the  chain  is  attached  to  a 
planar  wall.  Then,  the  chain  is  allowed  to  '^grow 
by  random  walks  of  length  /  within  two  parallel 
(hard-plane)  walls  separated  by  a  distance  L.  All 
configurations  that  violate  the  condition  of  ex¬ 
cluded  volume  or  grow  outside  the  region  for 
confinement  are  rejected.  The  procedure  is  equiva¬ 
lent  to  a  Monte  Carlo  sampling  of  configurational 
space,  where  all  accepted  conformers  have  zero 
energy  and  all  rejected  conformers  have  infinite 
energy.  An  example  of  an  '"accepted'"  (n  =  100) 
configuration  is  given  in  Figure  1.  Successful  con¬ 
figurations  will  be  indicated  by  {K/}.  This  ap¬ 
proach  is  sufficient  to  explain  simple  size  and 
shape  features  in  free  chains  [31,  34].  In  this  work, 
we  expand  the  procedure  to  chains  subject  to 
boundary  conditions.  Our  goal  is  study  confined 
polymers  by  using  the  same  shape  descriptors  that 
are  useful  for  free  chains. 

Often,  descriptors  of  size  and  anisometry  are 
adapted  to  the  particular  geometrical  features  of 
the  chosen  confinement  [5,  10,  15,  35-41].  For 
example,  in  the  case  of  placing  a  chain  between 
two  parallel  planes,  one  can  analyze  the  compo¬ 
nents  of  the  radius  of  gyration  and  the  asphericity 
in  the  directions  parallel  and  perpendicular  to  the 
walls.  This  approach  may  not  be  useful  for  other 
confinement  geometries.  Elsewhere  [33]  we  have 
shown  that  the  simultaneous  use  of  distinct  shape 
descriptors  can  uncover  subtle  changes  in  molecu¬ 
lar  shape  induced  by  temperature.  For  the  present 
study  of  confined  polymers,  we  use  three  descrip¬ 
tors  that  are  independent  from  each  other. 

The  chosen  descriptors  characterize  the  shape  of 
freely  rotating  polymers  [33].  In  our  present  case, 
the  chains  are  not  created  in  isolation,  but  rather 


configuration  confined  between  two  parallel  planes.  (The 
example  corresponds  to  r?  =  1 00  beads,  a  step  length  of 
/=  3.8  A,  and  a  radius  of  excluded  volurne  of  '•ex  =  6.0 
A.  The  separation  between  planes  is  30  A.) 

grown  between  two  parallel  planes  and  grafted  to 
one  of  them.  The  subsequent  analysis  of  these 
chains  as  if  in  isolation  may  introduce  a  problem 
in  sampling.  Note  that  all  K,  configurations,  de¬ 
rived  from  each  other  by  translation  and  rotation, 
will  have  the  same  shape  description.  Yet,  not  all 
of  them  would  necessarily  be  "successful  configu¬ 
rations"  if  grafted  and  grown  under  confinement. 
To  circumvent  this  issue,  we  deal  with  a  slightly 
modified  subset  of  configurations  that  can  im- 
equivocally  be  associated  with  grafting.  Their  con¬ 
struction  is  as  follows:  (a)  the  first  three  monomer 
beads  are  chosen  collinear  and  perpendicular  to 
the  wall;  (b)  the  confined  Pearson  walk  with  ex¬ 
cluded  volume  is  built  starting  from  the  fourth 
bead,  located  at  a  distance  /  from  the  third 
collinear  bead  (see  Fig.  1). 

The  condition  that  the  chains  start  with  three 
collinear  beads  grafted  perpendicularly  provides 
an  unambiguous  specification  on  how  to  "insert" 
an  isolated  polymer  within  the  slab  and  anchor  it 
to  the  surface  by  one  end.  Moreover,  note  that  the 
probability  that  the  last  three  beads  {n  —  2,  n  -  1, 
n)  will  place  themselves  randomly  collinear  is  vir¬ 
tually  zero.  Thus,  in  practice,  it  will  only  be  possi¬ 
ble  to  "graft  back"  a  selected  configuration  K,  at 
one  end  of  the  chain,  the  one  with  the  three 
collinear  beads.  Its  shape  characterization  will  un¬ 
equivocally  be  associated  with  a  "successful"  con- 
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figuration.  Nonetheless,  this  special  condition  for 
grafting  has  little  influence  on  the  shape  of  long 
chains. 

There  exists  a  minimum  value  of  below 

which  no  polymer  configurations  are  found.  With 
the  first  three  beads  collinear,  this  critical  wall 
separation  increases,  but  the  difference  is  only 
significant  for  chains  with  <  10.  For  the  parame¬ 
ters  /  and  used  here,  conformers  were  always 
found  for  wall  separations  larger  than  8.5  A,  re¬ 
gardless  of  the  n  value.  This  value  is  comparable 
to  a  very  small  molecular  cavity  [10,  14,  18]. 


Detailed  Molecular  Shape  Description 

We  have  evaluated  shape  descriptors  on  the 
configurations  selected  according  to  the  above  cri¬ 
teria.  The  descriptors  characterize  three  distinct 
shape  features:  (a)  size  and  compactness,  de¬ 
scribed  by  the  span  R  and  the  instantaneous  ra¬ 
dius  of  gyration  [35];  (b)  anisometry,  described 
by  the  asphericity  [37,  38];  and  (c)  the  entangle¬ 
ment  complexity,  (Ascribed  by  the  mean  number 
of  overcrossings,  N  [42].  These  three  properties 
have  already  been  used  to  monitor  dynamical 
shape  transitions  in  biopolymers  [33]. 

The  evaluation  of  the  descriptors  has  been  dis¬ 
cussed  elsewhere  in  detail  [33].  We  summarize 
here  the  main  elements.  Let  {r^,  z  =  1,  2, . . . ,  n}  be 
the  coordinates  associated  with  a  chain  configura¬ 
tion  K-.  The  origin  of  the  coordinate  system  is  at 
the  geometric  centre  of  the  chain.  The  instanta¬ 
neous  radius  of  gyration  R^  and  the  span  R  are 
simply  given  as: 

=  -  E  '•f.  (la) 

^  i=l 

R  =  max  T:.  (lb) 

{z} 

A  number  of  so-called  shape  factors  have  been 
proposed  in  the  literature  to  account  for  the  config¬ 
urational  anisometry  [36~40].  In  our  case,  we  re¬ 
sort  to  Rq  and  R  to  characterize  size,  whereas  we 
use  the  asphericity  for  measuring  anisometry  [40]: 

ti-jjE  <2) 

defined  in  terms  of  the  three  principal  moments  of 
inertia  {A,}.  Spheroidal  chains  exhibit  fi  ~  0, 
whereas  prolate  chains  satisfy  O  ~ 


Finally,  we  use  the  probability  distribution  of 
overcrossings  as  a  measure  of  the  complexity  of 
self-entanglements  in  a  chain  [42].  In  the  present 
context,  the  term  “self-entanglement"  is  meant  to 
convey  how  the  linear  chains  “coil  and  twist  over 
themselves."  (Note  that  we  deal  with  a  single 
chain  that  in  general  will  not  be  grafted  to  both 
surfaces,  and  thus  they  will  not  form  knots  or 
braids.)  The  distribution  of  overcrossings  provides 
a  geometrical  characterization  of  the  complexity  of 
these  “coils"  that  is  independent,  in  general,  from 
size  and  anisometry  features.  The  distribution,  in¬ 
dicated  as  {At.,(n)},  gives  the  probability  of  ob¬ 
serving  N  overcrossings  when  projecting  the 
bonding  pattern  of  a  rigid  «-atom  macromolecular 
conformation  onto  two  dimensions.  (The  over¬ 
crossings  are  the  points  where  two  bonds  appear 
to  “cross"  when  projected  to  a  plane.)  In  practice, 
{di;v/(«)}  is  evaluated  by  averaging  over  a  large 
series  of  randomized  projections  (t)qDically  5X10^ 
points)  [42].  Here,  we  use  a  single  parameter  to 
describe  the  type  of  entanglements  obtained, 
namely  the  mean  number  of  overcrossings  N  de¬ 
fined  as: 

maxN 

N=  Z  NA^(n), 

N=0 

max  N  =  (n  -  3)(n  -  2)/2.  (3) 

This  descriptor  captures  some  essential  folding 
features^  In  swollen  or_disentangled  chains  we 
expect  N  ^  0,  whereas  N  will  take  large  values  in 
convoluted  or  entangled  polymers. 

For  each  polymer  of  length  n,  we  have  com¬ 
puted  configurational  averages  of  the  above  de¬ 
scriptors  over  many  chains.  If  D,  is  the  value  of  a 
generic  descriptor  D  at  a  successful  configuration 
reliable  (and  conservative)  estimates  for  the 
configurational  mean  <D>  can  be  obtained  by  av¬ 
eraging  over  sequences  of  the  computed  descriptor 
(Dj,  D2,  . . . ),  as  follows: 

2  N>2  k>2 

<D>  =  v^E  t  Wk.  (4) 

^  k=2  i=l 

We  have  added  enough  configurations  to  reach 
3-significant-figure  accuracy  in  The  si¬ 

multaneous  use  of  (Riy^,  (R),  (ft),  and  <N> 
should  provide  a  detailed  picture  of  all  relevant 
molecular  shape  features  for  compressed  chains.  In 
free  chains,  some  degree  of  correlation  is  found 
between  descriptors  [33,  34].  For  example,  a  de¬ 
crease  in  size  is  accompanied  by  an  increase  in 
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entanglements  and  a  decrease  in  asphericity.  This 
behavior  is  contrasted  below  with  the  results  for 
various  wall  separations  L. 


Shape  Transitions  Induced  by 
Compressing  a  Grafted  Poiymer 

Intuitively,  the  behavior  of  the  grafted  chains  in 
the  limit  of  large  L  values  (i.e.,  weak  confinement) 
and  small  L  values  (i.e.,  strong  confinement)  could 
be  predicted:  (a)  Long  grafted  chains  with  no  con¬ 
finement  should  resemble  free  chains.  For  the  cho¬ 
sen  /  and  parameters,  this  means  slightly 
prolate  configurations  ((O)  ~  0.1).  There  are  no 
attractive  interactions  between  beads  that  can  turn 
the  chain  into  a  spherical  globule,  (b)  When  com¬ 
pression  is  applied,  the  chains  should  become 
smaller,  more  spheroidal,  and  entangled.  There¬ 
fore,  we  expect  that  (  {  R),  and  (fi)  shoidd 

initially  decrease  when  L  decreases,  whereas  <  N  > 
should  increase  under  the  same  conditions,  (c) 
Under  strong  confinement  (i.e.,  with  wall  separa¬ 
tions  approaching  the  chains  will  be 

squashed  along  directions  parallel  to  the  surface. 
The  resulting  configurations  will  be  oblate,  large  in 
size  (not  compact),  and  unentangled.  Therefore, 
we  expect  the  behavior  of  the  shape  descriptors 
near  to  be  the  opposite  to  that  for  large  L 
values, _i.e.,  <RG>^^^  and  <a>  will  increase 
(and  (N>  decrease)  with  compression.  Figure  2 
illustrates  this  behavior  for  representative  configu¬ 
rations  of  an  n  =  100  chain  at  large,  mtermediate, 
and  small  wall  separations  (L  =  50  A,  30  A,  and 
10  A,  respectively). 

The  above  analysis  suggests  that,  at  least  for 
large  n  values,  size  and  anisometry  descriptors 
will  exhibit  a  minimum  as  a  function  of  L,  and  the 
averaged  mean  number  of  overcrossings  will  have 
a  maximum  for  some  L  value.  Let  L*(D)  be  the 
wall  separation  at  which  the  descriptor  D  has  a 
critical  point  (i.e.,  [  d{  D)/ dL]i^  =  0).  As  we  show 
below,  the  positions  of  the  critical  points  for  size, 
anisometry,  anc^ntanglement  complexity  [  L*(Rg), 
L*(fi),  and  L*(N),  respectively]  are  not  coincident. 

The  occurrence  of  a  minimum  radius  of  gyra¬ 
tion  in  chains  enclosed  in  various  geometries  has 
already  been  reported  [10, 15].  Our  results  confirm 
the  same  behavior  for  confined  chains  grafted  onto 
a  surface.  Figure  3  shows  the  configurationally 
averaged  radius  of  gyration  as  a  function  of  L  for 
n  =  75  chains.  The  molecular  size  has  a  minimum 


I 


L  =  10  A 

_ / 


FIGURE  2.  Typical  changes  in  polymer  configuration 
due  to  compression.  (The  parameters  for  the  chain  are 
the  same  as  in  Fig.  1.  The  separation  between  walls  is 
indicated  to  the  right.) 
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FIGURE  3.  Change  in  mean  radius  of  gyration  of  a 
n  =  75  chain  (/=  3.8  A,  =  6.0  A)  with  compression. 
(Wall  separation  and  radius  of  gyration  in  angstroms. 
The  error  bars  represent  the  uncertainty  in  the  mean 
values  after  configurational  sampling.) 
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at  approximately  30  A  [i.e.,  L*{Rq)  «  L*(R)  ~  30 
A],  This  minimum  is  rather  flat,  as  the  averaged 
molecular  size  changes  little  between  L  ~  20  and 
40  A. 

The  mean  number  of  overcrossings  and  the  av¬ 
erage  asphericity  (Figs.  4  and  5,  respectively)  com¬ 
plement  the  picture  of  how  the  shape  of  the  n  =  75 
chains  is  affected  by  compression.  The  most  entan¬ 
gled  configurations  are  foj^d  in  the  regime  of 
strong  confinement  [L*(N)  ~  20  A],  whereas 
spheroidal  oiies  appear  at  weaker  confinement 
[  ~  45  A].  By  comparing  the  results  in  Fig¬ 

ures  3-5,  it  is  clear  that  the  maximum  in  entangle¬ 
ment  complexity  is  found  for  oblate  chains  that  are 
not  very  compact.  Note,  however,  that  <N> 
changes  fast  as  L  is  further  reduced.  As  expected, 
the  large  and  flat  chains  found  at  very  strong 
compression  (L  ~  10  A)  are  not  entangled. 

The  above  results  indicate  that  different  shape 
features  are  affected  differently  by  confinement. 
Size,  compactness,  anisometry,  and  entanglement 
complexity  are  not  trivially  related  when  the 
boundary  conditions  are  changed.  This  trend  is 
also  found  at  other  chain  lengths.  Figures  6-8 
show  the  three  averaged  shape  descriptors  for 
chains  with  n  =  20.  Again,  the  positions  of  the 
critical  points  for  the  various  descriptors  do  not 
coincide.  The  most  compact  conformations  (i.e., 
those  with  smallest  (Rl))  are  found  to  be  oblate 
rather  than  spheroidal.  Note  that  a  maximum  in 


20  40  60  80  100 


L  (wall  separation) 

FIGURE  4.  Change  in  the  configurationally  averaged 
mean  number  of  overcrossings  of  a  n  =  75  chain  with 
compression.  (Wall  separation  is  in  angstroms. 
Simulation  conditions  are  the  same  as  in  Fig.  3.) 


entanglement  complexity  appears  at  very  strong 
confinement  [L*(N)  -  10  A],  just  above  the 
threshold  below  which  no  conformers  are  found 
[^min  ^  8.5  A].  This  critical  point  is  not  detected 
for  chains  shorter  than  n  =  20.  In  these  cases,  <  N  > 
increases  monotonically  during  compression. 
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L  (wall  separation) 

FIGURE  5.  Change  in  the  configurationally  averaged 
asphericity  of  a  n  =  75  chain  with  compression.  (Wall 
separation  is  in  angstroms.  Simulation  conditions  are  the 
same  as  in  Fig.  3.) 


L  (wall  separation) 

FIGURE  6.  Change  in  mean  radius  of  gyration  of  a 
n  =  20  chain  {/=  3.8  angstroms,  =  6.0  A)  with 
compression.  (Wall  separation  and  radius  of  gyration  in 
A.  The  error  bars  represent  the  uncertainty  in  the  mean 
values  after  configurational  sampling.) 
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FIGURE  7.  Change  In  the  configurationally  averaged 
nnean  number  of  overcrossings  of  a  n  =  20  chain  with 
compression.  [Wall  separation  is  in  angstroms. 
Simulation  conditions  are  the  same  as  in  Fig.  6.] 

The  critical  behavior  of  shape  descriptors  is 
summarized  in  Table  I  for  polymers  with  n  <  100. 
A  number  of  observations  can  be  made  from  these 
approximate  estimates: 

1.  Over  the  small  range  of  polymers  considered, 
the  wall  separation  leading  to  the  most  compact 
conformations  increases  approximately  linearly 
with  71.  As  well,  the  value  of  min(Kc)^^^  mini¬ 
mum  over  L,  for  n  constant)  appears  to  be  linear 
with  n. 

2.  The  minimum  value  of  asphericity  is  virtually 
the  same  for  all  chains,  that  is,  all  polymers  reach  a 
similar  "'spheroidal  regime"  at  some  point  during 
the  compression.  The  actual  position  of  this  mimi- 
mum  [L*(fl)]  varies  with  the  polymer  length. 
Nevertheless,  its  change  is  easily  predicted.  Let 


FIGURE  8.  Change  in  the  configurationally  averaged 
asphericity  of  a  n  =  20  chain  with  compression.  (Wall 
separation  is  in  angstroms.  Simulation  conditions  are  the 
same  as  In  Fig.  6.) 

be  the  value  of  the  average  span  in  the  limit  of 
large  wall  separations  (R^  =  For  the 

examples  in  Table  I,  we  find  that  the  span  of  the 
nonconfined  chains  and  the  wall  separation  that 
leads  to  the  most  spheroidal  chains  are  virtually 
indistinguishable  (at  95%  confidence  level  and  val¬ 
ues  in  angstroms): 

L*(I1)  «  (0.97  ±  0.15) R^  +  (0.3  ±  4.9).  (5) 

These  results  suggest  that  the  asphericity  is  not  a 
very  discriminating  shape  descriptor.  Asphericity 
alone  provides  little  insight  into  the  structural 
changes  taking  place  during  compression,  beyond 
giving  a  first  indication  on  the  chain's  reorienta¬ 
tion  toward  the  boundary  surface.  In  order  to 


TABLE  I _ _ _ ^ ^ _ 

Estimated  critical  values  of  configurationally  averaged  shape  descriptors  for  grafted  chains  of  various  lengths.® 


n 

L*(Ra) 

max<  N ) 

L*(N) 

min<Ll> 

L*(n) 

10 

_ 

_ 

_ 

— 

0.104  +  0.003 

12  +  1 

20 

11.9  ±  0.1 

17  +  3 

0.92  +  0.03 

10  +  1 

0.106  +  0.004 

21  +  1 

50 

20.7  +  0.3 

25  +  5 

3.76  +  0.05 

13  +  1 

0.101  +  0.004 

35  +  3 

75 

25.5  ±  0.5 

30  +  3 

6.8  +  0.1 

16  +  2 

0.100  +  0.003 

45  +  5 

100 

30.0  ±  0.5 

40  +  5 

9.9  +  0.1 

20  +  3 

0.099  +  0.003 

50  +  5 

^[Radius  of  gyration  and  wall  separations  L*  in  angstronns.  Due  to  the  configurational  restrictions  imposed  on  the  model  chains,  the 
shortest  chains  (e.g.,  n  =  10)  exhibit  no  extrema  for  the  descriptors  of  size  and  entanglement.  The  estimated  values  of  max  N  are 
less  accurate  due  to  the  intrinsic  errors  in  computing  mean  overcrossing  numbers,  in  addition  to  the  fluctuations  associated  with 
the  configurational  sampling.] 
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obtain  more  detailed  information,  one  should  ana¬ 
lyze  the  individual  principal  moments  of  inertia 
(cf.  eq.  (2)). 

3.  In  the  range  of  polymers  considered,  the  wall 
separation  leading  to  the  most  entangled  confor¬ 
mations  is  rather  small  [L*(N)  --  L*(RcV2]/  and 
it  also  increases  with  n.  The  results  suggest  that 
the  entanglement  complexity  is  maximized  just 
before  a  rapid  "configurational  transition"  leading 
to  flattened  chains  almost  adhered  to  the  surface. 

Figure  9  conveys  the  differences  in  shape  de¬ 
scriptors  from  another  viewpoint.  The  figure  gives 
the  dependence  of  the  span  (R)  on  the  separation 
between  walls.  We  observe  the  occurrence  of  rather 
flat  minima,  for  all  n  (except  n  =  10).  The  dashed 
lines  to  the  left  and  right  of  the  minima  represent 
the  approximate  wall  separations  for  the  maxima 
in  entanglements  and  minima  in  asphericity,  re¬ 
spectively.  This  figure  suggests  an  interesting  in¬ 
terrelation  between  the  three  shape  descriptors:  (a) 
The  minimum  in  asphericity  marks  the  onset  of 
boundary  effects  on  the  chains'  size;  (b)  the  mini¬ 
mum  in  molecular  size  defines  a  regime  where  the 
confinement  forces  the  chains  into  compact,  in¬ 
creasingly  entangled,  and  oblate  configurations; 
and  (c)  the  maximum  in  entanglement  complexity 
marks  the  limits  to  compactness.  Further  squeez¬ 
ing  brings  about  a  rapid  variation  in  molecular 
size.  Since  the  configurational  "space"  accessible 


0  20  40  60  80  100 

L  (wall  separation.  A) 


FIGURE  9.  Change  in  chain  span  as  a  function  of  the 
wall  separation  for  polymers  of  various  lengths.  (The 
dashed  lines  to  the  left  and  right  represent  qualitatively 
the  values  of  </?)  and  L  associated  with  the  maximum 
degree  of  entanglement  and  the  minimum  asphericity, 
respectively.  The  gray  area  gives  the  L  values  where  no 
chains  are  possible.) 


to  long  chains  is  very  reduced  for  L  <  L*(]V),  the 
chains  are  quickly  forced  to  adopt  laterally  spread, 
open,  and  imentangled  structures.  Note  that  ex¬ 
cluded  volume  will  pose  a  smaller  constraint  for 
shorter  chains.  Therefore,  a  decrease  in  entangle¬ 
ment  complexity  brought  with  further  compres¬ 
sion  may  not  be  visible  before  reaching  the  critical 
Kin  value. 


Further  Comments  and  Conclusions 

The  improved  design  of  interphases  with  de¬ 
sired  structural  properties  requires  appropriate 
tools  to  analyze  how  molecular  shape  is  modified 
by  various  control  factors  (e.g.,  interphase  thick¬ 
ness,  density  of  grafting,  temperature,  and  solva¬ 
tion).  In  the  present  work,  we  have  shown  that 
three  descriptors  (associated  with  independent 
shape  features)  can  provide  a  detailed  characteri¬ 
zation  of  configurational  transitions  in  compressed 
polymers.  The  results  show  that  it  is  not  necessary 
to  use  ad  hoc  descriptors  "adapted"  to  the  nature 
of  the  particular  geometrical  confinement  consid¬ 
ered. 

The  present  study  indicates  that  the  terms 
"compactness,"  "sphericity,"  and  "entanglement" 
are  not  interchangeable  for  the  compressed  chains. 
Summarizing,  we  have  found  that:  (i)  compressed 
chains  reach  their  smallest  size  for  configurations 
which  are  not  the  ones  closest  to  sphericity  and  (ii) 
the  maximum  entanglement  complexity  occurs  for 
rather  flat  conformers  that  are  not  maximally  com¬ 
pact.  Therefore,  it  is  clear  that  the  interplay  be¬ 
tween  these  shape  features  cannot  be  captured  by 
a  single  descriptor. 

The  same  procedure  presented  here  should 
be  valuable  to  study  configurational  transitions 
in  more  realistic  polymer  models,  A  number  of 
effects,  omitted  in  this  work,  should  introduce 
important  shape  changes.  For  instance,  we  have 
restricted  our  study  to  confinement  on  "hard" 
polymer  chains  with  variable  length.  We  have  ex¬ 
cluded  attractive  interactions  between  monomers 
or  between  the  polymer  and  the  surface,  as  well  as 
avoided  the  explicit  presence  of  molecules  in  the 
liquid  phase.  Moreover,  whereas  the  presence  of 
grafting  poses  a  reasonable  limit  to  the  accessible 
compression,  it  plays  little  role  in  modifying  the 
shape  of  long  chains.  This  is  due  to  the  fact  that  we 
have  only  considered  single-chain  effects  (i.e.,  a 
grafted  surface  with  low  surface  coverage  of  poly- 
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mers).  A  less  diluted  surface  grafting  between  at¬ 
tractive  chains  will  of  course  affect  enormously  the 
nature  of  their  entanglements.  The  role  of  these 
factors  on  polymer  shape  transitions  will  be  dis¬ 
cussed  elsewhere. 
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ABSTRACT _ 

The  local  self-consistent  field  (LSCF)  method  which  allows  full  SCF  computations  on  a 
fragment  of  very  long  molecular  systems  represented  by  a  classical  force  field  has  been 
adapted  to  the  description  of  nonmetallic  crystals.  The  periodicity  of  the  network  is 
achieved  by  modifying  self-consistently  the  basic  parameters  of  the  classical  subsystem 
(charges,  geometric  parameters)  along  the  SCF  iterative  scheme.  The  method  is  tested  on 
a-cristobalite.  The  parametrization  of  the  quantum  classical  junction,  achieved  by 
localized  bond  orbitals  has  been  performed  with  the  help  of  a  fragment  located  in  the 
bulk.  The  stability  of  the  method  with  respect  to  the  size  of  the  fragment  and  the  size  of 
the  crystalline  sample  appears  to  be  very  good.  The  properties  of  fully  hydroxylated  (010) 
surface  are  corrected  described.  Modeling  of  water  adsorption  on  the  ideal  surface  as 
well  as  on  two  kinds  of  surface  defects  gives  rise  to  very  reasonable  results  with  an 
absorption  energies  of  ca  10  kcal/mol  which  are  close  to  the  upper  limit  of  the 
experimental  data.  This  preliminary  study  appears  to  be  quite  encouraging  regarding  the 
possibilities  of  using  this  method,  which  can  be  considered  as  an  extension  of  the 
embedded  cluster  approach  to  covalent  solids.  Many  applications  to  surface  chemical 
reactivity  studies  can  be  imagined.  ©  1996  John  Wiley  &  Sons,  Inc. 
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Introduction 

Theoretical  studies  of  chemical  reactivity  re¬ 
quire  the  use  of  quantum  mechanics  (QM)  in 
order  to  be  able  to  describe  the  modification  of  the 
electronic  structure  of  the  reactants.  This  is  now 
quite  a  common  method  for  the  study  of  reactions 
in  the  gas  phase.  WTien  the  reaction  involves  the 
chemical  participation  of  the  surface  of  a  solid,  the 
quantum  chemical  study  of  the  exceedingly  large 
system  becomes  a  very  difficult  problem.  In  the 
case  of  nonmetallic  surfaces  one  often  replaces  the 
whole  solid  by  a  small  part  of  it  in  which  the 
valence  rules  are  fulfilled,  for  the  dangling  bonds, 
by  extra  atoms  (usually  hydrogen)  and  performs 
the  computations  with  the  usual  molecular  codes 
at  the  ab  initio,  density  functional  or  semiempirical 
levels  of  approximation  [1~6].  In  such  approaches 
the  size  of  the  clusters  is  limited  and  the  level  of 
accuracy  is  rather  low.  For  instance,  the  cluster 
H54Si25025,  model  of  a  silica  surface,  has  been 
computed  at  the  3-21  G  level  [7]  and  larger  clus¬ 
ters  with  about  100  Si  and  O  atoms  were  investi¬ 
gated  at  the  MNDO  level  [8].  In  addition  long- 
range  electrostatic  interactions  which  are  present 
in  crystals  are  either  omitted  or  roughly  reintro¬ 
duced  [9,  10]  in  such  models. 

Another  approach  consists  of  using  periodic 
Hartree-Fock  computations  [11]  on  a  periodic 
two-dimensional  slab  of  the  crystal  structure  [12], 
but  this  method  suffers  some  serious  limitations 
too:  The  slab  cannot  be  very  thick  and  the  influ¬ 
ence  of  the  bulk  crystal  is  not  very  well  repre¬ 
sented.  Moreover  the  periodicity  makes  difficult 
the  introduction  of  some  irregularities  or  defects. 
New  computational  approaches,  based  on  the 
Car-Parrinello  formalism  [13]  seem  very  promis¬ 
ing  [14,  15],  in  particular  for  modeling  metal  sur¬ 
faces.  However,  they  usually  require  large  com¬ 
puter  facilities  and  suffer  some  limitations  which 
make  alternative  approaches  still  desirable. 

In  the  case  in  which  the  interactions  can  be 
considered  at  the  physical  level,  modeling  of  large 
systems  can  be  conveniently  achieved  by  means  of 
molecular  mechanics  (MM).  For  instance,  the  ener¬ 
getics  of  zeolite  frameworks  [16,  17],  the  localiza¬ 
tion  of  adsorbed  molecules  in  silicalite  channels 
[18,  19],  and  amorphous  substances  [20]  have  been 
approached  successfully  with  such  models. 


In  the  case  of  chemical  interactions  one  usually 
assumes  that  the  electronic  perturbation  created  by 
a  local  interaction  in  a  large  system  becomes  negli¬ 
gible  at  some  distance  of  the  interaction  center  so 
that  one  immediately  thinks  of  limiting  the  quan¬ 
tum  chemical  computation  to  this  part  of  the 
system,  provided  that  one  is  able  to  include  the 
interaction  with  the  rest  of  the  system  in  the  com¬ 
putation  scheme  and  treat  properly  the  boundary 
between  the  quantum  and  the  classical  parts  of  the 
system.  This  last  point  is  straightforward  in  the 
case  of  molecular  solutions  in  which  it  is  easy  to 
chemically  define  the  solute  and  the  solvent 
molecules  which  here  play  the  role  of  the  classical 
surroundings  [21-25],  giving  rise  to  hybrid  QM- 
MM  methods.  This  is  also  true  for  molecular  crys¬ 
tals  in  which  one  may  introduce  a  Madelung  po¬ 
tential  due  to  the  surrounding  molecules  [26].  The 
task  becomes  far  more  difficult  when  there  is  no 
chemical  discontinuity  between  the  quantum  and 
the  classical  parts  of  the  system  such  as  in  macro¬ 
molecules  or  in  nonmetallic  solids.  The  idea  of 
using  localized  orbitals  is  not  so  recent  [27]  and 
offers  various  levels  of  approximation  for  the  reac¬ 
tive  part  and  for  the  surroundings  [28-30].  Va¬ 
lence  bond  treatments  are  an  alternative  to 
overcome  the  difficulty  [31].  Recently,  a  simple 
modification  of  the  usual  Roothaan  equations  [32] 
used  in  molecular  Hartree-Fock  computations 
have  been  proposed  under  the  denomination  of 
local  self-consistent  field  (LSCF)  method  [33]  which 
enables  us  to  treat  quantum  mechanically  part  of  a 
large  system  while  the  boundary  between  the 
quantum  and  classical  subsystem  is  properly 
treated.  The  method  consists  in  representing  the 
electron  pairs  involved  in  the  chemical  bonds  con¬ 
necting  the  quantum  and  classical  subsystems  by 
localized  orbitals  and  by  constraining  the  molecu¬ 
lar  orbitals  in  the  quantum  subsystem  to  remain 
orthogonal  to  these  localized  orbitals.  The  rest  of 
the  system  is  represented  by  a  classical  force  field, 
and  the  two  subsystems  interact  through  electro¬ 
static  and  van  der  Waals  interactions.  The  analytic 
derivatives  of  energy  with  respect  to  the  atomic 
coordinates  have  been  established  for  the  quantum 
subsystem  giving  rise  to  a  classical  quantum  force 
field  (CQFF)  [34]  which  allows  geometry  optimiza¬ 
tion  and  even  molecular  dynamics  computations. 
This  methodology  has  been  tested  successfully  on 
protein  and  enzymatic  systems.  The  present  work 
mainly  deals  with  the  implementation  of  the 
method  for  the  study  of  the  surface  reactivity  of 
covalent  solids. 
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After  having  solved  the  technical  details  inher¬ 
ent  to  solid  state,  we  chose  to  test  the  method  by 
modeling  a  silica  surface.  It  is  well  known  that 
silicon  oxide  is  one  of  the  most  abundant  compo¬ 
nents  of  Earth's  crust,  but  silica  is  also  a  widely 
studied  solid  by  computational  approaches,  using 
either  quantum  [6,  35-37]  or  classical  [18,  38,  39] 
mechanics.  Solid  silica  exists  in  many  different 
crystalline  forms  [40]  in  which  the  basic  units, 
most  often  a  tetrahedral  Si04  motif,  share  its  apices 
with  the  neighboring  tetrahedrons,  the  various  ge¬ 
ometric  arrangements  of  these  connectivities  lead¬ 
ing  to  the  different  well-defined  crystalline  forms. 
In  this  study  we  limit  ourselves  to  the  hydroxyl- 
ated  (010)  surface  of  a-cristobalite. 

This  study  will  be  organized  as  follows.  In  the 
next  section  we  briefly  recall  the  principles  of  the 
LSCF  method  and  the  modifications  made  neces¬ 
sary  by  its  extension  to  the  study  of  solid  surfaces. 
The  applications  to  a-cristobalite  are  described  in 
the  third  section.  The  parametrization  of  the  quan¬ 
tum  classical  junction  is  detailed  and  tested  on  a 
quantum  subsystem  located  in  the  bulk  of  the 
crystal.  The  results  regarding  the  geometry  and 
electronic  structure  of  a  regular  (010)  surface  are 
given  and  simple  examples  of  surface  irregularities 
are  presented.  The  results  are  briefly  discussed  in 
the  fourth  section  and  the  last  section  is  devoted  to 
the  conclusion. 


Extension  of  the  LSCF  Method 

The  principles  of  the  LSCF  method  have  been 
described  previously  [33].  The  whole  system  is 
divided  into  the  quantum  subsystem  (S)  and  its 
classical  environment  (£).  The  chemical  bonds 
linking  the  atoms  of  S  (frontier  atoms)  to  E  are 
assumed  to  be  represented  by  predefined  localized 
orbitals  built  with  hybrid  orbitals.  The  hybrid  or¬ 
bital  of  the  frontier  atom  belonging  to  S  is  called 
the  frozen  hybrid  orbital.  At  the  NDDO  level  of 
approximation  [41]  which  will  be  used  here,  the 
only  parameters  we  need  to  introduce  are  the  (i) 
the  s  content  of  the  normalized  frozen  hybrid 
orbital  of  each  frontier  atom  of  S.  This  quantity 
will  be  denominated  a.  (ii)  The  diagonal  element 
Pji  of  the  density  matrix  corresponding  to  each 
hybrid  orbital  in  the  localized  bond  orbital  linking 
S  to  E. 

This  pair  of  parameters  is  assumed  to  be  trans¬ 
ferable  and  is  usually  predetermined  on  a  model 
system  computed  quantum  chemically  with  the 


same  method  as  used  in  the  forthcoming  computa¬ 
tions.  At  the  NDDO  level  of  approximation  the 
energy  of  the  systems  is  expressed  by: 
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In  this  equation  denotes  the  density  matrix 
element  corresponding  to  atomic  orbitals  |  fx}  and 
|j^)  of  subsystem  S  (orthogonalized  to  the  frozen 
hybrid  orbitals).  The  semi-empirical  core-core 
atomic  repulsion  function  is  symbolized  by 
and,  according  to  the  NDDO  formalism,  the 
Coulomb  interaction  between  two  atomic  charges 
A  and  K  is  assumed  to  be  proportional  to  the 
Coulomb  integral  between  s  functions  and  Sj^, 
respectively.  The  equivalent  valence  electronic 
population  of  a  classical  atom  A  is  simply  defined 
by  where  Z^  is  the  number  of 

valence  electrons.  Therefore,  the  first  line  of  Eq.  (1) 
represents  the  Hartree-Fock  energy  of  electrons  in 
S;  the  second  sum  corresponds  to  the  potential 
energy  of  the  electronic  population  of  the  frozen 
hybrid  orbitals.  The  third  sum  is  the  core-core 
atomic  repulsion  in  S  and  the  fourth  one  stands 
for  the  electrostatic  interaction  between  subsys¬ 
tems  S  and  E.  The  two  extra  terms  V(E)  and 
V'(E,  S)  represent,  respectively,  the  potential  en¬ 
ergy  of  the  classical  subsystem  and  all  interactions 
between  the  two  subsystems  E  and  S,  except  the 
electrostatic  ones. 
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The  derivation  of  the  Hartree-Fock  equations 
from  this  expression  of  £  is  straightforward  and 
the  computation  of  one-  and  two-electron  integrals 
of  Eq.  (1)  involving  hybrid  atomic  orbitals  requires 
simple  linear  transformation  of  standard  integrals 
as  explained  in  Ref.  [33]. 

The  extension  of  this  formalism  to  crystalline 
solids  can  be  performed  at  least  in  two  different 
manners.  We  can  choose  a  finite  piece  of  crystal, 
especially  if  the  atoms  are  assumed  to  be  fixed, 
and  compute  the  interaction  of  the  quantum  sub¬ 
systems  with  all  the  atoms  of  £  as  it  is  the  case  in 
proteins.  The  macroscopic  size  of  the  crystal  may 
be  simulated  by  using  an  Ewald  sum  [42]  to  ex¬ 
tend  the  classical  environment  to  infinity.  In  the 
present  preliminary  study  we  chose  the  first 
method  in  order  to  analyze  the  convergence  of  the 
results  when  the  size  of  the  sample  is  increased.  In 
order  to  take  into  account  the  volume  and  the 
surface  periodic  arrangement  of  the  atoms,  the 
code  has  been  modified  in  such  a  way  that  the  net 
charges  located  on  the  atoms  of  the  classical  sub¬ 
system,  and  which  are  used  to  define  the  electro¬ 
static  interaction  of  the  two  subsystems,  are  set 
equal  to  the  net  charges  computed  on  the  equiva¬ 
lent  atoms  of  the  quantum  subsystem.  This  adjust¬ 
ment  is  performed  by  iterations  until  the  self- 
consistency  is  reached.  Similarly,  the  code  allows 
an  optimization  of  the  geometric  parameters  of  the 
surface  atoms  in  the  quantum  part,  and  these 
parameters  are  repeated  in  the  classical  part  of  the 
surface  in  order  to  achieve  the  tv\m-dimensional 
periodicity  of  a  regular  surface. 

The  input  data  of  the  program  are  the  basic 
parameters  a  and  Pji  of  the  frozen  orbitals  and  the 
atomic  coordinates  of  the  unit  cell  of  the  crystal. 
The  code  generates  finite  size  samples  by  repeat¬ 
ing  this  cell  N  times  along  the  three  directions  of 
space,  to  create  an  assembly  of  unit  cells. 

The  surfaces  are  generated  by  a  cleavage  plane 
defined  by  the  equation 

hx  -\-  ki/  Iz  =  (2) 

in  which  h,  k,  and  /  are  the  Miller  indices  of  the 
plane  and  is  an  integer.  By  varying  one 
defines  planes  with  the  same  atomic  structure.  In 
the  case  of  finite  samples  as  defined  above,  differ¬ 
ent  values  of  D,  generate  surfaces  limiting  a  crys¬ 
tal  sample  with  a  variable  number  of  atoms  for  a 
given  value  of  N.  Consequently,  by  changing  N 
and  Dy  one  can  study  the  influence  of  the  size  of 
the  bulk  and  of  the  extension  of  the  surface  on  the 
computed  properties. 


Computation  and  Results 

DEFI^mON  OF  THE  PARAMETERS 

The  test  system  investigated  is  a-cristobalite. 
This  crystal  has  a  tetragonal  space  group 

symmetry  [40].  The  pseudocubic  primatic  cell  con¬ 
tains  12  atoms  and  its  parameters  are  Uq  =  = 

4.9733  A  and  Cq  =  6.9262  A. 

In  the  computations  reported  here  the  frontier 
atoms  of  the  quantum  subsystem  are  oxygen 
atoms.  The  basic  parameters  a  and  are  adjusted 
on  the  O-Si  bonds  surrounding  a  Si04  tetrahedron 
located  in  the  bulk  of  the  crystal.  We  used  a 
pentamer  Si50j6  [Fig.  1(b)]  located  at  the  center  of 


A 


FIGURE  1.  General  view  (A)  and  the  quantum  systems 
(B,  C)  for  the  modeling  of  a  bulk. 
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Self-consistent  Si  and  O  charges  and  precalculated  parameters  of  the  MNDO-AM1  and  the  MNDO-PM3  levels. 


Units® 

Atoms^ 

MNDO-AM1 

MNDO-PM3 

Qsi 

Qo 

p 

‘  ss 

a 

Qsi 

Qo 

p 

^ss 

a 

125 

1500 

2.090 

- 1 .045 

1.526 

0.79 

1.590 

-0.795 

1.370 

0.79 

343 

4116 

2.130 

- 1 .064 

1.534 

0.79 

1.760 

-0.880 

1.420 

0.79 

729 

8748 

2.170 

- 1 .085 

1.540 

0.79 

1.890 

-0.945 

1.450 

0.79 

1331 

15972 

2.190 

- 1 .095 

1.544 

0.79 

1.980 

-0.990 

1.480 

0.79 

^  Total  number  of  elementary  units. 

^  Total  number  of  atoms  in  quantum  and  classical  subsystems. 


pseudocubic  samples  of  variable  size.  We  used 
four  different  samples,  containing  125,  343,  729, 
and  1331  crystallographic  unit  cells  (1500,  4116, 
8748,  and  15972  atoms)  [Fig,  1(a)].  The  parameters 
are  adjusted  self-consistently. 

The  computations  have  been  carried  on  by  both 
AMI  [43]  and  PM3  [44]  methods.  AMI  is  used 
with  its  native  parametrization.  In  the  case  of  PM3 
we  had  to  modify  the  Po  parameter  of  silicon.  We 
simply  transferred  the  value  used  in  AMI  ( po '  = 
1.38544).  The  values  obtained  are  collected  in  Table 
I,  together  with  the  self-consistent  values  of  the 
charges. 

In  order  to  check  the  reliability  of  the  method,  a 
second  computation  has  been  performed  with  a 
larger  cluster  embedded  in  the  same  crystal  sam¬ 
ples.  We  chose  a  12-member  silicon  oxygen  ring 
with  the  stoichiometric  formula  Si|8054  [Fig.  1(c)]. 
The  charges  computed  in  the  quantum  subsystem, 
by  using  the  parameters  determined  with  the 
smaller  cluster,  are  compared  with  the  charges 


TABLE  II _ 

Average  values  of  Si  and  O  charges,  calculated 
for  clusters  B  and  C  (Fig.  1). 


Cluster 

Units® 

Atoms® 

MNDO-AM1 

MNDO-PM3 

Osi 

Oq 

Osi 

Oq 

33 

2.015 

-0.972 

1.324 

-0.571 

B 

125 

1500 

2.091 

- 1 .048 

1.592 

-0.790 

C 

125 

1500 

2.071 

- 1 .040 

1.650 

-0.789 

B 

343 

4116 

2.134 

-1.071 

1.760 

-0.848 

C 

343 

4116 

2.130 

-1.067 

1.826 

-0.877 

B 

729 

8748 

2.168 

- 1 .086 

1.885 

-0.909 

C 

729 

8748 

2.164 

-1.084 

1.952 

-0.935 

B 

1331 

15972 

2.191 

-1.097 

1.975 

-0.958 

C 

1331 

15972 

2.189 

-1.096 

2.044 

-0.988 

®The  definition  was  given  in  Table  I. 
^  Data  for  the  molecular  cluster. 


computed  on  the  latter.  Due  to  the  boundary  con¬ 
ditions,  the  charges  are  not  rigorously  the  same  on 
the  various  atoms  of  the  same  element.  We  give  in 
Table  II  the  averaged  value  of  these  charges.  The 


FIGURE  2.  General  view  (A)  and  the  quantum  systems 
(B,  C)  for  the  modeling  of  the  surface  with  a  regular 
structure. 
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TABLE  III 


Values  of  optimized  parameters  for  cluster  B  (Fig.  2). 


Atoms^ 

D" 

Approx. 

^H403 

<H403Si2 

<H403 

SijOi 

RogSig 

(OgSia 

OsSi® 

(HsOzSie 

(HeO^ 

Si605 

(O/SieOs 

750 

2 

AMI 

0.946 

116.1 

81.1 

1.765 

3.2 

0.948 

114.6 

156.4 

1.756 

106.4 

750 

2 

PM3 

0.946 

113.5 

70.9 

1.723 

3.1 

0.945 

113.6 

171.6 

1.715 

106.2 

1050 

3 

AMI 

0.946 

116.4 

81.1 

1.764 

3.2 

0.947 

114.9 

159.1 

1.755 

106.3 

1350 

4 

AMI 

0.946 

116.6 

81.2 

1.763 

3.2 

0.947 

115.1 

161.8 

1.754 

106.2 

1470 

2 

AMI 

0.946 

116.1 

77.0 

1.765 

3.9 

0.947 

114.5 

153.5 

1.756 

107.1 

3822 

6 

AMI 

0.946 

116.9 

76.9 

1.761 

4.1 

0.947 

115.1 

157.7 

1.754 

107.1 

2430 

2 

AMI 

0.946 

116.1 

71.7 

1.765 

4.2 

0.947 

114.5 

155.0 

1.757 

107.3 

8262 

8 

AMI 

0.946 

117.1 

73.4 

1.761 

4.5 

0.946 

115.2 

157.4 

1.755 

107.5 

3630 

2 

AMI 

0.946 

116,2 

69.4 

1.765 

4.5 

0.947 

114.5 

156.1 

1.757 

107.3 

15" 

— 

AM1 

0.947 

116.8 

121.7 

1.740 

2.8 

0.945 

116.6 

106.6 

1.758 

111,7 

15" 

— 

PM3 

0.945 

116.1 

123.7 

1.699 

2.3 

0.944 

117.0 

94.4 

1.712 

109.0 

15^ 

AMI 

0.949 

117.6 

147.2 

1.730 

20.4 

0.948 

117.8 

89.0 

1.730 

109.3 

15^ 

— 

PM3 

0.945 

116.4 

131.8 

1.690 

28.3 

0.944 

116.4 

93.9 

1.689 

110.2 

®The  definition  was  given  in  Table  I. 

^  For  the  definition  of  D,-  see  Eq.  (2). 

^Corresponds  to  the  molecular  cluster  with  the  same  number  of  the  optimized  parameters. 

Corresponds  to  the  fully  optimized  molecular  cluster. 

convergence  is  quite  good,  and  the  charges  com¬ 
pare  quite  well  with  the  classical  charges  opti¬ 
mized  of  the  rest  of  the  crystal.  The  same  quanti¬ 
ties  computed  on  the  isolated  molecular  cluster 
(Si50i5H22)  depart  noticeably  from  the  data  ob¬ 
tained  on  the  embedded  cluster.  The  absolute  val¬ 
ues  of  charges  are  higher  at  the  MNDO-AMl  level 
because  the  charges  of  molecular  cluster  for  these 
systems  are  higher  for  the  MNDO-AMl  method 
(see  Table  II).  We  also  briefly  mention  that  the 
results  obtained  by  AMI  seem  in  better  agreement 
with  ab  initio  computations  which  suggest  a  charge 
close  to  —1  on  oxygen  [45].  In  other  respects  both 
methods  have  the  same  tendencies. 


MODELING  OF  THE  HYDROXYLATED  (010) 
SURFACE 

The  dangling  bonds  of  (010)  a-cristobalite  sur¬ 
face  belong  to  Si  and  O  unsaturated  atoms.  These 
surface  atoms  are  considered  as  chemisorption  sites 
for  polar  molecules,  especially  for  water,  which 
generates  an  hydroxylic  layer  on  the  surface.  The 
reaction  may  be  depicted  by  the  general  scheme: 

O  OH  OH 

I  I  I 

Si  — O— Si  +  H,0  ^  Si— O— Si 

/I  |\  /I  |\ 

Oo  OO  Oo  OO 

I  \  /I  \  \  /  / 


TABLE  IV _ 

Values  of  optimized  parameters  for  cluster  C  (Fig.  2)  at  the  MNDO-AMl  levei. 


Atoms® 

D," 

(HeOjSu 

SI4O3 

R05SU 

(H.|oOgSl8 

<Hio09 

SisO? 

ROgSis 

(SisOgOy 

750 

2 

0.948 

115.8 

78.0 

1.762 

112.1 

0.947 

114.6 

185.3 

1.757 

106.4 

1050 

3 

0.948 

116.1 

77.1 

1.760 

112.4 

0.947 

114.7 

189.5 

1.757 

106.3 

1350 

4 

0.948 

116.2 

78.3 

1.760 

112.3 

0.947 

114,8 

190.2 

1.757 

107.2 

1470 

2 

0.948 

115.9 

74,1 

1.761 

112.2 

0.947 

114.6 

182.8 

1.757 

107.2 

39" 

0.947 

115.9 

137.2 

1.746 

111.1 

0.947 

115.3 

115.3 

1.744 

110.4 

39 

— 

0.947 

116.0 

146.4 

1.725 

112.4 

0.949 

115.7 

115.7 

1.729 

112.0 

^The  definition  was  given  in  Table  I. 

^  For  the  definition  of  D,-  see  Eq.  (2). 

^Corresponds  to  the  molecular  cluster  with  the  same  number  of  the  optimized  parameters. 
^  Corresponds  to  the  fully  optimized  molecular  cluster. 
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TABLE  V 


Average  values  of  atomic  charges  for  cluster  B  (Fig.  2). 


Atoms® 

of' 

Approx. 

Ooi 

Q03 

0h4 

O05 

750 

2 

AMI 

-0.978 

1.948 

-0.744 

0.220 

-1.000 

750 

2 

PM3 

-0.683 

1.336 

-0.574 

0.179 

-0.665 

1050 

3 

AMI 

-0.982 

1.959 

-0.747 

0.222 

-1.003 

1350 

4 

AMI 

-0.984 

1.964 

-0.748 

0.223 

-1.005 

1470 

2 

AMI 

-0.999 

1.955 

-0.745 

0.218 

- 1 .004 

3822 

6 

AMI 

-0.999 

1.984 

-0.754 

0.223 

-1.015 

2430 

2 

AMI 

-0.998 

1.959 

-0.747 

0.218 

- 1 .008 

8262 

8 

AMI 

-1.009 

2.001 

-0.760 

0.224 

-1.023 

3630 

2 

AMI 

- 1 .004 

1.962 

-0.748 

0.218 

-1.010 

15" 

_ 

AMI 

-0.820 

1.990 

-0.735 

0.223 

-0.847 

15" 

_ 

PM3 

-0.616 

1.317 

-0.536 

0.185 

-0.552 

15^^ 

_ 

AMI 

-0.673 

1.775 

-0.673 

0.245 

-0.776 

15^ 

— 

PM3 

-0.510 

1.223 

-0.507 

0.202 

-0.541 

®  The  definition  was  given  in  Table  1. 

^  For  the  definition  of  D,  see  Eq.  (2). 

""  Corresponds  to  the  molecular  cluster  with  the  same  number  of  the  optimized  parameters. 
Corresponds  to  the  fully  optimized  molecular  cluster. 


The  ideal  hydroxylated  (010)  a-cristobalite  is 
shown  of  Figure  2(a).  The  hydroxyl  groups  all 
occupy  vicinal  positions  according  to  the  usual 
classification  [46]. 

Structure  of  the  Ideal  Surface 

In  order  to  analyze  the  electronic  properties  of 
the  surface,  we  performed  some  LSCF  computa¬ 
tions  on  two  clusters  of  variable  size  with  the 
formulas  Si207H2  and  Si60i9H2  that  contain  two 
vicinal  hydroxyl  groups  [Fig.  2(b)  and  2(c)].  Table 
III  collects  the  results  obtained  with  cluster  B  for  a 
variable  number  of  atoms  in  the  classical  system. 


including  the  free  cluster.  The  corresponding  re¬ 
sults  for  cluster  C  can  be  found  in  Table  IV.  Again 
one  notices  that  the  conclusions  do  not  depend 
significantly  neither  on  the  size  of  the  cluster  nor 
on  the  size  of  the  classical  subsystem.  Neverthe¬ 
less,  the  results  corresponding  to  the  free  cluster 
depart  strongly  from  all  the  other  ones  for  the 
orientation  of  the  OH  groups  (see  the  values 
<H403Si20|  and  <H807Si605  in  the  Table  III  and 
<H605Si403  in  the  Table  IV). 

The  main  difference  between  the  atoms  on  the 
surface  compared  with  the  bulk  is  the  electronic 
charge  (Tables  V  and  VI).  The  absolute  value  of 
the  charge  of  the  hydroxylic  oxygen  atoms  is  less 


TABLE  VI 


Average  value  of  atomic  charges  for  cluster  C  (Fig.  2)  at  the  MNDO-AM1  ievei. 


Atoms® 

D/" 

O01 

Osig 

O03 

0si4 

O05 

Ohs 

O07 

750 

2 

-1.014 

2.014 

-0.995 

1.940 

-0.840 

0.223 

- 1 .000 

1050 

3 

-1.017 

2.028 

- 1 .000 

1.948 

-0.852 

0.225 

- 1 .005 

1350 

4 

-1.019 

2.034 

-0.996 

1.947 

-0.853 

0.226 

-1.010 

1470 

2 

-1.024 

2.022 

-1.003 

1.946 

-0.847 

0.223 

- 1 .003 

39"= 

_ 

-0.804 

2.013 

-0.888 

1.921 

-0.750 

0.225 

-0.906 

39^ 

-0.687 

1.781 

-0.842 

1,935 

-0.668 

0.239 

-0.865 

^The  definition  was  given  in  Table  I. 

^  For  the  definition  of  D,  see  Eq.  (2). 

""  Corresponds  to  the  molecular  cluster  with  the  same  number  of  the  optimized  parameters. 
^  Corresponds  to  the  fully  optimized  molecular  cluster. 
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TABLE  VII _ 

Geometry  (A  and  degree)  for  clusters  B  and  D 
(Fig.  3)  at  the  MNDO-PM3  level.^’^ 


Cluster  Parameter  Value 


^02H3 

0.957(0946) 

^H305 

2.654 

1^H607 

1.783 

Ro7H8 

0.945(0946) 

Rh405 

0.970(0.951) 

^H605 

0.957(0.951) 

(h405H6 

106.8(107.7) 

(H302Si8 

116.6(112.8) 

(H807Si9 

116.8(112.9) 

^01H2 

0.946(0.945) 

^H401 

1.759 

^03H6 

1.904 

^H403 

0.967(0.951) 

f^H503 

0.954(0.951) 

Rh607 

0.948(0946) 

(h403H5 

106.9(107.7) 

(H201Si9 

114.6(112.8) 

(H607Si8 

114.1(113.5) 

®  Values  in  parentheses  correspond  to  isolated  clusters  or  to 
isolated  water  molecule. 

^  Surface  has  been  defined  with  Atoms  =  750,  D,  =  2. 


TABLE  VIII _ 

Geometry  (A  and  degree)  for  clusters  B  and  D  (Fig.  4) 
at  the  MNDO-PM3  level. 


Parameter 

Cluster  B 

Cluster  D 

Psi203 

1.699 

1.701 

^03Si4 

1.708 

1.702 

Rsi405 

1.696 

1.697 

I^OSSiS 

1.707 

1.708 

Rsi407 

1.704 

1.703 

^Si409 

1.715 

1.719 

f^H807 

0.948 

0.948 

^H10O9 

0.945 

0.945 

(01Si203 

87.4 

87.5 

(si203Si4 

110.7 

111.0 

(03Si405 

125.2 

125.1 

(si406Si6 

117.7 

117.6 

(o5Si601 

90.4 

90.4 

(H807Sj4 

113.9 

114.4 

(H10O9Si4 

112.8 

114.5 

(07Si409 

102.7 

101.7 

^H11012 

— 

0.968 

Rh13012 

— 

0.954 

Ro9H11 

— 

1.768 

(hi1012H13 

— 

106.6 

than  the  corresponding  quantity  for  oxygens  coor¬ 
dinated  with  two  surface  silicon  atoms,  and  the 
oxygen  atoms  of  the  bulk  bear  the  greatest  charge. 
The  same  variation  is  observed  for  the  positive 
charge  of  the  silicon  atoms.  These  observations 
indicate  that  the  Si-O  bonds  are  less  polarized 
near  the  surface  than  in  the  bulk,  as  expected, 
since  a  condensed  phase  is  a  polarizable  medium 
which  favors  charge  separation. 

The  barrier  to  internal  rotation  of  one  OH  group 
on  the  surface  can  be  easily  estimated.  The  results 
range  from  2.9  to  5.9  kcal/mol  depending  on  the 
size  of  the  bulk.  They  meet  the  known  experimen¬ 
tal  data  [47]. 

Finally  one  notices  that  the  results  obtained  by 
the  PM3  method  are  in  agreement  with  those  ob¬ 
tained  by  AMI. 

Interaction  of  a  Water  Molecule  \Mth  the  Ideal 
Surface 

This  study  has  been  performed  at  the  PM3  level. 
The  full  geometry  optimization  of  a  water  molecule 
interacting  with  the  hydroxylated  surface  shows 
two  stable  conformations  in  which  the  adsorbed 
molecule  interact  with  the  hydrogen  atom  of  a  OH 
group  and  with  the  oxygen  atom  of  a  OH  group. 
These  two  situations  correspond  to  the  cases  in 
which  one  hydrogen  atom  (H^)  of  the  adsorbed 
molecule  is  hydrogen  bonded  to  the  07Hg  group 
[Fig.  3(a)  and  (b)l  or  both  hydrogen  atom  (H4)  and 
oxygen  (O3)  are  involved  in  a  bifurcated  interac¬ 
tion  [Fig.  3(c)  and  (d)].  In  both  cases  (Table  VII)  the 
hydrogen  bonds  with  the  oxygen  atom  of  the  hy- 
droxylic  group  are  rather  short  (1.78  A  or  less) 
although  the  hydrogen  bonds  involving  the  oxy¬ 
gen  atom  of  the  adsorbed  water  molecule  are  closer 
to  the  usual  bonds  in  water  dimers  (1.90  A).  The 
interaction  energies  are  of  the  same  order  of  mag¬ 
nitude  [11.5  and  10.4  kcal/mol  for  structures  (b) 
and  (d)].  These  values  are  in  the  observed  experi¬ 
mental  range  [5,  48,  49],  although  they  appear  as 
slightly  overestimated. 

Modeling  Surface  Defects 

Silica  surfaces  can  hardly  be  considered  as  ide¬ 
ally  regular  surfaces  [50-52].  Most  of  the  theoreti¬ 
cal  approaches  deal  with  point  defects  [5].  We 
shall  here  consider  two  special  cases:  an  extra  Si02 
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FIGURE  3.  General  view  (A,  C)  and  the  quantum  systems  (B,  D)  for  the  modeling  of  water  adsorption  on  a  surface  with 
a  regular  structure. 


group  added  to  the  ideal  (010)  surface  and  a  crys¬ 
tal  edge. 

The  extra  SiOj  group,  represented  on  Figure 
4(a)  is  modeled  by  a  quantum  cluster  schematized 
in  Figure  4(b)  (formula  Si702iH6).  The  presence  of 
this  chemically  bonded  extra  group  introduces  a 
local  perturbation  in  the  bond  angles  and  bond 
lengths  (Table  VIII).  One  possible  structure  of  a 
water  molecule  adsorbed  on  this  defect  is  repre¬ 
sented  in  Figures  4(c)  and  4(d)  (the  interacting 
atoms  are  O9  and  H^i).  The  energy  of  interaction 
found  in  this  case  is  10.2  kcal/mol,  very  close  to 
the  values  obtained  on  the  ideal  surface. 

We  finally  considered  the  edge  of  a  (010)  plane 
represented  in  Figure  5.  The  uneven  geometry  of 
the  silanol  groups  is  clearly  visible  on  this  figure. 
The  energy  of  interaction  of  a  water  molecule  with 
these  groups  was  again  found  equal  to  10.2 
kcal/mol. 


Discussion 

Although  the  main  aim  of  this  work  is  to  test 
the  extension  of  the  LSCF  method  to  crystal  sur¬ 
faces,  especially  to  its  capability  for  modeling  their 
chemical  properties,  the  preliminary  results  ob¬ 
tained  on  the  (010)  surface  of  a-cristobalite  already 
give  some  interesting  indications  on  the  chemical 
properties  of  the  surface. 

The  most  striking  result  is  the  great  stability  of 
the  computed  properties  with  the  size  of  the  quan¬ 
tum  subsystem  either  in  the  bulk  or  on  the  surface. 
This  should  give  confidence  in  the  method.  The 
other  very  clear  result  arises  from  the  comparison 
of  the  geometric  features  of  an  isolate  surface  clus¬ 
ter  with  an  imbedded  one  which  exhibits  larger 
differences,  indicating  that  the  conclusions  drawn 
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FIGURE  4.  General  view  (A,  C)  and  the  quantum  systems  (B,  D)  for  the  modeling  of  a  surface  with  irregular  structure. 


FIGURE  5.  General  view  of  a  corner  of  (01 0) 
a-cristobalite  surface. 


from  the  study  of  small  molecular  clusters  in¬ 
tended  for  representing  the  surface  of  a  solid  may 
be  somewhat  erroneous. 

The  only  chemical  property  studied  here  is  the 
energy  of  interaction  of  a  water  molecule  with  the 
hydroxylated  (010)  surfaces.  The  unexpected  result 
is  the  small  range  of  variation  of  these  energies. 


This  could  mean  that  this  interaction  involves  the 
properties  of  individual  OH  groups  and  that  these 
properties  do  not  vary  much  from  one  place  to 
another,  as  expected  for  covalent  bonds. 

This  method  is  well  adapted  to  the  study  of 
other  physical  properties,  such  as  vibrational  fre¬ 
quencies  [36],  as  well  as  chemical  reactivity  such 
as  acidity  or  basicity  of  surface  groups  or  the 
influence  of  chemical  defects  (substitution  of  one 
atom  by  another  element).  It  should  be  tested  in 
this  way  in  the  near  future. 


Conclusion 

The  extension  of  the  LSCF  method  to  periodic 
systems  appears  to  be  quite  an  appropriate  tool  to 
perform  quantum  chemical  computations  on  a 
cluster  of  any  size  (provided  that  the  computa¬ 
tional  requirements  do  not  exceed  the  possibilities 
of  the  computer  in  use)  interacting  with  the  rest  of 


SURFACE  REACTIVITY  OF  COVALENT  SOLIDS 


the  solid  and  linked  to  it  by  localized  bond  orbitals 
which  minimize  the  electronic  discontinuity  be¬ 
tween  the  two  systems.  When  such  a  cluster  is 
located  at  the  surface  of  the  solid,  it  allows  the 
study  of  chemical  properties  of  the  surface  and  of 
chemical  reactions  in  which  this  surface  is  in¬ 
volved. 

This  computational  scheme,  which  was  origi¬ 
nally  devised  for  modeling  biopolymers,  can  also 
be  used  for  amorphous  solids  provided  that  one  is 
able  to  generate  the  structure  of  these  solids  and  to 
define  a  chemical  similarity  between  the  atoms  of 
the  quantum  part  and  those  of  the  rest  of  the 
system,  in  order  to  achieve  the  self-consistency  in 
the  charge  distribution. 

In  its  present  form,  the  method  works  very 
efficiently  at  the  NDDO  semi-empirical  level,  due 
to  the  assumed  orthogonality  of  the  atomic  or¬ 
bitals.  The  basic  principles  of  the  method  are  also 
valid  at  the  ab  initio  level,  but  they  require  a  full 
orthogonalization  of  the  quantum  subsystem  to 
the  localized  bond  orbitals  which  separate  the  clas¬ 
sical  and  the  quantum  subsystem.  This  additional 
subroutine  is  under  development  in  the  Nancy 
group.  Therefore,  the  local  self-consistent  field 
method  [33]  together  with  the  classical-quantum 
force  field  [34]  appear  as  a  natural  extension  of  the 
embedded  cluster  methods  to  covalent  solids  and 
thus  offer  some  very  appealing  prospects. 
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ABSTRACT _ _ _ 

In  a  positively  charged  metallic  cluster,  surface  tension  tends  to  enhance  the  ionic  density 
with  respect  to  its  bulk  value,  while  surface-charge  repulsion  tends  to  reduce  it.  Using 
the  stabilized  jellium  model,  we  examine  the  self-expansion  and  compression  of  positively 
charged  clusters  of  simply  metals.  Quantal  results  from  the  Kohn-Sham  equations  using 
the  local  density  approximation  are  compared  with  continuous  results  from  the  liquid 
drop  model.  The  positive  background  is  constrained  to  a  spherical  shape.  Numerical 
results  for  the  equilibrium  radius  and  the  elastic  stiffness  are  presented  for  singly  and 
doubly  positively  charged  aluminum,  sodium,  and  cesium  clusters  of  1-20  atoms. 
Self-expansion  occurs  for  small  charged  clusters  of  sodium  and  cesium,  but  not  of 
aluminum.  The  effect  of  the  expansion  or  compression  on  the  ionization  energies  is 
analyzed.  For  Al^,  we  also  consider  net  charges  greater  than  2^.  The  results  of  the 
stabilized  jellium  model  for  self-compression  are  compared  with  those  of  other  models, 
including  the  SAPS  (spherical  averaged  pseudopotential  model).  ©  1996  John  Wiley  & 
Sons,  Inc. 


1.  Introduction 

The  stabilized  jellium  model  (SJM)  or  struc¬ 
tureless  pseudopotential  model  [1]  is  a  sim¬ 
ple  modification  of  the  jellium  model.  In  the  jel- 

*To  whom  correspondence  should  be  addressed. 


Hum  model,  the  ions  are  replaced  by  a  continuous 
charge  background  of  density  h  =  3/(477 r^^),  trun¬ 
cated  sharply  at  a  cluster  radius  R  =  r^ where 
Nq  is  the  number  of  valence  electrons  in  the  neu¬ 
tral  cluster.  This  model  is  widely  used  in  the 
physics  of  metal  clusters,  since  the  precise  position 
of  the  ions  is  not  important  for  many  physical 
properties.  Stabilized  jellium  as  well  as  jellium 


International  Journal  of  Quantum  Chemistry:  Quantum  Chemistry  Symposium  30,  1537-1548  (1996) 

©  1 996  John  Wiley  &  Sons,  Inc.  CCC  0360-8832  /  96  /  071 537-1 2 


VIEIRA  ET  AL 


may  be  deformed,  but  we  limit  our  consideration 
here  to  spherical  shapes. 

Bulk  jellium  (Ng  oc)  is  unstable  at  density 
parameters  which  are  very  different  from  that  of 
sodium  (r/  =  3.93  bohr).  (We  adopt  atomic  units 
in  which  e  =  m  =  h  =  1.)  The  SJM  cures  the  main 
deficiencies  of  the  jellium  model,  namely  unrealis¬ 
tic  binding  energies  at  all  densities,  unrealistic 
bulk  moduli  for  metals  with  low  valence  electron 
density  such  as  cesium  (r^  ^  5.62  bohr),  and  unre¬ 
alistic  surface  energies  for  high  valence  electron 
density  metals  such  as  aluminum  =  2.07  bohr). 
The  SJM  retains  the  simplicity  and  universality  of 
jellium,  as  the  only  inputs  to  these  models  are  the 
bulk  density  parameter  and  the  valence  z.  In 
the  SJM,  we  subtract  the  spurious  self-repulsion  of 
the  jellium  charge  inside  each  Wigner-Seitz  sphere 
and  then  add  a  constant  potential  acting  on  the 
electrons  inside  the  cluster.  This  constant  poten¬ 
tial,  different  for  each  metal,  is  designed  so  that 
the  bulk  metal  is  stable  at  the  observed  valence- 
electron  density  (i.e.,  the  pressure  P  vanishes  at 
the  experimental  r^). 

In  previous  works,  the  surface  properties  [2], 
the  energetics  of  small  clusters  and  cohesive  prop¬ 
erties  of  bulk  metals  [3],  and  a  number  of  fragmen¬ 
tation  processes  of  charged  clusters  [4]  were  ex¬ 
plored  in  the  framework  of  the  SJM.  In  all  those 
works  the  density  parameter  of  the  clusters  was 
taken  to  be  r  the  bulk  density  parameter.  How¬ 
ever,  stabilized  jellium  permits  adjustment  of  the 
background  density  so  that  the  energy  per  particle 
of  a  cluster  is  minimal  [5,  6].  A  neutral  cluster  with 
has  a  lower  energy  than  a  cluster 
with  r/.  This  effect,  which  cannot  be  described  in 
the  jellium  model,  is  simply  explained,  within  the 
liquid  drop  model  (LDM),  by  the  surface  tension. 
It  is  called  ''self-compression"  and  has  analogs  in 
nuclear  physics  [7]  and  in  the  physics  of  helium 
droplets. 

In  the  present  work,  the  SJM  is  employed  to 
study  not  only  the  self-compression  but  also  the 
self-expansion  of  charged  clusters  of  simple  met¬ 
als.  The  magnitude  of  self-compression  should  be 
reduced  for  charged  systems.  Self-expansion  is  ex¬ 
pected  if  the  Coulomb  repulsion  of  the  excess 
charge  overwhelms  the  surface  tension.  We  use  the 
Kohn-Sham  equations  of  density  functional  theory 
in  the  local  density  approximation  (LDA),  and  the 
LDM,  considering  clusters  of  three  different  metals 
(Al,  Na,  and  Cs),  which  cover  the  range  of  physical 


densities,  with  single  and  double  positive  charges. 

Let  us  see  how  the  main  reasoning  used  to 
introduce  the  self-compression  of  neutral  clusters 
can  be  extended  to  the  case  of  charged  ones.  The 
constant  potential  is  an  average  of  the  difference 
between  an  electron-ion  pseudopotential  and  the 
jellium  electron-background  potential  in  a  imit  cell. 
It  can  therefore  be  expressed  in  terms  of  a  pseu¬ 
dopotential  "core  radius"  r^.  Then  the  bulk  stabil¬ 
ity  condition  is 


lim 

No--  dr^\NQ 


(1.1) 


where  Eq  =  E(Nq,  z,  is  the  total  energy  of 
the  neutral  cluster  in  the  SJM.  Equation  (1.1)  fixes 
==  z)  for  each  metal. 

The  equilibrium  positive-background  density 
parameter  r*  for  a  neutral  cluster  with  a  finite 
number  Nq  of  valence  electrons  is  the  solution  of 
the  following  equation: 


\  Wo 


(1.2) 


where  the  derivative  is  evaluated  using  the  same 
as  in  Eq.  (1.1)  (assuming  transferability  of  the 
pseudopotential  from  the  bulk  to  the  cluster  envi¬ 
ronment). 

The  elastic  stiffness  or  inverse  compressibility  of 
the  neutral  duster  measures  the  curvature  of  the 
energy  with  respect  to  r^,  around  the  point  rf.  It  is 
defined  by: 


llnrf  drj  (  N,  ] 


(1.3) 


It  is  a  function  of  Nq,  rf,  z,  and  r^.  This  second 
derivative  goes  over  to  the  bulk  modulus  B  ®  when 

If  we  ionize  the  cluster,  the  number  of  valence 
electrons  becomes  N  =  Nq  +  v,  with  Nq  the  num¬ 
ber  of  valence  electrons  of  the  neutral  system  and 
V  the  number  of  excess  electrons  (e.g.,  for  a  cluster 
with  a  single  positive  charge,  p  =  - 1).  The  energy 
of  the  charged  cluster  may  be  evaluated  by  solving 
the  self-consistent  Kohn-Sham  [8]  equations  and 
filling  up  in  sequence  the  available  single-particle 
levels.  It  can  also  be  evaluated  using  the  LDM  for 
charged  systems  [9,  10],  in  which  we  disregard 
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shell  fluctuations.  In  either  case,  the  equilibrium 
radius  of  the  positive  background  is  R  = 
with  the  equilibrium  density  parameter  defined  by 


d 


E 

N 


(1.4) 


where  E  =  EiN^,  r,,  z,  r^,  v)  is  the  total  energy  of 
the  charged  cluster.  We  have  expansion  if  r*  >  rf 
or  compression  if  rf  <  rf. 


The  elastic  stiffness  of  the  charged  cluster. 


1  ^E 

llTrr*  N 


(1.5) 


is  different  from  the  stiffness  of  a  neutral  cluster. 
Besides  Nq,  r* ,  z,  and  r„  it  depends  on  the 
charge  v. 

We  shall  determine  rf  and  related  physical 
properties  as  functions  of  Nq  and  v  for  different 
metals  (different  sets  of  rf  and  z). 


Al 


+ 


FIGURE  1.  (a)  Ratio  of  the  equilibrium  density  parameter  rf  to  its  bulk  value  rf  as  a  function  of  number  of  atoms 
^at  3  singly  charged  cluster  of  stabilized  jellium  representing  Al”^.  Heavy  dots:  LDA  Kohn-Sham  results. 

Solid  cun/e:  Exact  numerical  solution  within  the  LDM.  These  results  should  be  compared  with  those  for  neutral  clusters 
[5,  6].  Note,  however,  that  in  Ref.  [5]  the  local  spin  density  (LSD)  was  used  instead  of  the  LDA  and,  in  Refs.  [5,  6],  the 
densities  of  Na  and  Cs  were  rf  =  3.99  and  rf  =  5.63,  in  contrast  with  the  present  7=0  values  rf  =  3.93  and 
rf  =  5.62.  (b)  Ratio  of  the  elastic  stiffness  to  its  bulk  value  as  a  function  of  number  of  atoms  for  AIT 
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In  the  next  section  we  elaborate  on  the  LDM  for 
charged  clusters.  In  Section  3,  we  present  and 
discuss  the  quantal  and  liquid  drop  results,  and  in 
Section  4  the  main  conclusions  are  drawn. 


2.  Energj  of  the  Charged  System 

To  evaluate  the  energy  of  a  charged  system,  we 
use  the  Kohn~Sham  equations  in  the  LDA,  with 
the  Per  dew- Wang  parametrization  [11]  for  the 
correlation  energy.  The  quantal  equations  which 
were  solved  are  explicitly  written  in  Refs.  [3,  4]. 

The  LDM  for  a  charged  system  deserves  more 
attention,  since  it  is  not  so  well  known.  In  this 
model,  the  energy  of  a  neutral  cluster  is  written  as 


a  power  series  of  the  valence  electron  number 

Eq  =  z,  r^)No  +  z, 

+  +  (2.1) 

where  is  the  average  energy  per  electron  for  a 
bulk  system  of  uniform  density,  is  the  surface 
energy  coefficient,  and  is  the  curvature  energy 
coefficient.  This  expression  can  be  derived  using 
density  functional  techniques  by  developing  the 
density  of  the  cluster  around  the  density  profile  of 
the  planar  surface  (Lang-Kohn  calculation)  [2]. 
This  procedure,  which  goes  under  the  name  of 
'Teptodermous  expansion''  since  it  applies  to  sys¬ 
tems  with  thin  skin,  allows  us  to  evaluate  the 
surface  and  curvature  energies  [7].  Although  Eq. 


Al 


++ 


FIGURE  2.  Same  as  Fig,  1,  For 
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(2.1)  is  strictly  valid  for  Nq  ^  it  provides  a 
useful  average  of  the  shell-structure  oscillations  of 
the  quantal  result  and  may  be  accurate  even  in  the 
atomic  limit  Ng  =  1  [3,  12]. 

The  energy  of  a  charged  cluster  may  also  be 
described  by  a  continuous  approximation.  In  Refs. 
[9,  10]  the  following  liquid  drop  formula  of  a 
charged  cluster  with  v  excessive  electrons  has  been 
obtained  in  the  jellium  model; 


£  =  Eo 


„(w4) 


2(R  +  8) ' 


(2.2) 


where  W  is  the  work  function,  c  is  a  size  correc¬ 
tion  to  the  work  function,  and  8  is  the  distance 
from  the  image  plane  to  the  jellium  edge  of  the 
planar  surface.  In  Eq.  (2.2),  —  (W  -I-  c/R)  is  the 
chemical  potential  of  the  neutral  cluster;  its  contri¬ 
bution  of  order  R~^  is  currently  unknown.  If  we 


replace  the  jellium  LDM  coefficients  by  those  of 
the  SJM,  the  expression  (2.2)  is  still  valid  for  stabi¬ 
lized  jellium,  given  the  similarity  of  the  two  en¬ 
ergy  functionals.  It  has  been  used  in  Ref.  [4,  13] 
without  any  self-expansion  or  compression  effects. 
We  stress  that  the  LDM  formula  (2.2)  is  only  justi¬ 
fied  when  the  charge  i'  is  small  in  comparison 
with  the  total  number  of  electrons  N  =  Ng  -t  v. 

The  ionization  energy  and  the  electron  affinity 
may  be  derived  from  Eq.  (2.2): 


I  =  E(v=  -1)  -  E(v=  0) 

1  \  1 
=  W  -t  I  -  -t  c 


2  'I  R  +  8' 
A  =  Eiv  =  0)  -  E(v  -  1) 
1  )  1 


(2.3) 


(2.4) 


Na^ 
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Therefore,  once  values  of  W  and  c  are  available, 
the  SJM  can  predict  ionizations  and  affinities  for 
finite  systems.  The  work  function  is  given  by  the 
same  surface  calculation  which  yields  and  a,. 
[2].  Lacking  any  first  principles  calculation  of  c,  we 
have  evaluated  it  by  taking  finite  singly  charged 
and  neutral  clusters  and  considering  an  extended 
Thomas-Fermi  approximation  (TFDW-4)  for  the 
kinetic  energy  functional  of  the  self-consistent  den¬ 
sity.  Then,  fitting  the  ionization  energy  by  the 
formula  (2.3)  yields  c  =  -0.1  in  the  case  of  A1  [4], 
close  to  the  value  known  for  jellium  (c  =  -  0.07) 
[14].  Since  in  the  jellium  model  c  is  approximately 
independent  of  r^,  we  assume  that  the  same  is  true 
for  the  SJM.  Finally,  S  was  taken  from  Ref.  [15]. 

The  ionization  energy  is  always  positive,  but 
within  the  jellium-LDA  approach  the  electron 


affinity  is  negative  for  small  clusters.  That  is  also 
the  case  for  stabilized  jellium-LDA.  It  is  interest¬ 
ing  to  investigate  the  number  of  particles  for  which 
the  affinity  turns  our  to  be  positive.  From  Eq.  (2.4), 
the  condition  A  >  0  is  equivalent  to: 


No> 


9  C 


rM 


(2.5) 


More  generally,  the  affinity  of  order  v  is  defined 
by 


A(v)  =  E{v-  1)  -  E{v). 
This  quantity  is  positive  if 


No> 


-  c 


rW 


8 


(2.6) 


(2.7) 
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The  results  on  the  limits  of  stability  for  negative 
ions  are  displayed  in  Table  L 

The  solution  of  (1.4)  with  the  energy  given  by 
(2.2)  is  found  numerically,  since  the  dependence 
of  the  LDM  coefficients  and  W  is  known 

from  numerical  planar  surface  calculations.  The 
coefficients  c  and  8  are  assumed  to  remain  con¬ 
stant  when  compression  changes  the  density  of 
each  metal. 

The  LDM  can  be  used  to  make  simple  estimates 
of  the  transition  from  expansion  to  contraction.  We 
use  the  Taylor  expansion  of  Ref.  [5]  around  the 
bulk  density  parameter: 

E  =  £(r/)  +  (r,  -  rf)E'  +  i(r,  -  rffE"  +  - 

(2.8) 


with  E'  =  <?E/^rJr^=r/  =  “*■ 

E"  =  d'^E/ drl\r^=rf-  The  equilibrium  condition 
leads  to 

r*  -  =  -E’/E" .  (2.9) 

The  condition  r*  =  r/  is  then  equivalent  to 
E'  =  0  or,  considering  only  the  surface  and  the 
Coulomb  term  (with  8  =  0): 


2(r®) 


=  0. 


(2.10) 


For  v=  -2,  this  happens  at  Nq  =  3  for  aluminum, 
Nq  =  9  for  sodium,  and  Nq  =  11  for  cesium.  (The 
values  of  the  derivatives  appear  in  Ref.  [5].) 
Clearly,  these  guesses  for  the  transition  from  ex¬ 
pansion  to  compression  are  only  reasonable  if  the 


Cs^ 


FIGURE  5.  Same  as  Fig.  1 ,  for  Cs"^. 
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TABLE  I 


The  smallest  integer  value  of  Nq  for  which  the 
electron  affinity  of  the  (Nq  +  v  -  1)  valence 
electron  cluster  is  positive,  according  to  Eq.  (2.7).® 


V 

Al 

(rf  =  2.07, 
z  =  3) 

Na 

(rf  =  3.93, 
z=1) 

Cs 

{r^  =  5.62, 
z=1) 

1 

3 

2 

2 

2 

90 

42 

33 

3 

435 

210 

153 

4 

1216 

554 

423 

®The  smallest  neutral  cluster  of  stabilized  jellium  that  can 
bind  excess  electrons  has  Nq  valence  electrons.  The 
results  are  similar  to  those  of  the  jellium  model  [26]. 


truncation  of  the  Taylor  expansion  (2.8)  is  a  good 
approximation  to  the  exact  LDM  result,  and  if  the 
excluded  terms  in  (2.10)  may  be  neglected. 


3.  Results 

In  Refs.  [5,  6]  we  have  found  for  neutral  systems 
with  up  to  20  atoms  that: 

1.  Small  clusters  display  self-compression  rf  < 
r/,  i.e.,  the  ionic  density  is  higher  in  the  cluster 
than  in  bulk. 

2.  Quantal  shell  effects  produce  local  minima  in 
r*  and  local  maxima  in  B  for  closed-shell  clusters 
Nq  =  2,  8,  etc. 
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Figures  l(a)-6(a)  show  the  ratio  of  the  equilib¬ 
rium  density  parameter  r*  to  its  bulk  value  r  ®  as 
a  function  of  the  number  of  atoms  N^t  =  Nq/ z,  for 
A1+,  Al^^,  Na+,  Na^^,  Cs+,  and  Cs^^,  respectively. 
From  these  figures  we  see  that  Al'*'  and  Al^"^  show 
self-compression  for  all  sizes,  although  this  com¬ 
pression  is  not  so  pronounced  as  in  the  neutral 
case.  This  is  due  to  the  high  surface  tension  of  Al. 
On  the  contrary,  Na^+  and  Cs^”"  exhibit  self¬ 
expansion  for  small  clusters  and  self-compression 
for  large  clusters.  The  transition  takes  place  at 
Nat  =  8  and  N^t  =  9,  respectively,  using  the  LDM 
(and  the  Kohn-Sham  approach  gives  practically 
the  same  transition).  Na'*'  and  Cs'*'  always  show 
self-compression.  The  simple  rule  given  by  Eq. 
(2.11)  makes  a  good  estimate  of  the  transition  from 
dilatation  to  contraction,  except  for  Al  where  the 
truncated  Taylor  expansion  is  not  accurate  [6].  In 
the  asymptotic  limit  (Nq  ^  °°),  a  fixed  net  charge 
is  negligible  so  that  the  asymptotic  expression  (Ref. 
[5])  for  in  the  neutral  case  still  holds  in  the 
charged  case.  As  in  the  neutral  case,  the  LDM 
provides  a  good  average  of  the  quantal  results, 
and  quantal  shell  effects  are  evident  in  the  curves 
r*/rf  versus  N^f 

The  elastic  stiffness  was  calculated  numerically 
by  making  a  least-squares  fit  of  a  fourth-order 
polynomial  to  the  curve  EfNg,  r^,  z,  r^,  v),  around 
the  minimum  r* ,  in  the  quantal  as  well  as  in  the 
liquid  drop  cases.  Figures  l(b)-6(b)  show  the  ratio 
of  the  elastic  stiffness  to  its  bulk  value  as  a  func¬ 
tion  of  the  number  of  atoms,  for  Al'*',  Al^^,  Na”^, 
Na^”",  Cs"^,  and  Cs^”^,  respectively.  We  conclude 
that  in  the  LDM  picture  for  Al^"^,  bigger  clusters 
are  harder  than  the  bulk  and  smaller  clusters  are 
softer.  (The  same  was  found  for  neutral  clusters  in 
Ref.  [5],  where  the  asymptotic  N  ^  oo  correction  to 
the  bulk  modulus  was  positive  for  Al  but  negative 
for  Na  and  Cs).  The  crossover  occurs  at  Nq  =  4. 
On  the  contrary,  the  quantal  calculations  for  Al 
always  show  a  cluster  stiffness  higher  than  that  of 
the  bulk.  These  observations  may  have  technologi¬ 
cal  implications.  Na  and  Cs  clusters  charged  or 
neutral,  are  always  softer  than  the  bulk  in  the 
LDM  picture.  The  quantal  results  show  fluctua¬ 
tions,  with  some  clusters  harder  than  the  bulk 
material. 

The  role  of  the  charge  is  to  decrease  the  elastic 
stiffness.  Disagreement  between  the  LDM  and  the 
quantal  stiffnesses  for  very  small  charged  clusters 
indicates  that  the  LDM  may  be  failing  in  those 
cases.  For  some  very  small  clusters,  we  do  not  find 
solutions  for  the  equilibrium  radius.  The  disap¬ 


pearance  of  such  solutions  is  prefigured  by  a  sud¬ 
den  drop  in  the  elastic  stiffness. 

Ionization  energies  are  affected  by  the  expan¬ 
sion  or  compression.  We  can  define  two  ionization 
energies:  one  "vertical,"  where  the  density  param¬ 
eter  is  frozen  at  the  value  of  the  neutral  system, 
and  another  "relaxed,"  where  both  the  neutral  and 
the  charged  system  are  allowed  to  minimize  their 
energies  with  respect  to  r^.  The  first  is  more  realis¬ 
tic,  since  in  a  real  ionization  process  the  ions  do 
not  have  time  enough  to  readjust  their  positions. 
Figure  7  shows  the  vertical  ionization  energy  for 
Cs,  in  comparison  with  that  calculated  for  the  bulk 
density  (without  any  compression  or  expansion). 
The  stronger  self-contraction  that  occurs  for 
closed-shell  clusters  leads  to  a  stronger  binding 
of  the  least-bound  electron  and  thus  enhances 
the  shell-structure  fluctuations  of  the  ionization 
energy. 


4.  Conclusions 

We  have  examined  the  self-expansion  and  self¬ 
compression  of  singly  and  doubly  positively 
charged  clusters  of  stabilized  jellium.  We  have 
found  that,  for  a  range  of  densities  which  cover  Na 
and  Cs,  self-expansion  occurs  for  smaller  doubly 
charged  clusters  and  self-compression  for  larger 
ones.  However,  for  high  densities,  such  as  that  of 
Al,  self-compression  prevails  for  the  net  charges 
considered  here.  If  the  charge  increases  further, 
self-expansion  appears  even  for  Al,  and  for  very 
large  charges  the  phenomenon  of  "blow  up" 
(Coulomb  explosion)  arises  via  disappearance  of  a 
solution  for  the  background  equilibrium  radius 
(see  Figure  8). 

Note  that  the  recently  proposed  "ultimate  jel¬ 
lium  model"  [16],  unlike  the  stabilized  or  standard 
jellium  models,  does  not  admit  solutions  for  any 
positively  charged  clusters. 

Typically  the  charged  liquid  drop  model  yields 
a  useful  average  of  the  quantal  results.  But  it  fails 
to  give  results  for  some  very  small  charged  clus¬ 
ters,  where  the  quantal  method  still  applies. 

Finally,  let  us  discuss  the  experimental  and  the¬ 
oretical  information  available  on  the  compression 
and  expansion  of  charged  clusters.  Lattice  com¬ 
pression  or  dilatation  has  been  observed  in  many 
cases  (see  Ref.  [17]),  but  the  experimental  data  are 
contradictory.  The  observed  clusters  are  boimd  to 


INTERNATIONAL  JOURNAL  OF  QUANTUM  CHEMISTRY 


1545 


VIEIRA  ET  AL 


Cs 


FIGURE  7.  First  ionization  energy  of  Cs,  in  the  SJM,  where  the  neutral  clusters  have  relaxed  background  density  r* 
and  the  charged  clusters  have  the  same  density,  in  comparison  with  calculations  where  both  neutral  and  charged ° 
clusters  have  the  bulk  background  density  r^.  The  dashed  horizontal  line  indicates  the  work  function. 


a  surface,  and  the  effect  of  the  substrate  is  not  well 
understood. 

It  is  interesting  to  compare  the  self-compression 
of  a  neutral  system  predicted  by  the  SJM  with  a 
similar  compression  given  by  atomistic  calcula¬ 
tions.  The  spherical  averaged  pseudopotential 
model  (SAPS)  [18]  is  a  model  which  has  an  inter¬ 
mediate  degree  of  sophistication  between  the  SJM 
and  full  atomistic  calculations.  In  Refs.  [19,  20],  the 
SAPS  was  used  to  predict  an  inhomogeneous  con¬ 
traction  of  a  Cs  cluster,  in  which  the  ions  in  the 
inner  layers  are  more  closely  packed  than  those 
near  the  surface.  The  SAPS  has  been  used  recently 
to  calculate  the  frequency  of  the  "breathing"  mode, 
where  the  ions  oscillate  while  maintaining  spheri¬ 
cal  symmetry  [21].  In  the  SJM  framework,  we  can 
infer  the  frequency  of  this  mode  from  the  bulk 
modulus  via  Eq.  (21)  of  Ref.  [21].  The  result  for  Na 
ihu>Q  =  73N  meV)  does  not  differ  very  much 
from  that  obtained  in  the  SAPS.  (A  good  fit  [21]  is 
given  by  hwQ  =  a  +  with  a  =  0.6  meV 

and  b  =  30  meV).  This  suggests  that  the  SAPS  is 
similar  to  the  SJM.  The  SAPS  may  also  be  used  to 


evaluate  contraction  or  dilatation  of  charged  sys¬ 
tems  and  is  probably  somewhat  more  realistic  than 
any  continuous  background  model. 

In  Ref.  [22],  some  of  the  present  authors  have 
evaluated  the  contraction  of  six-atom  clusters  of 
different  metals  (Li,  Na,  Mg,  and  Al),  using  a 
linear  combination  of  atomic  orbitals  local  density 
molecular  code  but  considering  only  the  octahe¬ 
dral  shape.  The  results  were  only  qualitatively 
comparable  with  those  of  the  SJM. 

Some  theoretical  calculations  on  small  neutral 
and  charged  aluminum  clusters  with  unrestricted 
geometry  are  available.  In  Ref.  [23],  density  func¬ 
tional  calculations  (with  a  local  spin  density  ap¬ 
proximation  for  the  exchange-correlation  energy) 
and  finite-temperature  molecular  dynamics  are  ap¬ 
plied  to  neutral  and  singly  charged  aluminum 
clusters,  with  the  number  of  atoms  ranging  up  to 
10.  In  Ref.  [24],  an  all-electron  study  on  neutral, 
singly,  and  doubly  charged  aluminum  clusters  (for 
2-6  atoms)  is  carried  out  within  the  Kohn-Sham 
formalism,  including  gradient  corrections  to  the 
exchange  and  correlation  energy.  In  Ref.  [25],  the 
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q 

FIGURE  8.  Elastic  stiffness  B  and  equilibrium  density  parameter  r*  calculated  by  the  Kohn-Sham  method  for  the 
spherical  SJM  of  Alg+.  There  is  no  self-consistent  solution  for  q  >  12. 


structure  and  properties  of  singly  positively  and 
negatively  charged  (1-5  atoms)  aluminum  clusters 
are  investigated  by  using  the  linear  combination  of 
Gaussian  type  orbitals  method,  employing  local 
and  nonlocal  spin  density  approximations  and  a 
model  core  potential  that  allows  the  explicit  treat¬ 
ment  of  3s^  3p’  valence  electrons.  In  all  these 
studies  the  average  bond  length  tends  to  be  smaller 
in  smaller  clusters,  in  agreement  with  our  own 
results.  But,  again,  the  agreement  is  more  qualita¬ 
tive  than  quantitative.  Charged  clusters  show  a 
smaller  bond-length  reduction  that  neutral  ones. 

We  plan  to  study  the  SAPS  for  several  charged 
metallic  clusters  and  to  carry  out  more  exact  calcu¬ 
lations  for  the  charged  six-atom  clusters  with  re¬ 
stricted  and  unrestricted  geometries. 
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ABSTRACT _ _ _ _ _ 

Theoretical  models  will  be  presented  in  which  the  internal  energy  of  (a)  a  ferromagnet 
and  (b)  a  pyroelectric  material  is  expressed  in  terms  of  magnetization  and  electric 
polarization,  respectively.  For  the  ferromagnet,  simple  models  of  elementary  excitations 
(e.g.,  spin  wave  theory  in  an  insulator,  to  which  Stoner  excitations  must  be  added  in  a 
metal)  lead  to  formulas  for  the  internal  energy  at  low  temperature  as  power  laws  in  the 
change  of  magnetization  from  its  saturation  value.  An  unconventional  use  of  two  order 
parameters,  the  sublattice  magnetization  plus  the  metallic  discontinuity  in  momentum 
distribution  at  the  Fermi  surface,  allows  the  phase  transition  between  metallic  and 
insulating  states  of  antiferromagnets  to  be  treated  at  T  =  0,  the  three-dimensional 
transition-metal  dichalcogenides  being  an  example  here.  The  treatment  of  the  internal 
energy  of  the  ferromagnet  is  then  extended  to  include  the  entropy  also,  using  specifically 
the  Ising  model  in  nonzero  external  magnetic  field.  Its  relevance  to  the  Landau  theory  of 
phase  transitions  will  be  emphasized.  Some  comments  will  finally  be  made  about  the 
analogue  for  pyroelectrics.  ©  1996  John  Wiley  &  Sons,  Inc. 


Introduction 

The  concept  of  the  order  parameter  is,  of 
course,  central  to  the  theory  of  phase  transi¬ 
tions.  In  the  present  work,  we  shall  be  concerned 

*To  whom  correspondence  should  be  addressed. 


with  magnets — both  ferro  and  antiferro — where 
the  magnetization  is  the  crucial  quantity,  and  with 
pyroelectrics,  where  the  polarization,  or  dipole 
moment  per  unit  volume,  plays  a  central  role.  A 
little  more  specifically,  the  order  parameter  of  the 
transition  from  a  paramagnetic  to  a  ferromagnetic 
state  with  long-range  magnetic  order  is  the  magne¬ 
tization  M.  This  transition  is  second-order;  e.g., 
the  susceptibility  and  specific  heat,  second-order 
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derivatives  of  the  free  energy,  diverge  at  the  tran- 
sition. 

Though  we  shall  deal  at  some  length  with  ferro- 
magnets,  we  shall  also  make  reference  to  the  anti¬ 
ferromagnetic  case.  Thus  we  should  note  that  an 
antiferromagnet  exhibits  phase  transitions  at 
nonzero  field.  In  this  case  a  series  of  ordered 
phases  are  characterized  by  different  order  param¬ 
eters  related  to  components  of  the  sublattice  mag¬ 
netization  [1]. 

The  final  case  treated  in  the  present  study  is 
that  of  pyroelectric  materials.  The  phenomenon  of 
pyroelectricity,  which  is  observable  through  the 
temperature  dependence  of  the  electric  dipole  mo¬ 
ment,  is  important  technologically  for  infrared  de¬ 
tection.  It  will  be  treated  here  along  with  magnets, 
though  the  models  employed  below  are  more 
primitive  than  in  the  case  of  magnetic  systems. 

The  outline  of  the  study  is  then  as  follows. 
Immediately  below  the  exact  Onsager  solution  of 
the  two-dimensional  Ising  model  [2,3]  will  be 
shown  to  yield  a  unique  plot  of  the  internal  energy 
of  such  a  ferromagnetic  description  in  terms  of  the 
order  parameter,  the  magnetization.  Then  by  dis¬ 
cussing  low-lying  elementary  excitations  in  (a)  in¬ 
sulating  ferromagnets  using  spin  wave  theory  [4] 
and  (b)  metallic  ferromagnets  in  terms  of  Stoner 
excitations  [5],  the  form  of  the  internal  energy  in 
terms  of  the  change  from  the  saturation  magneti¬ 
zation  as  the  temperature  is  raised  from  T  =  0  will 
be  set  out.  Following  this,  a  brief  discussion  will 
be  given  in  the  main  text  of  the  treatment  of  the 
metal-insulator  transition,  now  at  T  =  0,  between 
two  antiferromagnetic  phases,  the  detailed  phe¬ 
nomenology  in  terms  of  sublattice  magnetization, 
the  conventional  order  parameter,  plus  a  second, 
unconventional  choice  at  T  =  0,  being  given  in  the 
Appendix. 

To  discuss  the  free  energy,  the  focus  on  the 
internal  energy  must  be  expanded  to  embrace  the 
entropy.  In  doing  this,  attention  will  be  restricted 
to  the  ferromagnetic  case,  A  generalization  of  the 
study  of  Tuszyhski  and  Wierzbicki  [6]  is  effected 
in  this  section  by  employing  now  the  one-dimen¬ 
sional  Ising  model  in  a  nonzero  external  magnetic 
field.  It  is  shown,  when  entropy  is  related  again  to 
the  order  parameter,  that  the  results  thereby  ob¬ 
tained  reduce  to  those  of  Ref.  [6]  when  the  interac¬ 
tion  between  spins  is  finally  "switched  off." 

A  brief  treatment  of  the  internal  energy  in  pyro¬ 
electric  materials  is  then  given  by  analogy  with  the 
magnetic  case,  and  some  predictions  as  to  low- 
temperature  behavior  follow. 


Internal  Energy  of  Ferromagnet  in 
Terms  of  Magnetization 

We  turn  first  to  discuss  the  internal  energy  of  a 
ferromagnet  in  relation  to  the  order  parameter,  the 
magnetization  M.  We  start  from  the  model  of  the 
two-dimensional  Ising  Hamiltonian,  solved  in  ex¬ 
act  form  by  Onsager  [2].  The  magnetization  is 
given  in  convenient  form  by  Callaway  [3].  As 
Grout  and  March  [7]  emphasized,  one  can  thereby 
express  the  internal  energy,  known  also  in  closed 
form  as  a  function  of  T  in  this  simple  model,  by 
solving  two  equations  simultaneously,  in  terms  of 
the  magnetization  M(T).  The  result  is  shown  in 
Figure  1. 

What  is  of  interest  when  we  turn  to  realistic 
magnetic  materials  below,  both  insulating  and 
metallic  ferromagnets,  is  the  deviation  from  the 
saturation  magnetization  M(T  -  0)  =  Mq:  say 

AM  =  Mo  -M(T).  (1) 

Then,  it  can  be  seen  from  Figure  1  that,  in  the 
two-dimensional  Ising  model: 

Eo  -  E(T) 

=  constant  AM  (at  small  AM  or  low  T).  (2) 

It  can  be  shown,  in  fact  [7]  that  for  all  Ising 
models,  Eq.  (2)  is  true  at  sufficiently  low  tempera¬ 
ture. 

LOW-LYING  EXCITATIONS  IN 

THREE-DIMENSIONAL  MAGNETS 

In  an  insulating  ferromagnet  in  three  dimen¬ 
sions,  one  can  express  the  predictions  of  spin  wave 


FIGURE  1.  lE{T)-Eo]/JwsAm(0)-m(T)]/m(0) 
In  two-dimensional  Ising  model  (after  Grout  and 
March  [7]). 
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theory  very  simply  for  the  low-lying  excitations:  (i) 
the  magnon  dispersion  relation  a  translates 
readily  into  the  form  of  internal  energy 

E(T)  -  Eq  a  (at  low  T),  (3) 

whereas  (ii)  the  deviation  of  magnetization  from 
the  saturation  value  is  given  by 

AM  a  (4) 

a  law  first  derived  long  ago  by  Bloch.  Inverting  Eq. 
(4)  to  obtain  T  a  and  substitution  in  Eq. 

(3)  yields  the  desired  result  for  an  insulating  ferro- 
magnet  at  low  temperature: 

E(T)  -  Eq  a  (for  an  insulator  at  low  T). 

(5) 

Turning  to  metallic  ferromagnets,  such  as  Ni, 
one  has  single-particle  or  Stoner  excitations,  and 
these,  it  turns  out,  dominate  aspects  of  the  low- 
temperature  thermodynamics  in  the  metallic  mate¬ 
rials.  In  particular,  one  has  instead  of  Eq.  (3) 

E(T)  -  Eo  a  T" 

(for  a  metallic  ferromagnet  at  low  T).  (6) 

However,  the  magnetization  is  still  dominated  by 
the  spin  waves.  Substituting  therefore,  according 
to  Eq.  (4),  T  a  {AMY'’^  but  now  into  eq.  (6),  one 
obtains  immediately  for  the  metallic  magnet: 

E(T)  -  Eo  a  (AM)"/^  (7) 

These  results  are  evidently  rather  different  from 
the  Ising  model  prediction  (2)  and  are  the  correct 
results  for  the  low-temperature  behavior  of  three- 
dimensional  magnets;  insulating  according  to  Eq. 
(5)  and  metallic  having  behavior  (7).  In  other 
words,  the  low-temperature  expansion  of  Eg  - 
E(T)  in  powers  of  AM  is  now  nonanalytic  around 
T  =  0  at  leading  order. 


Metal-Insulator  Transition  at  T  =  0 
Between  Two  Antiferromagnetic 
Phases 

March  et  al.  [8]  proposed  to  build  a  phenomeno¬ 
logical  theory  of  metal— insulator  transitions  at 
T  =  0  using  the  discontinuity,  q  say,  in  the  elec¬ 
tronic  momentum  distribution  at  the  Fermi  surface 
of  the  metallic  phase  as  the  order  parameter.  In 


this  way  they  were  able  usefully  to  treat  (a)  the 
transition  between  electron  liquid  and  Wigner 
electron  crystal  in  the  jellium  model  and  (b)  the 
transition  induced  in  a  metal  described  by  the 
Hubbard  Hamiltonian  to  an  insulating  phase  as 
the  energy  it  costs  to  put  two  electrons  on  the 
same  site  with  antiparallel  spins  is  increased  from, 
say  U,  to  some  critical  value  L^. 

Below,  we  add  this  order  parameter  to  the  con¬ 
ventional  one— the  sublattice  magnetization— to 
treat  the  metal-insulator  transition  between  two 
antiferromagnetic  phases  at  T  =  0:  see  also  Ferraz 
et  al.  [9].  This  treatment,  summarized  below,  was 
designed  with  the  three-dimensional  transition- 
metal  chalcogenides  in  mind.  A  schematic  phase 
diagram  for  the  specific  case  of  NiS2_j.Se^  is 
drawn,  for  example,  by  Czjzek  et  al.  [10]. 

To  model  such  a  phase  diagram,  it  is  quite  clear 
that  one  must  introduce  strong  electron-electron 
correlations  in  narrow  energy  bands,  which  is  just 
the  situation  for  which  the  Hubbard  Hamiltonian 
was  invented.  Writing  this  for  a  half-filled  doubly 
degenerate  band  one  has,  in  a  customary  notation. 

g  ^  ^  a 

+  (H  —  /)  52  W^2(r  • 

g'<r 

In  Eq.  (8),  /  is  the  interatomic  exchange  integral, 
and  the  index  a  =  1  and  2  denotes  the  two  or¬ 
bitals.  One  then  associates  the  metal-insulator 
transition  within  this  framework  with  a  decrease 
in  the  ratio  of  the  band  width,  proportional  to 
to  the  correlation  energy  U,  and  /,  through  some 
critical  value,  as  discussed  by  Mott  [11].  In  the 
three-dimensional  transition-metal  chalcogenides 
cited  above,  namely  NiS2_;,Se;„  the  band  width 
will  clearly  depend  on  x,  since  the  transfer  of  an 
electron  between  Ni  sites  will  occur  via  an  anion. 

The  phenomenological  model  employed  here 
generalizes  the  work  of  March  et  al.  [8]  to  antifer¬ 
romagnetic  phases  as  follows.  While  the  fermiol- 
ogy  of  an  antiferromagnetic  metal  is  a  somewhat 
complex  problem,  it  is  assumed  in  the  phe¬ 
nomenology  that  some,  average,  discontinuity  in 
the  momentum  distribution,  again  defined  as  q, 
can  be  used  as  one  order  parameter  in  the  antifer¬ 
romagnetic  metal-insulator  transition.  The  details 
are  set  out  in  the  Appendix,  and  in  particular  in 
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Eq.  (Al)  the  internal  energy  E(q,M)  is  expanded, 
following  Landau,  in  the  two  order  parameters,  q 
already  defined  above,  and  the  conventional  one, 
the  sublattice  magnetization  M. 

As  set  out  in  the  Appendix,  minimization  of  the 
internal  energy  E(q,  M)  leads  to  the  simple  predic¬ 
tion  that  the  difference  between  the  sublattice 
magnetization  in  the  metallic  (m)  and  the  insulat¬ 
ing  (/)  phases  is  proportional  to  the  discon¬ 
tinuity  q: 

M,„  -M,  a  q.  (9) 

This  Eq.  (9)  would  seem  to  be  of  interest  in  connec¬ 
tion  with  magnetic  properties  of  two  antiferromag¬ 
netic  phases  around  the  metal-insulator  transition 
and  seems  worthy  of  attempts  to  test  it  experimen¬ 
tally.  This  would  mean  low-temperature  measure¬ 
ments  of  sublattice  magnetization  in  two  antiferro¬ 
magnetic  phases  and  also  positron  annihilation,  or 
Compton  profile  experiments  on  the  metallic  phase 
near  the  transition. 

So  far,  the  discussion  has  focused  entirely  on 
internal  energy  E.  To  complete  the  thermodynamic 
description,  one  must  also  know  the  entropy.  The 
results  of  our  very  recent  investigation,  via  again 
the  Ising  model,  are  the  focus  of  the  ensuing  dis¬ 
cussion.  Then,  the  combined  considerations,  lead¬ 
ing  to  the  free  energy  will  be  related  to  the  Landau 
theory. 


Expressions  for  Entropy 

In  earlier  work  [6],  the  entropy  S  of  an  assembly 
of  noninteracting  spins  of  arbitrary  magnitude  /  as 
a  function  S(m)  of  the  magnetization  m  induced 
by  an  applied  external  magnetic  field  H  has  been 
derived.  This  entropy  is  given  implicitly  by  the 
two  coupled  equations  below: 

S  _  I  sinh[(2}  +  l)gfi^v/2]\ 

sinh(gju.gj//2)  j  +  (10) 

and 


2m  =  gjjigl  (2/  +  Dcoth 


(27+1) 


where  m  denotes  mean  magnetization  per  particle: 
m  =  M/N  and  is  a  Lagrange  multiplier.  For 
7=2,  this  reduces  to  the  well-known  expression 
[12]: 

5 

=  ln(2)  -  |ln(l  —  -  a  tanh“'(o-), 

(12) 


where  a  is  the  reduced  magnetization  defined  by 
a-  =  M/igjx^N).  Equation  (12)  is  plotted  in  Figure 
2.  Here,  our  main  aim  is  to  generalize  this  result  to 
apply  to  interacting  spins,  and  thereby  to  make 
contact  with  the  approximate  theory  of  Landau 
[13]. 

To  proceed,  we  start  out  from  well-established 
results  [3, 14]  for  (i)  equilibrium  magnetization  as 
a  function  of  temperature  at  H  =  0  and  (ii)  the 
Helmholtz  free  energy  F  =  E  -  TS  for  the  two- 
dimensional  Ising  model  in  zero  field.  The  magne¬ 
tization  7n  can  be  written  in  the  explicit  form:  [14] 
for  r  <  T, 


(1  + 

\/l  - 


(13) 


Here,  p  =  lAkJ)  while  €  measures  the  interac¬ 
tion  strength  and  is  related  to  the  critical  tempera- 
ture  by  kgT^  =  2.2696.  The  entropy  S  per  site 
can  be  obtained  as  a  function  of  /Be  also  through 
TS  =  E  ~  F,  where  the  Helmholtz  free  energy  F 
per  site  is  given  by  [14]: 

jSF  =  ln[2cosh(2/36)] 


where  3  =  2 sinh(2jSe)/cosh2(2/3e).  The  internal 
energy  per  site  [7]  takes  the  form 


/3£  =  -^6coth(2/3e)|l  -I-  —  [2tanh^(  )Se)  -  1] 


(15) 


Using  these  results  [Eqs.  (13)-(15)],  the  entropy  S 
is  plotted  for  this  model  as  a  unique  function  of 
the  equilibrium  magnetization  m  in  Figure  3.  This 
complements  the  plot  of  E(m)  given  in  Ref.  [7]. 
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FIGURE  2.  Entropy  (for  >^  =  5)  of  Eq.  (12)  as  a  function  of  the  reduced  magnetization  a  [6, 12]  for  an 

ensemble  of  noninteracting  spins. 


Naturally  since  m  =  0  for  T  >  such  plots  apply 
only  for  T  <  T^.  When  we  consider  S(m)  in  Figure 
3  in  relation  to  the  noninteracting  results  of  Ref.  [6] 
and  Figure  2,  the  maximum  entropy  per  site 
^nonint/^B  =  ln(2),  is,  as  expected,  substantially  re¬ 


duced  by  the  presence  of  interactions,  measured 
by  the  energy  e  in  Eqs.  (13)-(15). 

To  make  closer  contact  with  the  work  of  Ref.  [6], 
we  turn  next  to  consider  the  one-dimensional  Ising 
model  in  the  nonzero  external  field  H.  The  magne- 


FIGURE  3.  Entropy  per  Boltzmann  constant  as  a  function  of  the  equilibrium  magnetization  m  for  the  two-dimensional 
Ising  model. 
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the  way  the  entropy  of  an  interacting  spin  system 
can  be  expressed  in  terms  of  the  equilibrium  mag¬ 
netization.  This  has  been  done  for  (i)  the  two-di¬ 
mensional  Ising  model  in  zero  field  and  (ii)  the 
one-dimensional  Ising  model  in  an  applied  field. 
Particularly  through  example  (iiX  contact  can  be 
made  with  the  result  of  Ref.  [6]  for  noninteracting 
spins.  Note  that  the  related  computer  simulation 
results  of  Stump  [15]  for  the  two-dimensional  Ising 
model  in  an  external  field,  in  contrast  to  our  ap¬ 
proach,  were  concerned  with  the  entropy  S  = 
S(E,  m),  as  a  function  of  internal  energy  and  mag¬ 
netization. 


Using  the  thermodynamic  relation  S  =  -{dG/ 
^T)^,  one  can  find  S  in  terms  of  the  equilibrium 
magnetization  m  and  /36.  A  plot  of  S  versus  these 
two  variables  (h  and  fSe)  is  shown  in  Figure  4. 

To  summarize  this  section,  we  have  established 


FIGURE  4.  A  three-dimensional  surface  plot  of  the  entropy  per  Boltzmann  constant  as  a  function  of  x  =  and 
h  ^  pH  for  the  one-dimensional  Ising  model  in  the  presence  of  a  magnetic  field. 


Relation  to  Landau  Theory 

Landau  theory  [13]  can  be  taken  to  start  out 
from  an  approximate  form  of  free  energy  as  a 


tization  m  can  now  be  written  in  terms  of  pe  and 
pH  =  h  as  [14] 

sinh(/z) 

m  =  ! .  .  (16) 

ycosh^C/z)  “2c  sinh(2/3e) 

For  this  one-dimensional  model,  the  Gibbs  energy 
G  per  site  is  given  in  the  simplest  form  by: 


PG{  pe,  h)  =  -  Pe  -  In  cosh(/z)  + 


sinh(h) 
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function  of  the  magnetization  m  and  the  tempera¬ 
ture  T,  namely 

F(m,T)  =  E(m,T)  -  TS{m,T),  (18) 

Assuming  that  Eq.  (18)  is  sufficiently  general  to 
apply  away  from  equilibrium,  one  now  performs 
the  minimization  of  f  with  respect  to  m  at  con¬ 
stant  T  to  find 


dF 

dE 

dS 

=  Q  =  _ 

-  T - 

dm 

T  dm 

T  dm 

To  establish  contact  between  this  Eq.  (19),  valid 
on  the  minimum  of  the  free  energy,  and  the  results 
presented  earlier,  we  have  taken  the  (equilibrium) 
relations  between  S  and  m  and  E  and  m  in  the 
two-dimensional  Ising  model  in  zero  magnetic  field 
and  have  plotted  the  two  derivatives  {dE/dm)j 
and  (dTS/dm)j  in  Figure  5  against  magnetization 
m.  What  is  remarkable  to  us  is  that,  over  an  exten¬ 
sive  range  of  magnetization,  coming  away  from 
the  critical  point,  Eq.  (19)  is  well  satisfied.  How¬ 
ever,  as  one  nears  the  region  where  the  magnetiza¬ 
tion  is  approaching  saturation,  some  inadequacies 
in  using  our  results  for  large  values  of  magnetiza¬ 
tion  are  being  exposed. 

In  this  context,  we  note  the  results  for  (three-di¬ 
mensional)  insulating  and  metallic  ferromagnets 
near  the  saturation  value  of  the  magnetization. 
Power  laws  (n/3,  with  n  integer)  emerge  for  the 
internal  energy  A  £  in  the  two  cases  in  terms  of  the 
deviation  from  the  saturation  magnetization 
Am  =  m(0)  —  m(T),  by  consideration  of  the 
low-lying  excitations  in  the  two  types  of  ferromag- 
net.  Evidently,  near  T  =  0,  Am  is  a  satisfactory 
order  parameter  though  the  internal  energy  is  non- 
analytic  in  Am. 

In  contrast,  the  Landau  theory  concerns  itself 
with  an  expansion  around  the  critical  temperature 
T^,  where  the  magnetization  m  (the  order  parame¬ 
ter  in  the  examples  treated  in  the  present  work)  is 
itself  small.  Clearly,  to  extend  the  Landau  theory, 
one  must,  at  the  very  least,  interpolate  successfully 
between  the  Landau  limit  and  the  nonanalytic 
behavior  in  Eqs.  (5)  and  (7). 


Analogous  Treatment  of  Pyroelectric 
Materials 

Especially  as  the  theory  and  experiment  are 
much  less  extensively  studied,  especially  at  low 
temperatures,  it  is  appropriate  to  follow  the  above 


a 


b 


FIGURE  5.  (a)  Derivatives  of  pE  (solid  line)  and  S/k^ 
(dashed  line)  with  respect  to  m  at  constant  temperature 
T  as  functions  of  m.  (b)  A  comparison  of  the  profiles 
between  the  pE  (solid)  and  the  S/k^  (dashed)  curves 
as  functions  of  m  Is  made  by  adding  a  positive  constant 
(0.93)  to  the  pE  curve. 

discussion  of  magnets  with  some  comments  on  the 
analogue  for  pyroelectrics  [16,  17]. 

Very  briefly,  changes  in  the  electric  polarization 
of  solids  can  be  brought  about  by  various  means. 
These  embrace  the  application  of  a  strain  (piezo¬ 
electricity).  It  is  pyroelectrics  we  focus  on  in  this 
section.  It  is  relevant  to  note  here  that  ferroelectrics 
constitute  a  technologically  important  class  of  ma¬ 
terials,  in  which  the  polarization  can  be  switched 
by  application  of  an  electric  field — see,  for  exam¬ 
ple,  the  book  by  Lines  and  Glass  [16]  and  the 
review  by  Resta  [17].  The  work  of  King-Smith  and 
Vanderbilt  [18]  should  also  be  mentioned  in  the 
above  context. 
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Below,  the  low-temperature  behavior  of  the  py¬ 
roelectric  coefficient  is  the  focus.  We  stress  that  in 
noncentrosymmetric  crystals,  the  measurable 
quantity  is  the  temperature  dependence  of  the 
electric  dipole  moment:  not  the  polarization  p(T) 
itself.  Early  important  theoretical  work  was  that  of 
Born  [19]. 

Let  us  proceed  first  by  analogy  with  ferromag- 
nets  and  write  the  deviations  of  internal  energy  E 
and  polarization  p  from  their  zero  temperature 
values  as 

A£  =  c^(Ap)^  where  Ap  a  TL  (20) 

To  recover  from  these  low- temperature  forms,  with 
exponents  t  and  s  to  be  determined,  the  Debye 
law  for  the  specific  heat  associated  with  acoustic 
phonon  excitations  we  require  that  AE  a  and 
hence 

ts  =  4.  (21) 

Elsewhere,  models  have  been  considered  to  deter¬ 
mine  the  low-temperature  exponents  t  and  s,  re¬ 
spectively.  We  mention  here  the  work  of  Ohmura 
et  al.  [20],  where  the  starting  point  is  the  well- 
known  Dick-Overhauser  shell  model.  These  work¬ 
ers  argue  that  due  to  the  deformation  of  the  elec¬ 
tronic  charge  clouds  as  the  ions  vibrate,  the  Dick- 
Overhauser  exchange  charge  has  a  quadratic  tem¬ 
perature  dependence  in  the  low-temperature  limit. 
This  leads  to  an  (observable)  pyroelectric  coeffi¬ 
cient  (essentially  dAp/dt)  proportional  to  T,  and 
hence  s  =  2  in  Eq.  (20).  Then  from  Eq.  (21),  t  =  2, 
which  would  be  what  one  would  assert  in  a  molec¬ 
ular  field  treatment.  This  result,  compatible  as 
noted  above  with  a  shell  model,  would  be  changed 
in  a  (less  realistic)  rigid  ion  model,  which  leads  to 
Ap  a  and  hence  t  =  1  in  Eqs.  (20)  and  (21). 
This  is  hence  to  be  compared  with  the  Ising  model 
result  discussed  above  for  magnets  where  AE  a 
Am  at  low  temperatures:  a  result  differing  from 
the  realistic  low-temperature  behavior  in  Eqs.  (5) 
and  (7)  for  insulating  and  itinerant  electron  mag¬ 
nets,  respectively. 

However,  we  must  stress  that  experiment  has 
not  yet  confirmed  the  shell-model  prediction  of  a 
linear  temperature  dependence  of  the  pyroelectric 
coefficient  at  low  temperature,  though  the  simple 
model  of  Ohmura  et  al.  predicts  a  value  of  2  X  10“^ 
/xcm““  K“^  at  5  K,  which  agrees  reasonably  with 
that  measured  by  Lines  [21]  for  LiTa03.  It  is  rele¬ 
vant  in  the  context  above  to  note  that  Grout  et  al, 
[22]  give  upper  bounds  to  the  pyroelectric  coeffi¬ 


cient.  But  at  low  temperatures,  both  the  shell-model 
and  the  rigid-ion  predictions  are  compatible  with 
the  bounds,  so  a  full  microscopic  theory  transcend¬ 
ing  these  models  is  still  required. 


Discussion  and  Summary 

The  internal  energy  of  a  ferromagnet  has  first 
been  treated  from  the  point  of  view  of  the  two- 
dimensional  Ising  model.  One  consequence  of 
working  out  the  change  in  internal  energy  from  its 
ground-state  limit  in  terms  of  the  deviation  AM  of 
the  magnetization  from  its  saturation  value  is  the 
linear  relation  at  low  temperatures.  This  is  modi¬ 
fied  in  both  insulating  and  metallic  three-dimen¬ 
sional  ferromagnets  to  exhibit  an  exponent  correc¬ 
tion  c:  AE  a  (AM)^^^  where  the  fractional  expo¬ 
nent  c  is  §  for  insulating  magnets  and  |  for  metal¬ 
lic  systems.  At  T  =  0,  another  order  parameter  is 
valuable  in  discussing  the  metaHinsulator  transi¬ 
tion  between  two  antiferromagnetic  phases.  This  is 
the  discontinuity  in  the  momentum  distribution  of 
the  metal  at  the  Fermi  surface,  the  further  order 
parameter  being  the  customary  sublattice  magneti¬ 
zation.  The  internal  energy  has  also  been  treated  at 
low  temperatures  in  pyroelectrics  where,  in  con¬ 
trast  to  the  magnets,  the  low-lying  excitations  are 
acoustic  phononlike. 

The  ferromagnet  in  the  Ising  framework  has 
finally  been  reconsidered  in  relation  to  the  en¬ 
tropy,  needed  of  course  to  construct  the  free  en¬ 
ergy  appearing  in  the  Landau  theory  of  phase 
transitions.  The  results  of  our  analysis  are  of  signif¬ 
icance  to  a  number  of  particular  applications,  two 
of  which,  ferromagnets  and  pyroelectrics,  have 
been  discussed  in  the  article. 


Appendix:  Unconventional  Order 
Parameter  for  Metal-Insulator 
Transition  Between  Antiferromagnetic 
Phases 

In  this  Appendix  we  shall  summarize  an  uncon¬ 
ventional  phenomenological  approach  to  the  treat¬ 
ment  of  the  metal-insulator  transition  at  T  =  0 
between  phases  which  both  exhibit  cooperative 
antiferromagnetic  ordering.  Following  the  ideas  set 
out  in  the  main  text,  we  must  first  note  that  to 
model  the  phase  diagram  (the  case  of  three-dimen¬ 
sional  transition-metal  dichalcogenides  is  a  specific 
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example;  see,  e.g.,  the  schematic  phase  diagram  in 
Ref  [10])  of  such  systems  requires  the  incorpora¬ 
tion  of  strong  correlation  interactions  in  narrow 
energy  bands.  In  the  text,  this  situation  is  modeled 
by  the  Hubbard  Hamiltonian,  written  there  for  a 
half-filled  doubly  degenerate  band. 


from  Eqs.  (A3)  and  (A4)  that  AM  a  cj,  as  dis¬ 
cussed  in  the  body  of  the  text.  Experimental  tests 
of  this  relation  as  the  metal-insulator  transition  is 
approached  from  the  metallic  phase  would  be  of 
considerable  interest  but  these  would  be  required 
at  the  lowest  feasible  temperatures. 


PHENOMENOLOGICAL  MODEL  IN  TERMS 
OF  METALLIC  DISCONTINLITY  AT 
FERMI  SURFACE 

In  accord  with  the  approach  of  March  et  al.  [8], 
the  discontinuity  at  the  Fermi  surface  is  to  be  used 
as  an  order  parameter  to  describe  the  metal -insu¬ 
lator  phase  transition  at  absolute  zero  of  tempera¬ 
ture.  Then  (see  also  Chapman  and  March  [23]  at 
elevated  temperatures),  in  the  region  of  the  transi¬ 
tion  in  the  metallic  side  one  expands  E  in  both  the 
discontinuity,  cj  say,  and  the  difference  AM  be¬ 
tween  the  sublattice  magnetizations,  as  a  further 
order  parameter  as  discussed  in  the  body  of  the 
text.  Then  near  the  transition,  and  following  the 
spirit  of  Landau  theory  of  phase  transitions, 

E(q,AM)  =  Eq  +  E^cj  +  \E^q^  +  aAM 

+  ^biAMf  +  \qAM  +  ■■■ ,  (Al) 

where  AM  =  M„^gt  -  Mj^^  <  0,  since  it  will  be 
supposed  that  there  is  magnetic  saturation  in  the 
insulating  phase.  Also 


a  =  I3(U,  -  U)  >0,  El  =  -a(U,  -  H)  <  0 

(A2) 


with  a,  13,  b,  E2,  and  A  all  being  taken  as  positive 
quantities.  The  physical  significance  of  A  is  that 
this  parameter  reflects  the  dependence  on  magne¬ 
tization  of  the  formation  of  electron-hole  pairs. 

Carrying  out  the  minimization  of  E(q,AM)  in 
Eq.  (Al)  with  respect  to  q  and  AM  then  gives 


AM 


[(ctA/Ej)  +  b][U^  -  U] 

(aVE^)  -  b 


(A3) 


and 


[  i3  +  (a/3/A)][{J,  -  Lf] 
(E2VA)  -  A 


(A4) 


Since  AM  <  0  as  argued  above,  we  must  have 
A^/Ejb  <  1.  It  is  evident  from  Eqs.  (A3)  and  (A4) 
that  both  AM  and  q,  in  this  model,  go  continu¬ 
ously  to  zero  at  Lf  ^  L4.  It  immediately  follows 
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ABSTRACT _ 

Self-consistent  electronic  structure  calculations  of  a-doped  quantum  wells  (QW)  in  the 
presence  of  in-plane  magnetic  fields  B  up  to  20  Tesla  are  carried  out  within  the 
frameworks  of  the  effective  mass  and  the  local  density  approximations.  QWs  composed 
of  two  layers  of  Ga^_^A4  As,  separated  by  a  layer  of  GaAs  with  a  donor  §-doped  sheet 
in  the  center,  are  considered.  The  width  of  the  GaAs  layer  was  varied  from  100  to  400  A. 
It  is  shown  that  the  diamagnetic  shift  increases  with  the  increasing  of  the  GaAs  QW 
width.  The  magnetic  field  induces  remarkable  changes  in  the  energy  dispersions  of 
electrons  and  holes,  along  an  in-plane  direction  perpendicular  to  B.  The  most  striking 
effect  occurs  in  the  nature  of  the  band  gap  of  these  systems.  We  found  that  the  valence 
band  displays  a  double-maximum  character  instead  of  a  single  maximum  at  the  center  of 
the  Brillouin  zone.  ©  1996  John  Wiley  &  Sons,  Inc. 


Introduction 

Recently,  an  increasing  interest  has  been  re¬ 
vealed  in  the  use  of  the  B-doping  concept  to 
produce  very  high  carrier  concentration  two-di¬ 
mensional  (2D)  semiconductor  structures  of  sev¬ 
eral  applications,  including  devices  and  the  gener¬ 


ation  of  high-mobility  2D  electron  gas  systems  [1]. 
Such  structures  are  obtained  by  atomic  layer  dop¬ 
ing  of  epitaxially  grown,  e.g,,  GaAs  or  GaAlAs 
crystals.  An  ideal  B-doping  profile  is  achieved  by 
confining  the  dopant  atoms  to  a  crystal  monolayer, 
generating  a  V-shaped  potential  well  in  the  host 
material.  The  presence  of  a  2D  electron  gas  with  a 
characteristic  subband  structure  in  the  V-shaped 
potential  well  has  been  confirmed  by  several  ex- 
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perimental  and  theoretical  works  [2-5].  The  lack, 
however,  of  confinement  of  the  minority  carriers, 
i.e.,  holes  in  the  case  of  n-5-doping,  has  brought 
difficulties,  for  instance,  on  the  use  of  optical  ex¬ 
periments  to  investigate  this  kind  of  system  [6-8]. 

Additional  information  on  the  physics  of  these 

5- doped  structures  can  be  obtained  when  the  5- 
layer  lies,  e.g.,  within  a  quantum  well  (QW),  since 
the  QW  barriers  may  provide  an  extra  degree  of 
control  to  the  number  and  population  of  typically 
various  confined  subbands  [9-11].  This  has  been 
also  achieved  by  building  up  a  superlattice  of  such 

6- layers,  which  allows  for  the  needed  minority 
carriers  confinement  [2,  3]. 

The  effects  of  a  magnetic  field  on  the  2D  elec¬ 
tron  gas  in  semiconductor  heterostructures  have 
been  the  subject  of  intensive  investigations.  Partic¬ 
ularly,  in  the  well-known  modulation-doped  quan¬ 
tum  well  (MDQW)  structures,  the  behavior  of  2D 
carriers  under  the  influence  of  an  in-plane  mag¬ 
netic  field  B  has  been  widely  studied  both  theoreti¬ 
cally  [12-14]  and  experimentally  [15,  16].  A  dou¬ 
ble-minimum  character  of  the  electron  energy 
dispersion  with  a  magnetic  field-induced  indirect 
band  gap  has  been  predicted  [14]  and  confirmed 
by  photoluminescence  measurements  [16].  In  these 
MDQWs,  the  doping  in  the  barriers  provides  the 
required  confinement  of  holes  in  the  valence  band, 
therefore  making  possible  optical  transitions  within 
the  well  to  be  observed.  The  electronic  subbands  of 
an  isolated  n-6-doped  layer  in  bulk  GaAs  under  a 
magnetic  field  parallel  to  the  plane  of  the  donor 
atoms  have  also  been  studied  through  magneto¬ 
conductivity  experiments  [17].  An  increase  of  the 
bottom  of  each  electron  subband  energy  level  due 
to  the  diamagnetic  shift  was  seen.  No  attempts, 
however,  have  been  done  to  investigate  the  effect 
of  the  in-plane  magnetic  field  on  the  holes  in  the 
valence  band  of  such  6-doped  layers. 

In  this  work,  we  present  detailed  calculation 
results  for  the  electronic  structure  of  an  n-type 
6-doped  layer  placed  at  the  center  of  GaAs/ 
Al^Ga^^^As  QWs  (x  =  0.3),  under  in-plane  mag¬ 
netic  fields  B  up  to  20  Tesla.  Calculations,  based  on 
the  self-consistent  (SCF)  one-electron  state  solution 
of  the  Schrodinger  equation  and  the  Poisson  equa¬ 
tion,  were  performed  for  a  sheet  donor  concentra¬ 
tion  N5  =  7.0  X  10^^  cm“^  and  different  values  of 
the  QW  widths  L^.  Many-body  effects  such  as 
exchange  and  correlation  were  taken  into  account 
within  the  local  density  approximation  (LDA).  The 
effects  of  the  field  on  the  electron  and  hole  sub¬ 
band  structures  were  analyzed.  It  is  shown  that 


the  hole  energy  dispersion  has  a  double  maxi¬ 
mum,  which  is  more  pronounced  as  the  QW  be¬ 
comes  larger,  whereas  the  electron  dispersions  keep 
the  single  minima  at  k  =  0.  Thus,  the  band  gap  is 
predicted  to  become  indirect.  For  such  6-doped 
type-I  QWs,  the  valence  band  discontinuity  pro¬ 
vides  the  required  confinement  of  the  holes  for 
optical  investigations.  Besides,  the  presence  of  an 
in-plane  magnetic  field  may  give  further  support 
for  probing  the  nature  of  the  conduction-valence 
band  gap. 


Theory 

The  system  considered  is  a  GaAs/Ga^^^Al^As 
QW  with  a  silicon-6-doped  layer  at  its  center.  We 
use  the  effective-mass  approximation  [18]  to  de¬ 
scribe  the  electrons  in  the  host  semiconductor  het¬ 
erostructure  and  replace  the  localized  donor 
charges  by  a  z-dependent  charge  density  which 
has  been  averaged  over  the  x-y  plane.  A  schemahc 
representation  of  the  layers  in  the  system  investi¬ 
gated  here  is  depicted  in  Figure  1.  All  Si  donors 
are  assumed  to  be  ionized^  and  uniformly  dis¬ 
tributed  within  a  range  of  2  A.  Since  low  tempera¬ 
ture  is  usually  employed  to  reduce  diffusion  of 
silicon  out  of  the  doped  plane,  no  significant  spread 
of  the  donors  is  expected  [1].  The  electrons  re¬ 
leased  by  the  donors  form  a  quasi-2D  electron  gas 
which  screens  the  potential  of  the  positive  charges. 

The  electron-electron  interactions  in  the  con¬ 
duction  band  can  then  be  separated  into  a  Hartree 
(Coulomb)  contribution  due  to  the  electrostatic 
potential  of  the  total  electron  density  and  an  ex¬ 
change-correlation  term.  Hohenberg  et  al.  [19] 
showed  that  the  exchange-correlation  potential  can 
be  obtained  by  taking  the  functional  derivative  of 
the  exchange  and  correlation  part  of  the  ground- 
state  energy  with  respect  to  the  electron  density;  in 
general,  the  exchange-correlation  potential  energy 
is  an  unknown  functional  of  the  electron  density 
n(z).  In  the  simplest  approximation,  i.e,,  the  LDA, 
the  exchange-correlation  potential  is  approxi- 


Si 


FIGURE  1 .  Schematic  representation  of  the  GaAs/ 
GaAlAs  QW  with  a  silicon  6-doped  layer  in  the  center. 
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mated  by  the  exchange-correlation  contribution  to 
the  chemical  potential  of  a  homogeneous  electron 
gas  having  a  uniform  electron  density  which  equals 
the  local  electron  density  n(z)  of  the  inhomoge¬ 
neous  system.  It  is  interesting  to  point  out  that  this 
simple  approximation  works  surprisingly  well  and 
has  been  currently  used  with  great  success  to  ob¬ 
tain  ground-state  properties  of  several  systems  [20], 

The  magnetic  field  was  assumed  to  be  along  the 
y  axis  B  =  (0,  B,  0),  for  which  the  gauge  A  = 
(B2,0,0)  has  been  chosen,  with  z  being  the  direc¬ 
tion  perpendicular  to  the  layers  or  to  the  QW.  The 
problem  is  in  the  LDA  reduced  to  a  one-particle 
Schrbdinger  equation,  with  electrons  moving  in  an 
effective  potential  which  is  a  sum  of  the  Coulom- 
bic  and  the  exchange-correlation  potentials,  plus 
the  magnetic  potential.  The  calculations  are  per¬ 
formed  at  temperature  T  =  0  K. 

The  electronic  states  will  be  denoted  by  the 
wave  function  0(x,  y,  z)  which  due  to  the  transla¬ 
tional  symmetry  in  the  x-y  plane  can  be  written 
as 


(|)  =  (1) 

where  and  are  components  of  the  wave 
vector  k  =  (k^,k^,0)  and  ^(z)  is  the  solution  of 
the  one-dimensional  Schrddinger  equation,  which 
is  solved  for  each  state  i: 


-h  Vr(z)  -h  Vxc(z)  +  Vgfz) 


%(z) 


=  E,%(z), 


(2) 


with  m*  being  the  electron  effective  mass.  Here, 
Vq  Vxc  is  the  effective  potential,  given  by 

the  sum  of  the  Coulombic  potential  Vc(zX  the 
local  exchange  potential  Vxc(^X  the  magnetic 
potential  Vg(z).  The  potential  VgCz)  generated  by 
the  magnetic  field  is  given  by  [14] 


We  consider  a  finite  layer  of  thickness  2d  with  a 
3D  density  of  donors  Uq.  Thus,  —  2  X  d  X  Hq 
is  the  Si  donor  concentration  per  unit  area.  As 
mentioned  before,  it  is  assumed  that  all  the  Si 
atoms  are  ionized,  supplying  the  electrons  to  form 
the  2D  electron  gas.  Therefore,  the  2D  electron 
density  equals  the  sheet  donor  density  Ng.  The 
Coulombic  potential  V^iz)  due  to  the  electrostatic 
electron-electron  and  electron-ionized  donor  in¬ 
teractions,  is  obtained  from  the  Poisson  equation 

d^Vr 

—^= - [n(z)-nj^],  (5) 

dz  s 

where  s  is  the  dielectric  constant,  and  n(z),  the 
electron  gas  density  given  by 

n{z)  =  J^N^ikJ\%(z)\^,  (6) 


with  N^(k^)  being  the  number  of  electrons  per 
unit  area  in  the  fth  subband,  which  is  given  by 


NKkA  = 


2  m* 


-(Ep  -  E,(k,)) 


1/2 


(7) 


where  Ep  is  the  Fermi  energy  obtained  from  the 
condition 


Ns=  Y.N^(kje[Ep  -  E,(k,)]dk,.  (8) 

In  the  above  expression,  0  is  the  Heaviside  step 
function. 

For  the  exchange-correlation  potential  Vxc^^:), 
we  used  the  simple  analytic  expression  suggested 
by  Hedin  and  Lundqvist  [21]: 

/  21 

1  -I-  0. 0368 r, In  1  -h  — 

I 

(9) 


2sat  TTar^ 


Vg(z)  =  ^m*wc(2  + 

where  (Oq  =eB/vff  is  cyclotron  frequency,  e  is 
the  electronic  charge,  and  1  =  yh/(eB)  is  the 
magnetic  length.  The  magnetic  potential  V^(z)  de¬ 
pends  on  the  in-plane  wave  vector  k^  perpendicu¬ 
lar  to  B.  Thus,  the  energies  related  to  the  trans¬ 
verse  axis  X  have  the  form  E^(/c^),  so  that  the  total 
energy  of  the  carrier  is 

Er  =  +  Ei(E/.  (4) 

zm 


with  a*  =  ea^i  being  the  effective  Bohr  radius, 
Tg  =  {[477a*^n(z)]/3}”^/^  and  a  =  [4/(977)]^/^  is 
a  numerical  constant. 

The  value  x  =  0.3  was  assumed  for  the  A1  con¬ 
centration  in  the  alloy  and  the  currently  used 
60-40%  rule  for  the  conduction-to-valence  band 
discontinuity  was  adopted  [22].  This  leads  to  a 
barrier  height  of  226  meV  for  the  QW  where  the 
electrons  are  confined.  The  values  used  for  m*  and 
s  are  0.068m  and  12.536^^,  respectively,  where  m  is 
the  free  electron  mass,  and  Sq,  the  vacuum  per- 
missivity  [23]. 
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We  are  dealing  here  with  an  n-type  6-doping 
system  where  the  majority  carriers  are  electrons. 
To  obtain  the  one-particle  energy  states  for  the 
minority  carriers,  we  assumed  that  they  are  sub¬ 
mitted  to  the  SCF  potential  calculated  for  the  elec¬ 
trons.  This  assumption  is  valid  as  long  as  one 
keeps  the  hole  density  very  low.  For  a  hole  gas  of  a 
finite  density,  a  proper  treatment  is  required  to 
obtain  the  valence  band  dispersion  curves  [24]. 
According  to  the  low-density  limit  assumed  here, 
the  holes  are  submitted  to  the  repulsive  potential 
calculated  for  the  electrons  and  confined  by  the 
valence  band  potential  of  the  QW,  which,  for  x  = 
0.3,  has  a  height  of  148  meV.  The  confined  energy 
states  for  the  holes  are  then  calculated  by  consider¬ 
ing  these  one-particle  external  potentials  and  a 
value  of  0.36  m  for  the  heavy-hole  effective  mass  in 
the  Schrodinger  equation  [23].  We  neglected  the 
effects  of  the  magnetic  field  on  the  light-hole  va¬ 
lence  band,  since  any  occupation  of  this  band  may 
be  ruled  out  in  the  description  of  electron-hole 
recombination  transitions  of  the  ?7-type  6-doped 
QWs  investigated  here  [7].  Changes  in  the  effective 
masses  within  the  GaAlAs  barriers  were  also  ne¬ 
glected.  In  spite  of  the  fact  that  nonparabolicity 
effects  of  the  valence  bands  have  not  been  taken 
into  account  in  the  present  work,  the  inclusion  of 
such  effects  is  not  expected  to  alter  qualitatively 
the  conclusions  made  here  and  is  in  progress. 


Results  and  Discussion 

In  Figure  2,  we  show  the  conduction  band  po¬ 
tential  profile  and  energy  levels  for  a  6-doped  QW 
with  typical  values  of  sheet  donor  concentration 
N5  =  7.0  X  10^^cm~“  and  well  width  =  100  A. 
Two  subbands  Eq  and  E^  are  found  occupied, 
below  the  Fermi  energy  Ep.  Within  the  QW,  there 
is  still  an  empty  level  E2  which  appears  well 
above  Ep.  This  latter  energy  level  may  be  impor¬ 
tant  in  absorption  and  absorption-related  optical 
experiments  [11,  25].  The  6-doped  layer  is  respon¬ 
sible  for  the  spikelike  potential  profile  seen  in 
Figure  2,  in  which  lies  the  lowest  energy  level  Eq 
comprising  most  of  the  2D  electrons. 

Some  general  effects  of  the  magnetic  field  B  on 
the  carriers  are  best  illustrated  assuming  =  0, 
i.e.,  an  undoped  QW.  In  this  case,  V(z)  =  Vc(z)  -h 
Vxc(^)  is  just  a  square-well  potential.  Figure  3 
shows  the  energy  dispersions  along  the  in-axis 
of  the  first  electron  and  hole  subbands  for  =  0 


and  Ly^  =  150  A  and  two  values  of  B:  B  =  0  and 
B  =  20T.  The  results  displayed  in  this  figure  show 
that  the  magnetic  field  increases  the  electron-hole 
recombination  energy,  which  is  seen  by  the  large 
diamagnetic  shifts  at  =  0.  Figure  3  also  shows  a 
flattening  of  the  dispersion  curves  for  both  elec¬ 
trons  and  holes  for  B  A  0  [14]. 

The  effect  of  the  magnetic  field  on  the  electronic 
structure  of  the  6-doped  QW  is  first  considered  in 
Figure  4.  There,  energy  dispersion  curves  for  a 
sheet  donor  concentration  of  N5  =  3.5  X  10^^  cm“^ 
and  a  QW  width  of  Lpy  =  150  A  are  shown.  Solid 
lines  correspond  to  values  in  the  absence  of  mag¬ 
netic  field  (B  =  0)  and  dashed  lines  are  related  to 
B  ==  20r.  First,  we  analyze  the  results  depicted  in 
Figure  4  for  B  =  0.  The  presence  of  the  6-doped 
layer  is  responsible  for  a  strong  attractive  potential 
which  is  partially  screened  by  the  Hartree  poten¬ 
tial  generated  by  the  electrons.  This  makes  the 
carriers  to  feel  higher  barriers,  therefore  being  more 
localized  around  the  doped  plane.  The  energy  lev¬ 
els  appear  several  meV  above  the  bottom  of  the 
potential  well,  which  was  taken  as  the  energy  zero. 
This  is  a  characteristic  picture  of  the  subbands 
inside  6-doped  QWs  [5,  7].  The  presence  of  the 
magnetic  field  flattens  the  energy  dispersions  and 
gives  rise  to  relatively  small  diamagnetic  shifts  if 
compared  to  the  undoped  QW,  as  expected  (see 
Fig.  3). 


FIGURE  2.  Conduction  band  potential  profile  and 
energy  levels  for  a  typical  n-6-doped  QW  with  a  sheet 
donor  concentration  =  7.0  X  10''^  cm~^  and  a  well 
width  =  100  A.  The  subbands  Eq  and  are 
occupied.  The  Fermi  energy  Ep  is  indicated  by  a  dashed 
line. 
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B  =  20T  (dashed  line).  The  zero  in  the  energy  scale  is 
taken  at  the  bottom  of  the  conduction  band  at  =  0  for 
electrons  and  at  the  top  of  the  valence  band  at  the  same 
/f-point  for  holes. 


In  Figure  5  we  present  the  energy  dispersions 
for  electron  and  hole  subbands,  for  different  val¬ 
ues  of  magnetic  fields  for  three  well  widths  Ly^  = 
100,  200,  and  400  A  and  a  fixed  sheet  donor  con¬ 
centration  N5  =  7.0  X  10^^  cm“^  As  expected, 
with  the  increasing  of  Lyy ,  the  energy  of  the  elec¬ 
tron  subbands  is  lowered,  with  higher  levels  start¬ 
ing  to  be  populated.  For  Lyy  =  400  A,  already  four 
subbands  are  occupied  at  =  0.  For  such  a  large 
well  width,  the  system  behaves  essentially  as  an 
isolated  6-doped  layer  [5,  17];  the  compositional 
band  offset  does  not  affect  the  8  potential.  This 
can  also  be  concluded  by  analyzing  the  charge 
distribution  within  the  QWs.  For  the  larger  QW 
width,  the  total  charge  density  is  found  essentially 
localized  around  the  doped  plane,  with  no  pene¬ 
tration  in  the  GaAlAs  barriers.  It  is  worth  mention¬ 
ing  that,  contrarily,  for  the  100  A  width  QW,  the 
barriers  play  a  relevant  effect  on  the  electronic 


FIGURE  4,  Energy  dispersions  of  electrons  and  holes 
for  an  n-5-doped  QW  with  Lyy  —  ^50  A  and  =  3.5  x 
10''^  cm"^  for  B  =  0  (solid  line)  and  S  =  20T  (dashed 
line).  The  Fermi  energy  Ep  Is  also  shown.  The  zero  in  the 
energy  scale  is  taken  as  in  Figure  3. 


states  confinement,  which  is  reflected  by  the  exten¬ 
sion  of  the  wave  functions  within  the  barrier  re¬ 
gions  [26].  The  main  effects  of  the  magnetic  field, 
as  seen  in  Figure  4,  are  the  increase  of  the  subband 
levels  at  ^  0  (diamagnetic  shift)  and  a  flatten¬ 
ing  of  the  dispersion  curves  as  one  moves  away 
from  k^  =  0.  In  contrast  to  what  has  been  ob¬ 
served  in  MDQWs  [14],  the  effect  of  B  is  more 
pronounced  for  the  higher  electronic  subbands. 
This  feature  was  also  seen  in  previous  investiga¬ 
tions  reported  on  [17]  for  the  case  of  an  isolated 
5-layer  in  GaAs. 

The  most  striking  characteristic  observed  in  the 
electronic  structure  of  the  5-doped  QWs  investi¬ 
gated  here  under  the  effect  of  a  parallel  magnetic 
field  is  that  the  maximum  of  the  hole  energy 
dispersion  curves  moves  out  from  the  center  of  the 
Brillouin  zone  (BZ)  already  for  moderate  values  of 
the  field,  whereas  the  electron  dispersions  keep  the 
single  minima  at  k^  =  0.  To  emphasize  the  behav¬ 
ior  of  the  hole  bands.  Figure  6  depicts  the  hole 
energy  dispersions  only,  in  a  larger  scale,  for  the 
5-doped  QW  of  =  400  A  and  Nj  =  7.0  X  10^^ 
cm“^  for  the  several  values  taken  for  B.  The  dou- 
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FIGURE  5.  Energy  dispersions  of  electrons  and  holes,  at  different  magnetic  fields,  for  a  sheet  donor  concentration 
Ns  =  7.0  X  10^^  cm  ^  and  well  width  (a)  1 00  A,  (b)  200  A,  and  (c)  400  A.  The  zeros  in  the  energy  scales  correspond  to 
the  conduction  band  edge  (electrons)  and  valence  band  edge  (holes)  for  =  0.  The  Fermi  energy  Ep  is  also  shown. 
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FIGURE  6.  Heavy  hole  subbands,  as  extracted  from 
Figure  5(c),  for  different  values  of  the  magnetic  field,  for 
an  n-6-doped  QW  of  =  400  A  and  sheet  donor 
concentration  Ns  =  7.0  x  10’^  cm  “  ^  The  zero  of  energy 
was  placed  at  the  top  of  the  valence  band  QW  for  =  0. 

ble-maximum  character  is  clearly  seen.  This  char¬ 
acteristic  behavior  is  also  observed  for  the  nar¬ 
rower  QWs,  although  it  is  more  pronounced  as  the 
QW  width  increases.  Therefore,  under  the  influ¬ 
ence  of  the  magnetic  field,  these  systems  become 
indirect-gap  materials.  This  is  a  novel  effect,  which 
has  been  predicted  for  MDQWs.  Nevertheless,  the 
induced  indirect-gap  in  the  later  systems  is  due  to 
the  displacement  of  the  minimum  in  the  first  elec¬ 
tron  subband  and  has  been  attributed  to  the  effect 
of  the  magnetic  field  on  the  electronic  charge  re¬ 
distribution  [14],  The  effect  is  more  remarkable  as 
the  QW  becomes  larger  and  the  electrons  lying  in 
the  lowest  subband  are  more  sensitive  to  the  self- 
consistent  repulsive  Coulombic  potential,  which  in 
the  case  of  MDQWs  is  generated  by  the  doping  in 
the  barriers.  For  the  5-doped  QWs  studied  here, 
such  a  situation  takes  place  for  the  carriers  in  the 
lowest  hole  subband.  Similarly,  the  holes  in  this 
subband  are  directly  affected  by  the  repulsive  po¬ 
tential  created  by  the  positive  charges  in  the  8- 
sheet. 


Conclusions 

We  have  shown  that  the  in-plane  magnetic  field 
leads  to  significative  effects  on  the  electronic 
ground-state  properties  of  S-doped  GaAs/GaAlAs 
QWs.  The  magnetic  field  brakes  the  symmetry  of 
the  V-shaped  5-doped  QW  by  bending  one  side  of 


the  square  potential,  thus  changing  the  electronic 
charge  distribution  and  the  electron  energy  disper¬ 
sion  curves.  However,  we  verified  that  even  for 
intense  magnetic  fields  the  Fermi  level  position 
remains  quite  stable  and  the  minimum  of  the  occu¬ 
pied  conduction  bands  stays  at  the  center  of  the 
BZ.  This  finding  differs  from  that  observed  for  the 
MDQW  structures  where  the  magnetic  field  in¬ 
duces  a  double-minimum  character  in  the  occu¬ 
pied  conduction  bands. 

The  magnetic  field-dependent  SCF  potential  ob¬ 
tained  from  the  calculations  is  then  used  to  solve 
the  effective-mass  Schrodinger  equation  for  a  sin¬ 
gle  hole  at  the  valence  band  of  the  S-doped 
GaAs/GaAlAs  QWs.  We  show  that  the  one-hole 
energy  dispersion  curves  display  a  remarkable  fea¬ 
ture  due  to  the  magnetic  field,  which  is  the  dis¬ 
placement  of  the  maxima  of  the  valence  bands  out 
from  the  center  of  the  BZ.  In  conclusion,  the  mag¬ 
netic  field  leads  to  a  double  maximum  character 
for  the  valence  band,  therefore  indicating  that  an 
indirect  band  gap  behavior  is  expected  for  these 
systems.  It  is  worth  mentioning  that  this  interest¬ 
ing  behavior  is  predicted  to  occur  even  for  moder¬ 
ate  values  of  the  magnetic  field. 

The  results  reported  here  are  expected  to  be 
useful  for  the  analysis  of  optical  measurements 
performed  on  S-doped  layers:  first,  because  of  the 
extra  degree  of  control  provided  by  the  QW  barri¬ 
ers,  which  allows  for  the  necessary  hole  confine¬ 
ment  for  electron-to-hole  recombinations  be  ob¬ 
served,  and,  second,  as  previously  done  for 
MDQWs,  the  indirect-band  gap  behavior  predicted 
here  for  5-doped  QWs  could  be  further  studied, 
e.g.,  through  photoluminescence  experiments  un¬ 
der  parallel  magnetic  fields. 
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ABSTRACT  - - - - 

The  contribution  reports  results  of  ongoing  computations  of  various  cage  structures 
differing  from  the  well-known  buckminsterfullerene  in  the  number  of  carbon  atoms  (Cgo) 
in  their  coordination  or  in  the  types  of  rings  involved  (C59).  The  computations  have 
supplied  a  complete  description  of  the  seven  isolated-pentagon-rule  (IPR)  isomers  of  Cgo 
(i.e.,  a  system  not  yet  observed)  at  the  semiempirical  AMI  and  SAMI  levels;  their 
energetics  was  also  evaluated  at  ab  initio  Hartree— Fock  level  in  small  basis  sets.  The 
ground-state  structure  of  the  system  possesses  a  symmetry,  but  at  supposed 
synthetic  conditions  a  structure  of  D2  symmetry  is  the  most  populated,  i.e.  temperature 
represents  an  important  factor  in  the  stability  relationships.  One  of  the  IPR  isomers 
exhibits  an  4  topological  symmetry,  but  it  undergoes  a  Jahn-Teller  distortion  and  its 
symmetry  is  actually  reduced  to  D2-  The  C59  serves  as  an  example  of  odd-numbered 
carbon  clusters,  and  new  rules  are  reported  for  the  pentagon/hexagon  pattern  in  the  odd 
clusters,  yielding  a  variable  number  of  the  pentagons.  Moreover,  other  rings  are 
considered  and  the  computed  ground  state  of  the  system  in  fact  contains  an 
eight-membered  ring  (while  the  next  lowest  species  exhibits  a  nine-membered  ring). 
Their  relative  stabilities  are  not  very  sensitive  to  temperature.  Altogether  19  isomers  of 
C59  are  treated  at  the  AMI  level.  In  contrast  to  even-numbered  fullerenes,  with  odd 
fullerenes  the  cages  cannot  be  built  from  three-coordinated  carbon  atoms  only.  Hence,  we 
have  to  allow  for  two-  or  even  four-coordinated  atoms.  For  example,  if  we  consider  one 
two-coordinated  carbon  atom  (and  the  rest  three-coordinated)  the  number  of  pentagons 
drops  to  10.  If  we  allow  for  just  one  four-coordinated  atom,  the  number  of  pentagons 
changes  to  14.  Let  the  number  of  two-coordinated  carbon  atoms  be  p,  and  the  number  of 
four-coordinated  q.  If  we  still  allow  only  for  pentagons  and  hexagons,  it  holds  for  the 
number  of  pentagons  —  —2p  2q  +  12.  ©  1996  John  Wiley  &  Sons,  Inc. 
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Introduction 

Fullerenes  became  known  to  chemist  recently 
enough,  i.e.,  at  times  when  computational 
chemistry,  and  especially  molecular  mechanics  and 
semiempirical  quantum  chemistry,  could  provide 
useful  information  during  the  surge  of  interest  in 
these  molecules.  Owing  to  the  tremendous  devel¬ 
opment  of  computational  hardware  in  the  1970s 
and  1980s,  computational  chemistry  could  well 
follow  the  Cgg  discovery,  and  even  supply  useful 
support  to  further  advances.  Computational  chem¬ 
istry  is  now  playing  the  role  of  a  real  partner  in 
fullerene  research  where  theory  and  experiment 
are  mutually  complementary  tools.  In  this  article 
we  shall  illustrate  fullerene  computations  with  two 
less  conventional  systems,  with  Cgg  and  C59. 


The  Isolated-Pentagon-Rule  (IPR) 
Isomers  of  Cg® 

Three  mixtures  of  fullerene  isomers  were  stud¬ 
ied  in  detail  by  now,  namely  [1-5],  €32  [6-13], 
and  C34  [14-31].  The  missing  system,  Cgg,  has 
received  limited  attention  [32,  33],  apparently  be¬ 
cause  the  particular  fullerene  set  has  not  been 
isolated  yet.  For  the  three  isolated  and  character¬ 
ized  systems  it  has  been  demonstrated  [5,  13,  30, 
31]  that  they  cannot  be  understood  without  con¬ 
sidering  temperature  effects.  Although  interiso- 
meric  separation  energies  are  important,  they 
themselves  alone  cannot  determine  the  relative 
stabilities  at  elevated  temperatures.  The  tempera¬ 
tures  in  fullerene  syntheses  are  high,  and  thus 
entropy  contributions  can  even  overcompensate 
the  enthalpy  terms.  This  was  shown  not  only  on 
fullerene  systems  but  also  for  small  carbon  clusters 
[34]  and  with  silicon  clusters  [35]. 

There  are  seven  topologically  different  Cgg 
structures  [36]  which  obey  the  isolated  pentagon 
rule  (the  structures  are  coded  A-G,  which  corre¬ 
spond  to  the  letters  used  in  Ref.  33,  see  Fig.  1).  In 
this  section,  the  AMI  computations  on  Cgg  are 
reported  together  with  their  relative  stability  eval¬ 
uations. 

The  geometry  optimizations  were  performed 
with  the  implementation  of  the  AMI  semiempiri¬ 
cal  quantum  chemical  method  [37]  in  the  Spartan 
program  package  [38]  installed  at  an  Iris/Silicon 


FIGURE  1.  AM1  optimized  IPR  structures  of  Cgg. 


Graphics  XZ4000  workstation.  The  optimizations 
were  performed  without  any  symmetry  constraints 
in  Cartesian  coordinates  and  with  analytically  con¬ 
structed  energy  gradient.  In  the  AMI  optimized 
geometries,  the  harmonic  vibrational  analysis  was 
carried  out  by  a  numerical  differentiation  of  the 
analytical  energy  gradient.  The  vibrational  calcula¬ 
tions  were  mainly  performed  with  the  Gaussian  92 
[39]  though  some  cases  of  poor  self-consistent  field 
(SCF)  convergency  were  retreated  with  the  SCF 
convergers  built  in  the  MOP  AC  5.0  [40]  program 
package.  The  vibrational  analysis  helped  in  clarifi¬ 
cation  of  the  stationary  point  nature,  in  the  vibra¬ 
tional  spectra  prediction,  and  in  partition  function 
construction. 

Relative  concentrations  of  the  seven  isomers 
were  computed  as  the  mole  fractions  zc,,  using  the 
isomeric  partition  functions  within  the  rigid- 
rotor  and  harmonic-oscillator  approximation.  In  the 
terms  of  and  the  ground-state  energy  A  Hq  ,  the 
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mole  fractions  are  given  [41]: 

q,exp[-AHl,/(RT)] 

ivi  =  -j- - ,  (1; 

Z<liexp[-AHSj/(RT)] 

/-I 

where  R  is  the  gas  constant  and  T  the  absolute 
temperature.  The  conventional  heats  of  formation 
at  room  temperature  were  converted  to  heats  of 
formation  at  the  absolute  zero  temperature,  and 
finally  the  vibrational  zero-point  energy  was  ex¬ 
tracted.  Hence,  the  relative  potential  energies  A 
resulted  from  the  treatment. 

Chirality  contribution  is  to  be  considered  with 
Eq.  (1).  There  is  no  asymmetric  carbon  atom  in  the 
conventional  sense  in  the  fullerene  cages  (with 
three  coordinated  carbon  atoms),  nevertheless, 
some  of  the  structures  are  still  chiral,  i.e.,  they  are 
not  superimposable  upon  their  mirror  image.  This 
structural  dissymmetry  can  readily  be  recognized 
from  the  point  group  of  symmetry  as  the  presence 
of  no  reflection  symmetry,  i.e.,  absence  of 
rotation-reflection  axes  S„:  only  the  C„,  D„,  T,  O, 
and  I  groups  obey  the  requirement.  For  an  enan¬ 
tiomeric  pair  its  partition  function  in  Eq.  (1)  has 
to  be  doubled  (if  we  assume  the  presence  of  both 
optical  isomers,  which  seems  natural  under 
fullerene  synthesis  conditions). 

Symmetry  of  the  optimized  structures  has  been 
determined  by  a  procedure  that  allows  for  a  preci¬ 
sion  of  the  computed  coordinates  as  a  variable 
parameter,  e.  The  origin  of  the  coordinate  system 
is  placed  in  the  center  of  charge  (defined  similarly 
to  the  center  of  mass) — the  point  is  the  only  candi¬ 
date  for  possible  center  of  symmetry.  Candidates 
for  C2  axes  are  either  lines  connecting  any  two 
nuclei  or  perpendicular  bisectors  of  the  distance 
between  any  two  nuclei  of  the  same  kind.  Then 
axes  with  n  >  2,  S„  and  S2„  axes,  and  planes  of 
symmetry  are  investigated.  The  symmetry  opera¬ 
tions  found  have  to  create  one  of  the  known  sym¬ 
metry  groups.  For  each  symmetry  operation  con¬ 
sidered,  coordinates  of  the  interrelated  atoms  be¬ 
fore  and  after  symmetry  operation  are  checked 
with  respect  to  e.  If  the  coordinates  are  identical 
within  the  accuracy  e  (i.e.,  the  largest  difference  is 
smaller  than  e)  a  symmetry  element  has  been 
found  at  the  accuracy  level  e. 

Of  the  seven  IPR  Cgo  structures,  the  species  B  is 
especially  interesting.  It  has  a  topological  symme¬ 
try  4.  However,  according  to  the  concept  of  the 
Goldberg  polyhedra  [42],  it  has  to  be  an  open  shell 


and  it  has  to  undergo  a  Jahn-Teller  distortion  to¬ 
ward  lower  energy  and  lower  symmetry.  This  in¬ 
deed  happens  in  our  AMI  calculations,  though  the 
geometry  distortions  are  quite  small.  This  is  in 
agreement  with  the  findings  for  charged  C^q 
species  [43-45]  where  the  Jahn-Teller  geometry 
distortions  are  of  the  order  of  10“^  A  only.  The 
symmetry  of  the  B  structure  is  relaxed  to  D2, 
however,  the  determination  of  the  symmetry  de¬ 
serves  a  comment.  The  computations  were  per¬ 
formed  in  the  Cartesian  coordinates,  hence,  the 
coordinates  of  the  symmetry-related  atoms  can  be 
subject  to  a  numerical  inaccuracy.  If  we  want  to 
extract  the  related  symmetry  group,  we  have  to 
allow  for  some  threshold  variation  or  accuracy  e  in 
the  coordinates  when  we  judge  symmetry  relation¬ 
ships.  For  very  low  value  (e.g.,  10“^  A)  of  such  a 
threshold  e  we  can  find  simply  symmetry  and 
only  after  its  increase  (say  to  10“^  or  10“^  A)  a 
correct  symmetry  appears  (and  a  further  increase, 
e.g.,  to  10  A,  may  produce  an  artificially  high 
symmetry).  Apparently  the  threshold  choice  should 
be  related  to  the  precision  with  which  the  coordi¬ 
nates  themselves  are  computed.  It  should  also  be 
realized  that  a  usage  of  internal  coordinates  with 
the  strict  symmetry  requirements  may  not  neces¬ 
sarily  help  as  a  possible  imaginary  frequency  may 
be  quite  small  (HF/STO-3G  calculations  [46]  lead 
to  symmetry,  however,  without  vibrational 
analysis).  Using  such  threshold  variation  we  deter¬ 
mined  the  symmetries  presented  in  Table  I.  The 
results  were  consistent  with  exception  of  the  E  and 
G  cases  where  reoptimization  of  already  opti¬ 
mized  structure  sometimes  resulted  in  a  symmetry 
change  that,  however,  was  not  permanent.  Appar¬ 
ently,  the  numerical  precision  available  was  not 
sufficient  to  resolve  the  problem,  and  therefore  we 
list  two  alternatives  of  symmetry  for  the  two  par¬ 
ticular  cases.  However,  we  preferred  the  lower 
symmetry  for  them  in  the  relative  stability  calcula¬ 
tions,  though  we  checked  the  effects  of  the  higher 
symmetries,  too — vide  infra.  Clearly  enough,  each 
of  the  E  and  G  species  is  just  one  structure  with 
well-established  energy,  etc.,  only  the  symmetry  is 
not  easy  to  decide  owing  to  numerical  impreci- 
sions  in  Cartesian  coordinates  (the  details  on  the 
computerized  symmetry-finding  procedure  will  be 
published  elsewhere  [47]). 

The  computations  (both  at  room  and  at  absolute 
zero  temperature)  point  out  the  C  isomer 
symmetry)  as  the  system  ground  state,  being  fol¬ 
lowed  by  the  A  species  of  a  D2  symmetry.  This  is 
in  agreement  with  density  functional  computa- 
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TABLE  ! 


The  AM1  structures®  and  energetics^  of  the  IPR  structures  of  Cqo- 


Species 

A 

(cm“‘') 

B 

(cm“^) 

c 

(cm“') 

^^°298,rel 

(kJ  /  mol) 

Afrel 
(kJ  /  mol) 

SAM1= 

(kJ/mol) 

HF  73-216“ 

(kJ  /  mol) 

A 

D, 

0.00179 

0.00150 

0.00142 

21 

22 

17 

10 

B 

D, 

0.00157 

0.00156 

0.00155 

273 

301 

314 

423 

C 

Dsd 

0.00188 

0.00142 

0.00142 

0 

0 

0 

0 

D 

^2v 

0.00157 

0.00156 

0.00155 

132 

144 

154 

216 

E 

C2  /  C21/ 

0.00167 

0.00156 

0.00146 

73 

77 

75 

98 

F 

C2 

0.00166 

0.00165 

0.00140 

132 

137 

138 

154 

G 

CJC, 

0.00163 

0.00156 

0.00151 

104 

110 

115 

158 

®The  rotational  constants  A,B,  and  C,  and  symmetries. 

^Relative  heats  of  formation  at  room  temperature  (A/^°298.rei)  relative  potential  energies  (AErg,). 
'^The  SAM1  computed  AE^gi  terms. 

^The  HF/3-21G  AE^gi  terms  computed  in  the  SAM1  optimized  geometries. 


tions  [33].  We  also  computed  the  HOMO-LUMO 
gaps  of  the  species  and  found  the  values  4.0,  4.02, 
4.14,  4.15,  4.31,  4.72,  and  4.95  eV  for  the  D,  F,  B,  G, 
E,  C,  and  A  isomers.  Hence,  the  stability  order 
deduced  from  the  HOMO-LUMO  gaps  is  not  iden¬ 
tical  with  the  potential-energy  order.  We  also 
checked  the  computed  separation  energies  by  the 
newly  introduced  SAMI  semiempirical  method 
[48]  (i.e.,  the  geometries  were  fully  reoptimized  at 
the  SAMI  level).  Figure  2  shows  that  the  results 
are  basically  parallel.  However,  the  total  values  of 
the  heats  of  formation  differ  as  the  SAMI  terms 
are  considerably  lower  (but  for  the  relative  stabili¬ 
ties  only  relative  heats  of  formation  matter).  More¬ 
over,  we  also  checked  the  relative  potential  ener¬ 
gies  AEj.gi  at  an  ab  initio  Hartree-Fock  level, 
namely  in  the  standard  3-21G  basis  set  (HF /3-21G), 
though  the  ab  initio  computations  were  performed 
in  the  optimized  SAMI  geometries  (Table  I).  The 
semiempirical  and  ab  initio  results  essentially 
agree.  There  is  only  a  stability  interchange  be¬ 
tween  structures  F  and  G.  Experiments  cannot 
supply  separation  energies  for  fullerene  isomers  at 
present.  With  the  experimental  information  it  is 
virtually  impossible  to  decide  which  set  of  the 
computed  separation  energies  is  the  best  (though 
the  question  is  not  critical  owing  to  the  proximity 
of  the  results). 

The  vibrational  calculations  confirmed  that  all 
the  seven  localized  stationary  points  are  local  en¬ 
ergy  minima  as  there  was  no  imaginary  frequency. 
Table  II  presents  only  12  wavenumbers,  selected 
out  of  234  terms,  for  each  isomer,  namely  the 
lowest,  highest,  and  10  infrared  (IR)  most  intense 
wavenumbers.  Even  for  those  few  representative 


lines,  we  can  see  that  the  isomers  are  vibrationally 
different.  In  particular,  the  most  intense  line  is 
located  in  different  spectral  regions  for  various 
structures.  Of  the  seven  isomers  only  the  C  species 
exhibits  symmetry-adapted  degeneracies  (which  is 
also  seen  in  Table  II).  It  has  been  well  known  that 
quantum  chemical  vibrational  frequencies  are 
somewhat  overestimated  comparing  to  experi¬ 
ment,  and  thus,  scaling  down  (by  a  factor  of  about 
0.9)  is  important  for  spectral  predictions.  But  in 


80 

AErei 

[kcal/mol] 


60 


B 


- A 


AMI  SAMI 

FIGURE  2.  AM1  and  SAM1  computed  relative  heats  of 
formation  of  the  IPR  structures  of  Cgo- 
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The  AM1  lowest  (to^j„),  10  most  intense®  (coj^t),  and  highest  harmonic  vibrational  wavenumbers 

(In  cm  "  for  the  IPR  structures  of  Cqo- _ _ 


Species 


^min  /  ^in 


"int 


^int 


A 

B 

C 

D 

E 

F 

G 


211 

1616  (79) 
150 

1369  (89) 
201 

1610  (69) 
214 

1404  (45) 
224 

1505  (100) 
214 

1528  (27) 
227 

1505  (32) 


918  (61) 

1383  (65) 

1540(57) 

1557  (51) 

1604  (52) 

1630  (53) 

1747  (82) 

1777  (67) 

1832  (100) 

1843 

1102  (57) 

1221  (64) 

1235  (51) 

1350  (100) 

1363  (94) 

1457  (72) 

1532  (89) 

1615  (67) 

1641  (88) 

1775 

1436  (17) 

1465  (26) 

1582  (24) 

1582  (24) 

1610  (62) 

1778  (100) 

1802  (29) 

1804  (19) 

1804(21) 

1818 

897  (43) 

1247  (42) 

1253  (36) 

1326  (100) 

1327  (40) 

1487  (58) 

1524(36) 

1670  (38) 

1761  (34) 

1902 

904  (77) 

1342  (58) 

1347  (69) 

1370  (92) 

1380  (67) 

1530  (43) 

1622  (90) 

1630  (78) 

1830  (57) 

1881 

896  (100) 

1295  (27) 

1307  (64) 

1334  (72) 

1381  (85) 

1677  (38) 

1797  (52) 

1807(43) 

1807  (42) 

1863 

900  (19) 

1328  (100) 

1356  (27) 

1378  (16) 

1489  (20) 

1507  (14) 

1611  (16) 

1656  (19) 

1779  (17) 

1902 

a  jhe  relative  IR  intensities  of  are  given  in  parentheses;  the  most  intense  frequency  has  the  relative  intensity  of  100  (arbitrary 
units). 


fact,  such  a  uniform  scaling  has  a  negligible  effect 
[41]  for  Eq.  (1).  Therefore,  no  scaling  of  the  com¬ 
puted  vibrational  frequencies  was  applied  in  this 
study.  Needless  to  add,  IR  intensity  has  spectral 
but  no  thermodynamic  significance — in  Eq.  (1)  all 
vibrational  modes  must  be  considered  with  equal 
significancy.  Although  lower  frequencies  are  gen¬ 
erally  more  important  for  thermod5mamic  func¬ 
tions,  it  is  the  overall  interplay  of  all  the  234 


modes  that  after  all  matters  for  the  relative  iso¬ 
meric  concentrations. 

Table  III  reports  the  temperature  evolution  of 
the  relative  concentrations,  w,,  for  the  seven- 
membered  mixture  under  the  thermodynamic 
equilibrium  conditions.  At  very  low  temperatures 
the  groxmd-state  structures,  C,  has  to  prevail,  and 
the  relative  concentrations  of  other  structure  have 
to  obey  the  order.  But  at  higher  tempera- 


IMDI-C  III  _ _ _ _ — — - — - - - 

The  AMI  equilibrium  relative  popuiations  (%)  of  the  IPR  structures  of  Cao- 

r(K) 

A 

B 

C 

D 

E 

F 

G 

500 

3.20 

4  X  lO^^'' 

96.8 

5  X  10^" 

3  X  10“® 

2  X  10“” 

4  X  10“® 

1000 

30.0 

1  X  10-® 

69.8 

5  X  10"“ 

0.17 

1  X  lO-'* 

0.01 

1485® 

48.6 

9  X  10“® 

48.6 

0.08 

2.19 

0.02 

0.46 

1500 

49.0 

1  X  10  ■’ 

48.1 

0.08 

2.31 

0.02 

0.50 

2000'' 

53.9 

0.03 

34.3 

1.00 

7.48 

0.23 

3.06 

56.9 

0.03 

36.2 

1.06 

3.95 

0.24 

1.62 

2500 

49.3 

0.67 

24.2 

3.87 

13.1 

0.83 

7.93 

3000 

39.8 

4.82 

16.4 

8.19 

16.4 

1.66 

12.8 

3345^ 

31.9 

12.0 

12.0 

10.8 

16.4 

2.14 

14.7 

3711^ 

23.7 

23.7 

8.28 

12.3 

14.8 

2.38 

14.9 

4000 

18.1 

34.4 

6.00 

12.5 

12.8 

2.37 

13.9 

5000 

6.55 

65.5 

1.91 

9.39 

6.45 

1.71 

8.51 

^  Two-structure  equimolarity  point. 

The  lower  line  for  2000  K  (in  italics)  results  from  the  Cj,,  and  symmetries  of  the  E  and  G  isomers,  respectively  (see  Table  I). 
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tures  preexponential  factors  in  Eq.  (1)  become  im¬ 
portant  (while  the  importance  of  the  exponential 
part  is  gradually  decreased).  At  a  temperature  of 
1485  K  the  A  species  reaches  equimolarity  with  the 
C  species  and  beyond  that  point  is  always  more 
populated.  Nevertheless,  at  considerably  high  tem¬ 
peratures  (above  3700  K)  actually  the  B  species 
becomes  the  most  populated  one  (though  the  tem¬ 
peratures  are  probably  already  too  high  from  a 
practical  point  of  view).  This  surprising  high- 
temperature  importance  of  the  B  isomer  originates 
in  the  fact  that  it  has  the  lowest  low-frequency 
terms,  though  the  chirality  enhancement  is  rele¬ 
vant,  too.  Other  four  structures  (D~G)  are  less 
important  though  not  entirely  negligible.  It  is  of 
interest  to  note  that  the  two  structures  with  some¬ 
what  unclear  symmetries,  E  and  G,  in  fact  do  not 
change  the  overall  picture  even  if  their  alternative 
(higher)  symmetries  are  considered  instead  (as 
demonstrated  for  a  temperature  of  2000  K).  Unfor¬ 
tunately,  we  cannot  confront  the  computations 
with  an  observation  yet, 

A  complex  isomeric  interplay  can  also  be  visual¬ 
ized  in  terms  of  heat  capacity,  namely  as  the  dif¬ 
ference  between  the  standard  molar  heat  capacity 
at  constant  pressure  of  the  equilibrium  isomeric 
mixture  (the  overall  value  Cp  and  a  partial  value 
y  belonging  to  one  selected  reference  isomer,  for 
example,  C,  This  difference,  the  isomerism  contri¬ 
bution  can  frequently  exhibit  [41]  a  sharp 

temperature  maximum  (which  could  even  be  seen 
in  the  overall  term).  Such  maximum  is  present 
in  the  CgQ  system,  too,  and  it  is  quite  pronounced. 
It  is  reached  at  about  3782  K  and  amounts  to 
substantial  98.8  J/K  moU^  (though  it  is  only  about 
5%  of  the  term  at  the  temperature). 

The  reported  considerable  thermal  effects  on  the 
relative  isomeric  stabilities  result  from  an  interplay 
between  rotational,  vibrational,  potential-energy 
terms,  and  chirality  factors.  The  whole  scheme, 
however,  refers  to  a  thermodynamic  equilibrium, 
though  only  to  the  interisomeric  equilibrium.  We 
can  hardly  say  anything  on  the  degree  to  which 
this  presumption  is  satisfied  in  a  future  experi¬ 
ment.  After  all,  fullerene  synthesis  is  probably 
controlled  by  kinetics  rather  than  by  thermody¬ 
namics  and  it  may  explain  why  the  Cg^  has  not  yet 
been  isolated.  Other  explanation  could  be  based  on 
different  charge  distributions  in  the  molecules 
(possibly  connected  with  different  solubility  or  ad¬ 
sorption  properties).  However,  the  most  positive 
and  most  negative  Mulliken  charge  in  the  A  and  C 
structures  is  about  comparable  with  the  AMI  val¬ 


ues  for  CjQ  (though  the  spatial  distribution  of 
charges  matters,  too).  Overall,  computations  pre¬ 
dict  that  under  the  supposed  fullerene-synthesis 
conditions  the  species  A  and  C  should  primarily 
be  produced  (and  the  former  one  in  a  larger 
amount),  and  possibly  other  isomers  in  less  signifi¬ 
cant  concentrations.  The  proportions  are,  however, 
quite  temperature  sensitive  and  thus  the  particular 
temperature  regime  must  be  known. 


Compulations  of 

Fullerenes  are  usually  considered  as  even-atom 
cages  built  from  pentagons  and  hexagons  (while 
other  related  species  are  sometimes  called  pseudo- 
fullerenes).  However,  odd-numbered  carbon  ag¬ 
gregates  have  also  been  observed  [49-51]  around 
both  n  =  60  and  n  =  120.  The  observations  are 
based  on  mass  spectrometry  and  structural  rela¬ 
tionships  are  not  well  known.  Hence,  computa¬ 
tions  are  useful  and  the  computed  structure  of 
was  reported  [42].  The  computations  [42]  pointed 
out  that  rings  other  than  pentagons  and  hexa¬ 
gons  could  also  be  significant,  e.g.,  three-  and 
seven-membered  cycles.  In  fact,  even  with  even- 
numbered  fullerenes  some  other  rings  have  occa¬ 
sionally  been  considered  [43-57].  This  section  deals 
with  computations  of  C59,  its  structures,  energet¬ 
ics,  vibrations,  and  thermodynamics. 

In  contrast  to  even-numbered  fullerenes,  with 
odd  fullerenes  we  cannot  build  the  cage  from 
three-coordinated  carbon  atoms  only;  this  immedi¬ 
ately  follows  from  the  relationship  valid  in  con¬ 
ventional  fullerenes  between  the  number  of  atoms 
V  and  the  number  of  bonds  E:  3V  ==  2  E.  Hence, 
we  have  to  allow  for  two-  or  even  four-coordi¬ 
nated  atoms.  Using  Euler's  theorem  we  can  show 
that  the  number  of  pentagons  (if  we  allow  only 
for  five-  and  six-membered  rings)  is  generally  dif¬ 
ferent  from  12.  For  example,  if  we  consider  one 
two-coordinated  carbon  atom  (and  the  rest  three- 
coordinated)  the  number  of  pentagons  drops  to  10. 
If  we  allow  for  just  one  four-coordinated  atom,  the 
number  of  pentagons  changes  to  14.  With  six  two- 
coordinated  atoms  we  can  completely  avoid  pen¬ 
tagons.  Rules  are  of  course  different  if  we  include, 
for  example,  heptagons.  With  one  heptagon  and 
one  two-coordinated  carbon  atom,  the  number  of 
pentagons  drops  to  11,  etc. 

We  can  go  even  further  and  consider  simultane¬ 
ously  two-,  three-,  and  four-coordinated  atoms  in  a 
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cage.  Let  the  number  of  two-coordinated  carbon 
atoms  be  p,  and  the  number  of  four-coordinated  q. 
If  we  still  allow  only  pentagons  and  hexagons,  we 
can  evaluate  the  number  of  pentagons,  n^,  from 
Euler's  formula: 

^5=  —2p-\-2q-\-  12,  (2) 

which  of  course  reduces  to  the  12  pentagons  if 
p  =  q  ^  0.  There  is  a  possibility  that  =  0,  and 
also  some  parity  requirements  and  value  limita¬ 
tions. 

One  possible  way  to  generate  C59  topologies  is 
to  take  a  C58  structure  and  add  two-coordinate 
carbon  atom  to  a  selected  C-C  bond.  It  is  a  useful 
approach  though  not  all  C59  structures  can  be 
generated  this  way.  Moreover,  we  should  consider 
all  possible  C58  isomers.  This  is  of  course  impossi¬ 
ble  as  it  is  estimated  [58]  that  there  should  be,  for 
example,  1037  isomers  of  C58  of  symmetry 
(thus  producing  1037  X  87  =  90,219  structures  of 
C59).  Therefore  we  decided  to  limit  ourselves  only 
to  the  lowest  known  [59]  structure  of  C58.  It  has  a 
topological  symmetry  but  it  undergoes  a 

Jahn-Teller  distortion  (Fig.  3). 

We  first  selected  the  shortest  and  the  longest 
bonds  in  the  ground-state  structure  of  C58  and 
added  one  carbon  atom  to  them.  It  has  turned  out 
that  the  long  C-C  bonds  led  to  considerably  lower 
energies  of  the  final  C59  species.  Therefore  we 
concentrated  on  particularly  long  6/6  and  5/6 
bonds  of  the  original  C58  cage.  Table  IV  surveys 


FIGURE  3.  AM1  structure  of  Cgg  used  in  generation  of 
some  C59  isomers. 


TABLE  IV _ 

AM1  energetics  of  the  selected  C59  structures. 


Labei^ 

Bridge^ 

Bridged  Bond 
(A) 

(kJ  /  mo!) 

AF/°298,r 
(kJ  /  mol) 

6/8'= 

5/7 

2.319 

4617 

0 

5/9'= 

4/8 

2.264 

4670 

52 

12290'^ 

5/6 

2.305 

4695 

77 

0308'^ 

6/6 

2.322 

4701 

83 

7/7'= 

6/6 

2.331 

4736 

119 

1335 

5/5 

1.592 

4757 

139 

6/7'= 

5/6 

2.357 

4780 

163 

13350 

5/5 

2.177 

4789 

172 

0203O 

5/6 

2.338 

4810 

193 

0203 

5/6 

1.631 

4819 

201 

1229 

5/6 

1.691 

4822 

205 

0102 

6/6 

2.220 

4822 

205 

2022 

6/6 

1.559 

4828 

211 

2140 

6/6 

1.550 

4829 

212 

2456 

6/6 

1.574 

4840 

222 

3047 

6/6 

1.582 

4878 

260 

3i® 

5269 

651 

6/8i 

5/7 

1.524 

5319 

702 

2456i 

6/6 

1.461 

5356 

739 

^The  four-digit  labels  (see  Fig.  3)  for  the  numbering  of 
atoms  involved  in  the  bridging.  Suffix  o  denotes  an  open 
structure  (in  order  to  distinguish  from  the  related  closed 
structure):  suffix  i  denotes  an  internal  (In  cage)  location  of 
the  59th  carbon  atom.  The  two-digit  labels  refer  to  structures 
not  derived  by  a  simple  bridging  and  denote  the  rings  in 
which  the  two-coordinated  carbon  atom  is  involved. 

^  The  type  of  the  bond  bridged  over. 

^See  Figure  4. 

^  See  Figure  5. 

®  The  59th  atom  is  bonded  Inside  the  cage  to  one  atom  only. 


the  computed  results  and  uses  numbering  from 
Figure  3.  Owing  to  addition  of  the  C  atom,  C-C 
bond  (6/6,  5/6,  or  5/5)  can  either  remain  or  can 
be  broken  as  clearly  indicated  by  the  bond  lengths 
of  the  critical  bonds  after  the  full  geometry  opti¬ 
mization.  Figure  4  presents  two  lowest  C59  struc¬ 
tures  derived  from  C58.  The  bridged  bonds  in 
Figure  4  are  of  5/6  and  6/6  types,  and  both 
structures  are  open  (the  critical  bond  broken);  this 
creates  one  and  two  heptagons,  respectively,  in  the 
C59  cage. 

Another,  less  systematic  way  of  C59  generation 
can  start  from  C^q:  remove  one  carbon  atom,  and 
reconstruct  the  cage.  By  this  approach  we  could 
generate  four  additional  structures  (Fig.  5).  They 
contain  one  eight-  (6/8),  one  nine-  (5/9),  three 
seven-  (7 /7),  and  two  seven-membered  rings  (6/7). 
In  both  generation  approaches  we  tried  to  create  a 
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FIGURE  4.  Two  low-energy  Cgg  structures  derived  from 
CgQ  in  Figure  3. 


FIGURE  5.  Four  low-energy  C59  structures  not  derived 
from  C58  in  Figure  3. 


structure  with  a  four-coordinated  carbon  atom  but 
the  geometry  optimizations  never  converged  to 
such  a  structure.  Finally,  we  also  optimized  three 
structures  with  the  59th  atom  inside  (and  not  out¬ 
side)  the  cage;  they  are  all  high  in  energy  (Table 
IV). 

We  can  select  a  limited  set  of  four  isomers,  the 
ground  state  and  three  species  separated  from  it 
by  less  than  100  kj /mol.  The  ground  state  contains 
eight-membered  ring  (6/8,  symmetry),  being 
followed  by  the  species  with  nine-membered  ring 


(5/9,  Cg  symmetry).  Then,  two  structures  derived 
from  Figure  3  follow,  1229o  (C^  symmetry)  and 
0308  (Cg  symmetry).  The  computed  HOMO-LUMO 
gaps  of  the  6/8,  5/9,  1229o,  and  0308  structures 
are  5.7,  6.4,  3.9,  and  4.9  eV,  respectively.  Hence, 
they  do  not  exactly  follow  the  stability  order  given 
by  the  heats  of  formation  at  room  temperature 
(Table  IV). 

Table  V  presents  rotational  constants  and  three 
intense  IR  lines  for  the  four  low-energy  structures 
of  C59.  Variation  in  the  rotational  constants  is  not 
very  significant.  However,  there  are  differences  in 


TABLE  V _ 

AM1  computed  structural  and  vibrational  characteristics  of  the  four  lowest  C59  isomers.® 


ABC 


Isomer 

(cm 

(cm 

(cm  h 

6/8 

0.00299 

0.00279 

0.00277 

1692/100 

1742/95 

1747/96 

5/9 

0.00297 

0.00279 

0.00275 

1286/100 

1742/89 

1745/87 

12290 

0.00300 

0.00281 

0.00278 

1061/100 

1532/71 

1592/67 

0308 

0.00296 

0.00286 

0.00278 

1444/79 

1651/100 

1700/47 

^  See  Table  IV  and  Figures  4  and  5. 

Only  the  three  IR  most  intense  vibrational  wavenumbers  are  presented  followed  by  their  relative  IR  intensities  (after  the  slash). 


1574 


QUANTUM  CHEMISTRY  SYMPOSIUM  NO.  30 


LESS  COMMON  FULLERENE-RELATED  SPECIES 


TABLE  VI _ _ 

Temperature  dependency  of  the  equilibrium  mole  fractions  iv,  of  the  four  lowest  C59  isomers.^ 


Wj  (%) 


Isomer 

500" 

1000" 

1500" 

2000" 

2500^ 

3000^ 

4000^^ 

6/8 

100.0 

99.83 

98.40 

94.89 

89.73 

83.87 

72.70 

5/9 

3X10“'' 

0.16 

1.31 

3.67 

6.59 

9.46 

14.00 

12290 

7  X  10-'^ 

8x10'® 

0.20 

0.99 

2.47 

4.42 

8.69 

0308 

1X10^^ 

3  X  10“® 

0.09 

0.46 

1.21 

2.25 

4.62 

^See  Table  IV  for  the  structure  labeling. 
^  Temperature  in  K. 


the  positions  of  the  IR  intense  lines,  and  they  could 
play  some  role  in  experimental  identification  of  the 
structures. 

Although  the  energy  separation  of  the  four 
low-energy  isomers  is  rather  small,  entropy  does 
not  compensate  it  in  this  particular  case.  As  can  be 
seen  in  Table  VI,  the  ground-state  structure  6/8 
remains  the  far  most  populated,  prevailing  species 
even  at  very  high  temperatures.  However,  this 
result  is  valid  only  for  the  thermodynamic  equilib¬ 
rium,  and  we  cannot  decide  if  this  should  really 
happen  in  an  experiment.  There  may  be  differ¬ 
ences  in  kinetic  barriers  between  the  structures 
originated  in  simple  additions  of  one  carbon  atom 

^58  and  the  structures  produced  by  reorganiza¬ 
tion  after  a  carbon  release  from  C^o  (the  first  chan¬ 
nel  may  be  more  common).  Finally,  it  is  interesting 
to  note  the  position  of  C59  with  respect  to 
the  AMI  computed  heats  of  formation  per  particle 
— it  is  higher  by  about  10.4  kj/mol  atom.  The 
supposed  AMI  ground  state  of  C58  is  placed  about 
8.17  kJ/mol  atom  above  C^q.  From  this  point  of 
view,  the  C59  species  are  actually  not  particularly 
high  in  energy. 

As  a  two-coordinated  carbon  atom  is  involved, 
we  can  ask  about  triplet  electronic  states.  There  is 
a  difficulty  in  computing  their  separation  energies. 
The  unrestricted  HF  (UHF)  treatment  has  better 
convergency  properties  over  the  open-shell  re¬ 
stricted  Hartree-Fock  procedure  (ROHF).  How¬ 
ever,  the  energies  from  the  UHF  treatment  are  not 
directly  comparable  with  the  values  for  closed-shell 
structures  (owing  to  the  higher  flexibility  of  the 
UHF  wave  function  the  stabilities  of  triplet  species 
are  overestimated  [52]).  On  the  other  hand,  the 
ROHF  computations  frequently  diverge. 

This  study  points  out  some  interesting,  new 
geometrical  relationships  in  fullerenes,  once  we 
relax  the  carbon  coordination  number.  Moreover, 
it  indicates  that  arrangements  with  larger  rings 


tend  to  be  energetically  more  convenient  in  the 
odd  fullerenes.  A  deeper  insight,  however,  re¬ 
quires  some  knowledge  on  kinetics  of  the  forma¬ 
tion  processes.  The  suggested  approach  could  also 
be  applied  to  other  odd-numbered  species,  like  C57 
or  C55,  known  from  mass  spectra  [50],  too. 

Note  Added  in  Proof.  Michel  et  al.  [60]  reported 
separation  of  a  C  go  species  for  which  they  con¬ 
cluded  D2  symmetry,  this  being  in  agreement 
with  our  prediction  for  higher  temperatures  (Table 
III,  species  A). 
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ABSTRACT  - - 

The  interaction  on  and  the  rotation  of  C^q  in  akali-doped  C^o  solids, 

(X  =  1,2,3;  A,  A'  =  alkali),  have  been  calculated  with  Buckingham  potential  model  The 
results  show  that  the  total  interaction  on  C^o  changes  dramatically  when  the  pure  C^o 
solid  is  alkali-doped  into  K3C5Q.  The  interaction  on  C^q  in  K3C5Q  is  about  20  times 
greater  than  that  in  pure  And  the  main  component  in  the  former,  occupying  >  90% 
is  electrostatic,  while  in  the  latter,  the  main  components,  occupying  >  90%,  are  dispersive 
and  repulsive.  The  results  also  show  that  in  contrast  to  the  whole-region  free  rapid 
rotation  of  C^o  molecule  in  its  pure  solid,  the  rotation  of  C^q  in  K3C60  is  mostly 
forbidden  due  to  a  10  times  increase  (reaching  about  300  kj/mol)  in  potential  barrier, 
except  for  the  region  from  0°  to  50°  where  a  broad,  smooth,  and  shallow  potential  well 
exists.  Calculations  for  alkali-doped  complexes  other  than  K3C60/  i-C.,  AxA^_xC^q 
(X  =  1,2,3;  A,  A'  =  K,Rb,Cs),  come  to  the  same  conclusion.  Finally,  an  interesting  and 
meaningful  result  is  that  the  superconducting  transition  temperatures  of  AxA3-xC60 
(X  =  1, 2, 3;  A,  A'  =  K,  Rb,  Cs)  change  inversely  with  the  total  interactions  on  C^o- 
©  1995  John  Wiley  &  Sons,  Inc. 
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Introduction 

Pure  CgQ  solid  is  an  insulator,  whose 

molecules  rotate  very  rapidly  on  the  lattice 
points  [1-4].  However,  alkali-doped  solids, 
AxA's-xCeo  (X  =  1,2,3;  A,  A'  =  K,Rb,Cs),  are 
not  only  conductors  but  also  superconductors  at 
low  temperatures  [5-16],  and  the  rotation  of  C^o 
molecules  are  assumed  to  be  forbidden  by  the 
lattice  geometry  [5,  11],  Clearly,  the  properties  of 
alkali-doped  C^q  solids  are  very  different  from 
those  of  pure  solid.  Scientists  have  been  inter¬ 
ested  in  this  issue  and  contributed  a  great  deal  to 
the  literature  [15,  17-37].  Generally,  they  have 
paid  more  attention  to  the  lattice  geometry.  It  is 
held  that  the  lattice  of  solid  expands  after 
being  doped  with  alkali  atoms,  and  then  the  fron¬ 
tier  energy  bands  become  narrow  and  the  density 
of  states  on  the  Fermi  surface  increases,  so  that  the 
superconducting  transition  temperature  Tg-  in¬ 
creases  [15, 16,  29,  37].  As  for  the  d5mamics  proper¬ 
ties,  some  authors  also  hold  that  the  free  rapid 
rotations  of  C^g  molecules  are  forbidden  after  the 
solid  is  doped  with  alkali  atoms,  since  there  is  no 
room  for  Cgg  molecules  to  rotate  [5,  11].  However, 
the  expansions  of  the  lattice  constant  are  small, 
about  0.004  nm  from  Cgg  solid  to  KgCgg  solid  and 
about  0.032  nm  from  KjCgg  solid  to  RbCs2Cgo 
solid.  It  is  not  convincing  that  simply  such  small 
expansions  change  an  insulator  into  superconduc¬ 
tors  and  even  raise  the  transition  temperatures 
from  2.5  to  33  K  [6,  7,  12,  13].  Furthermore,  the 
distance  between  an  A  (or  A')  and  a  C  on  Cgg  is 
much  longer  than  the  sum  of  their  van  der  Waals 
radii,  therefore  one  may  wonder  how  the  free 
rapid  rotation  of  Cgg  could  be  forbidden  simply  by 
the  lattice  geometry.  In  short,  the  insignificant 
variations  of  lattice  constant  carmot  satisfactorily 
explain  the  sudden  property  changes  for  the  solid 
from  being  insulator  to  being  superconductor  and 
for  Cgg  molecules  from  being  able  to  rotate  freely 
and  rapidly  to  being  forbidden  from  that.  In  this 
study  we  consider  the  sudden  change  of  physical 
interaction  on  C^g,  instead  of  the  lattice  geometri¬ 
cal  change,  as  the  essential  reason  for  the  sudden 
property  changes  mentioned  above.  The  interac¬ 
tions  on  Cgg  in  both  pure  Cgg  solid  and  alkali- 
doped  C^o  solids,  AxA'3_xCgo  (X  =  1,2,3;  A,  A' 


=  K,  Rb,  Cs),  have  been  calculated,  and  the  results 
show  that  there  is  a  sudden  change  in  interaction 
from  the  former  to  the  latter,  which  corresponds  to 
the  sudden  change  of  C^g's  rotation  and  the  con¬ 
version  from  insulator  to  superconductor. 


Computational  Model 

Alkali-doped  Cgg  solids,  AxA'3_xCgo  (X  = 
1/2,3;  A,  A'  =  K,  Rb,  Cs),  all  have  the  same  crys- 
tallology  features  as  the  pure  Cgg  solid.  They  all 
belong  to  the  face-centered-cubic  (fee)  structure 
form,  in  which  A  and/or  A'  atoms  occupy  both 
the  tetrahedral  and  the  octahedral  interstitial  sites 
[7,  8,  10-12,  19].  For  each  Cgg  in 
there  are  12  nearest  neighbors  of  C^g  and  14  near¬ 
est  neighbors  of  A  and/or  A'.  Of  these  alkali 
atoms  8  are  filled  in  the  rooms  of  8  tetrahedrons 
and  6  in  the  rooms  of  6  octahedrons.  Since  the 
room  volume  of  the  tetrahedron  is  smaller  than 
that  of  the  octahedron,  the  alkali  atom  with  smaller 
radius  will  first  reside  in  the  room  of  tetrahedron. 
In  order  to  express  the  computational  model  for 
the  interactions  on  Cgg  clearly  and  accurately,  the 
form  of  Table  I  is  used,  where  the  center  Cgg  is  put 
at  the  origin  point  in  a  Cartesian  coordinate  sys¬ 
tem  and  it  rotates  around  the  axis  [1  1  1]. 

In  Cgg  solid,  although  each  C  atom  in  Cgg  is 
electroneutral,  because  each  C  links  with  one  short 
(or  double)  bond  and  two  long  (or  single)  bond, 
the  atomic  point  dipole  moment  on  each  C  atom  is 
not  equal  to  zero,  shown  as  follows; 

Atomic  dpole 


It  is  inconvenient  to  operate  atomic  point  dipole 
moments.  Therefore,  to  make  the  operation  conve¬ 
nience,  the  distribution  of  atomic  site  dipole  mo¬ 
ments  is  transformed  to  the  distribution  of 
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TABLE  I _ ^ - - 

The  coordinates  of  the  neighbors  around  the  center  Cgo- 

Sort  and  number 

State 

of  neighbors 

Coordinate 

Four  Ceo 

On  xy  plane  and  rotating 
synchronously  with  center 
Cgo  around  the  [1  T  T ]  axis 

Four  Cgo 

(0.^1),  (0, -1,1). 

(0,  -1,  (O.i  -5) 

On  yz  plane  and  rotating 
synchronously  with  center 

Cqq  around  the  [T  1  T] 
axis 

Four  Ceo 

(1,0, 1).(1  0,-1), 
(-io,-i),(-io,  1) 

On  zx  plane  and  rotating 
synchronously  with  center 

Cgo  around  the  [T  T  1] 
axis 

Eight  A  and /or  A' 

(1  1  1)  r_l  i  1) 

\4J4>4/j\  4>4J4/» 

(_1  _1  1)  (1  _1  1) 

V  4J  4J4/sV4i  4»4/> 

pi  i\  /  1  1  1) 

\4»4>  4/jV  4’4»  4' 

/I  1  1\  /I  1  1) 

V  4  J  41  4/»  \4  J  4 »  A' 

At  the  center  of  tetrahedron 
composed  of  four  Cgo’s 

Six  A  and  /  or  A' 

(1,0,0),  (-1  0,0), 

(0,1,0),  (0,  -1,0), 

(0,0,  |),  (0,0,  -1) 

At  the  center  of  octahedron 
composed  of  six  CqqS 

atoms  on  different  molecules  or  components  is 
larger  than  the  sum  of  their  van  der  Waals  radii; 
therefore  it  is  convenient  and  accurate  using 
Buckingham  [40,  41]  potential  to  calculate  the  in¬ 
teraction  of  atom  pairs  mentioned  above. 

For  the  sake  of  conciseness,  the  calculating  for¬ 
mulas  of  interaction  on  the  center  C5Q  are  only 
given  in  some  detail  in  the  case  of  when  A  = 
A'(  =  K).  It  is  easy  to  transform  these  formulas  into 
those  when  A  #  A'.  The  calculation  includes  two 
steps,  in  which  the  first  step  is  to  calculate  the 
interaction  on  iri  pure  C50  solid,  namely  ECCgg)- 

60  12  60 

^  E  D  L  ^C-C 

i=l  j=l 

X  exp[  -  Cc_  c  •  <^)]  /  0) 

60  12  60 

Edis(Qo)(‘<>)  =  E  L  L  (2) 

Jt=l  j=l 

£jQo)(w)=  E  E  L 

i=l  k=l  j=l 
12  12  30 

+  E  E  E  QpiQnj^iilk)^^^ 

1=1 k=l  j=l 


nonatomic  site  charges  as  follows: 

Negatiw  charge  at 


This  is  to  say  that  there  is  a  nonatomic  site  positive 
charge  with  0^^=  5^  on  each  of  the  12  centers  of 
five-membered  ring,  and  a  nonatomic  site  negative 
charge  with  Q~=  2q  on  each  of  the  30  centers  of 
short  bond,  q  being  the  effective  charge.  It  has 
been  already  known  from  simulation  that  q  = 
0.13|e|  [38].  It  follows  that  the  interactions  on 
nonatomic  site  charges  should  also  be  considered 
in  calculation. 


Interaction  Potential 

In  solids  of  pure  and  AxA'3_xC6o  = 
1, 2, 3;  A,  A'  =  K,  Rb,  CsX  the  distance  between  two 
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30  12  12 

+  E  E 

Z  =  1  j=l 

30  12  30 

+  E  E  LQuiQniriUoj).  (3) 

;=1  k=l  /=1 

The  second  step  is  to  calculate  the  interactions  on 
Cgo  in  alkali-doped  solid,  namely  ECKjQp): 


=  £;ep(Qo)(^) 

60  8 

+  E  E  Sc-fcexp[-Cc_Kh7(«)] 

i=l  j=l 
60  6 

+  E  E  Sc-K  exp[-Cc_jch7(")]/  (4) 

(  =  1  ;  =  1 

Edis(^^3Qo)(w)  =  Ek(Qo)(‘^) 

60  8 

+  E  L 

60  6 

+  E  (5) 

/=1  /=1 

60  8 

+  E  E  Qc;Qx/,7H*') 

f=i  /  =  i 
60  6 

+  E  EQoQK;r;7\a>) 

1=1 j=i 
12  8 

+  E  E  Qp/Qx/ivnoi) 

/=1 7=1 

12  6 

+  E  E  Q./Qk/^:;- 

/=1 7=1 

30  8 

+  E  E  Q„iQK/,7’(w) 

i=l /=! 


30  6 

+  E  LQniQK/u\<o) 

7=1 /=1 
60  12  60 

“i"  E  E  E  QckQcj(k)^ij(k)^<^') 

7  =  1  1  =  1  ;  =  1 
60  12  12 

E  E  E  QciQpj(k)^ij(k)^^'^ 

7=1  1=1  ;=1 

60  12  30 

4  E  E  E  QciQnj(k)^ij(k)^^^ 

7=1  1=1  ;=1 
12  12  60 

4  E  E  E  QpiQcj(k)^ij(k)^^^ 

,=i  k=k  j=\ 

30  12  60 

4  E  E  E  QniQcj(k)^ij(k)^^^ > 

7  =  1  1=1  j=\ 

(6) 


where  w  is  the  rotation  angle  of  the  molecules, 
and  the  E'fQp)  denotes  the  corresponding  interac¬ 
tion  on  Cpo  as  in  the  pure  C^p  solid,  but  the  lattice 
constant  is  already  changed  into  the  one  as  that  in 
the  K-doped  C^p  solid,  K3C50  solid. 

The  coupling  potential  parameters  needed  in 
the  calculation  are  listed  in  Table  IT 


Interaction  on  and  Transition 
Temperatures  of  the  Superconductors 

Pure  C50  solid  is  an  insulator,  but  AxA'j.^Cgp 
(Y  =  1, 2, 3;  A,  A'  =  K,  Rb,  Cs)  are  superconduc¬ 
tors,  and  it  has  been  noted  that  the  transition 
temperatures  Tq  have  a  parallel  relationship  with 
their  lattice  constants  a.  In  the  literature,  it  has 
been  held  because  of  the  increasing  of  the  lattice 
constant,  the  energy  bands  will  be  changed  to 
narrow,  and  then  the  density  of  states  on  the  Fermi 
surface  increases,  so  that  the  transition  tempera¬ 
tures  Tc  increase.  After  A  and/or  A'  are  filled  into 
the  Cpg  solid,  the  lattice  of  the  doped  solid  is 


TABLE  II _ _ 

Coupling  potential  parameters  for  some  atom  pairs. 


Atom  pair  e{kJ/mol)  C  (nm  ’)  A  (10"®  kJ  ■  nm®/mol)  Refs. 

0.36000  -24.39800  [39,40] 

0.30450  -6.89781  [41,42] 

0.28675  -8.67123  [41,42] 

0.28184  -9.34565  [41,42] 


C”C 

369743.0 

K~C 

11791.062 

Rb-C 

11058.562 

Cs-C 

10215.256 
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expanded,  but  it  is  a  gradual,  not  a  sudden  change. 
Therefore,  this  geometrical  factor  of  cell  change 
may  not  be  a  good  reason  for  the  sudden  change 
of  the  electricity  and  the  superconductivity. 

In  this  work,  the  interactions  on  C^q  in  solid 
and  in  AxA'3_xC6o  ~  1/2,3;  A, A'  =  K,Rb,Cs) 
solids  have  been  calculated.  From  the  computa¬ 
tional  results,  it  is  seen  that  there  is  a  sudden 
change  for  the  interactions  on  C50  from  in  pure  C^o 
solid  (  -  435.5  kj /mol)  to  in  alkali-doped  0^0  solids, 
AxA'3_xC6o  (X  =  1,2,3;  A, A' =  K, Rb, Cs),  for 
example,  -9517.1  kj/mol  for  K3C6o-  Therefore,  it 
will  be  important  and  interesting  to  associate  the 
physical  factor  of  interaction  on  C^q  with  the  su¬ 
perconducting  transition  temperature  T^,  and  the 
relationship  between  them  is  shown  in  Table  III. 

From  the  data  listed  in  Table  III,  it  can  be  easily 
seen  that  there  is  an  antiparallel  change  relation¬ 
ship  between  the  superconducting  temperatures 
Tq  and  the  interactions  on  Cgg  in  AxA'3_xC6o 
(X  =  1,2,3;  A,  A'  =  K,Rb,Cs)  solids.  One  can  see 
this  relationship  more  clearly  from  Figure  1. 


The  Interaction  on  and  the  Rotation 
Dynamics  of 

A  series  of  interaction  on  i^  *1^®  range  of 
w  =  0-120°  have  been  calculated,  which  are  shown 
in  Figure  2.  For  ease  of  comparison,  the  relative 
interaction  RE(w)  =  E(w)  -  E(0)  is  used  in  this 
figure. 

Figure  2  shows  that  the  absolute  values  of  the 
total  interaction  on  C50  in  K3C50  solid  is  about 
20  times  greater  than  that  in  pure  solid,  and 
the  corresponding  relative  variation  in  K3Cgg  solid 
is  about  10  times  greater  than  that  in  C^g  solid,  the 
absolute  value  of  the  electrostatic  interaction  E^^ 
on  Ceo  in  KgCgg  solid  is  about  60  times  greater 
than  that  in  solid  Cgg,  and  the  corresponding  rela¬ 
tive  variation  in  KgCgg  solid  is  about  15  times 


a(mn) 

1.421  1.428  1.435  1.442  1.449  1.456 


FIGURE  1.  Transition  temperatures  Tq  vs.  interactions 
|E(&>)1  on  Cgo  or  lattice  constants  a. 


greater  than  that  in  Cgg  solid;  the  repulsive  and 
the  dispersive  interactions  in  two  systems  do  not 
change  too  much.  It  is  also  seen  that  the  electro¬ 
static  interaction  is  the  essential  component  in 
KjCgg  solid,  being  >  90%,  but  the  repulsive  and 
the  dispersive  interactions  are  the  essential  compo¬ 
nents  in  the  C^g  solid,  being  >  90%. 

Making  a  comparison  of  the  potential  barriers 
between  two  systems  of  the  pure  Cgg  solid  and  the 
alkali-doped  Cgg  solid,  KgCgg  solid,  it  can  be  seen 
that  in  pure  Cgg  solid,  the  rotation  potential  barrier 
is  low,  ~  31  kJ/mol,  which  can  be  overcome  by 
the  thermomotion.  Then  the  molecule  can  ro¬ 
tate  rapidly  and  freely;  while  in  alkali-doped  Cgg 
solid,  KjCgg  solid,  the  potential  barrier  is  high,  up 
to  ~  280  kJ /mol,  which  cannot  be  overcome  by 
thermomotion.  Therefore,  the  rotation  of  the  Cgg 
molecule  will  be  forbidden  in  KgCgg  solid.  But  it  is 
also  seen  that  there  is  a  broad  and  smooth  poten¬ 
tial  well  in  the  region  from  0°  to  50°.  Then  the  Cgg 
molecule  can  rotate  in  this  region  at  room  temper¬ 
ature.  In  other  words,  the  rotation  of  Cgg  molecule 


TABLE  III 


Comparison 


nf  thP  interactions  on  and  the  superconducting  transition  temperatures  (nm;  kJ  /  mol;  K). 


Solid 


Ks^eo 


KjRbCg 


K2CsCg 


KRbgCgo  RbjCso  RbaCsCgg  RbCSgCgp 


a 


1.420 

-435.5 


1 .4240 
-9517.1 
19.3 


1 .4243 
-9515.1 
23 


1 .4292  1 .4323  1 .4384 

-9480.7  -9452.9  -9403.4 

24  27  29 


1 .4431  1 .4555 

-9368.1  -9271.7 

31  33 
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©(degree) 


253.22 

^d£,(^3<^6o)(0)  = -684.32 
^-9086.0 
^/./(^3<^6oKO)=-9517.1 


^^p(QoKO)=  261,78 
^^(C6oX0)= -679.92 
^„(C«XO)  = -177.77 
^..(C'«X0)=-435.49 


FIGURE  2.  Relative  interaction  energy  RE(a;)  vs. 
rotation  angle 


is  only  forbidden  to  a  small  region  about  0-50°. 
That  is  to  say  that  the  rotation  of  molecule  in 
alkali-doped  solid  is  neither  wholly  free  nor 
wholly  forbidden,  where  C^p  can  rotate  in  a  small 
region.  This  conclusion  is  not  in  agreement  with 
those  from  other  authors  [5,  11,  24].  About  the 
rotation  of  C^q  is  K3Cgg  solid,  there  are  two  points 
of  view  in  the  literature,  namely  one  is  that  the 
rotation  of  C^q  molecule  is  wholly  forbidden  be¬ 
cause  of  the  geometrical  factor  [5,  11].  From  our 
computational  results  it  is  seen  that  it  is  the  higher 
potential  barrier  coming  from  the  interaction  on 
Cpo  iri  its  alkali-doped  complex  that  confines  the 
free  rotation  of  Cgg  molecule.  Another  viewpoint  is 
that  the  rotation  of  Cgg  molecule  is  expected  to  be 
free  wholly  according  to  the  line  width  of  ^^C-NMR 
spectra  of  KgCgg  solid  [24].  Although  the  spectrum 
peak  is  not  broad,  it  is  not  narrow  enough  to  be 
seen  as  a  line.  Therefore,  it  is  not  exact  evidence  to 
demonstrate  that  the  rotation  of  molecule  is 
wholly  free  as  that  in  pure  Cgg  solid.  In  addition,  it 
is  not  very  reasonable  to  make  a  conclusion  just 
from  the  test  of  the  '^C-NMR  spectra.  Now,  these 
two  distinctly  contrary  viewpoints  are  united  into 
one  viewpoint  as  shown  above. 
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ABSTRACT _ 

In  this  work,  we  examine  several  continuum  models  for  estimating  the  free  energy  of 
solvation.  We  find  statistically  that  the  best  overall  one-parameter  fit  depends  only  on 
whether  there  is  a  hydrogen  bond  or  not.  Within  the  non-hydrogen-bonded  set  of 
molecules,  a  three-parameter  fit  including  molecular  volume,  molecular  surface  area,  and 
the  electrostatic  component  from  any  reaction  field  model  is  quite  successful.  We  do  not 
find  a  strong  bias  for  any  of  the  dielectric  models,  although  the  PCM  model  of  Tomasi 
and  co-workers  seems  to  slightly  surpass  the  others.  Within  the  hydrogen-bonded  set  of 
compounds,  a  good  fit  can  be  obtained  simply  by  considering  the  surface  accessible  area 
of  each  atom,  together  with  a  dispersion  and  electrostatic  interaction  descriptor  (a 
six-parameter  fit).  Such  a  model,  containing  primarily  nonquantum  chemical  descriptors, 
extrapolates  very  well  to  the  solvation  energies  of  the  non-hydrogen-bonded  set  with  an 
overall  =  0.9334.  ©  1996  John  Wiley  &  Sons,  Inc. 


Introduction 

The  use  of  continuum  models  for  simulating 
solvent  effects  in  quantum  chemistry  has  a 
history  of  more  than  two  decades.  The  publication 
of  several  recent  reviews  on  the  subject  [1,  2] 
indicates  that  the  area  has  reached  a  certain  level 
of  maturity  and  acceptance.  Many  articles  are  now 
in  the  literature  using  one  model  or  another  ad¬ 


dressing  the  differences  that  one  obtains  between 
quantum  chemical  calculations  in  the  gas  phase 
and  modeled  in  solution. 

In  spite  of  some  debate,  most  of  these  models 
are  very  successful  in  describing  the  chemistry 
that  they  examine  if  the  solvent  is  not  water  or 
potentially  hydrogen  bonding  to  the  solute.  It  is 
possible  in  these  cases  that  models  based  upon  the 
dielectric  properties  of  the  solvent  alone  have 
reached  their  limits,  at  least  in  predicting  such 
properties  as  heats  of  solution.  Current  research  is 
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shifting  toward  the  dynamic  aspects  of  solvent- 
solute  interactions,  using  various  combined  classi¬ 
cal/  quantum  models  [3~9].  Another  approach  that 
we  are  examining  is  super-molecule  quantum  me¬ 
chanics  in  which  the  quantum  motif  contains  the 
solute  and  several  solvent  molecules  from  the  first 
solvation  shell,  all  embedded  in  a  dielectric  contin¬ 
uum. 

Despite  the  existence  of  several  continuum 
models  of  solvation,  most  of  them  appear  to  be 
implemented  in  different  quantum  chemical  envi¬ 
ronments  and,  in  many  cases,  exist  only  as  local 
modifications  to  available  codes.  Access  to  these 
codes  is  in  some  cases  limited  to  the  research 
groups  themselves  and  their  closest  collaborators. 
Only  a  limited  number  of  methods  (SCRF  in 
MOPAC  [10]  and  GAUSSIAN  [11],  COSMO  [12]  in 
MOPAC  [13],  SM*  methods  [14-18]  in  AMSOL 
[19])  have  been  made  publicly  available  in  quan¬ 
tum  chemical  packages. 

As  a  consequence  of  the  above,  there  appear  to 
be  very  few  (if  any)  comparisons  of  the  several 
different  approaches  to  the  problem  of  solvation, 
in  which  all  factors  not  relevant  to  the  particular 
model  would  be  eliminated.  Here,  we  examine 
and  compare  several  theoretical  approaches  to  the 
different  components  of  the  free  energies  of  solva¬ 
tion,  using  the  same  Hamiltonian,  basis  set,  and 
set  of  test  compounds. 


Methods  Studied 

The  free  energy  of  solvation  is  usually  described 
as  consisting  of  the  following  components  [2, 
20-22]: 


=  W(M/S)  -  RT  In 


^rot,  g  ^vib,  g 
^rot,  s  ^vib,  s 


-  RT  In 
W(M/S)  =  AG,,  + 


(1) 


+  ^^rep-  ^2) 


Here,  W( M/S)  is  the  coupling  work  of  solvent  (S) 
and  solute  (M);  g,  Ws/  and  are 

the  microscopic  partition  functions  for  rotation  and 
vibration  of  M,  in  the  gas  phase  and  in  solution; 

g  and  g  are  the  numerical  densities  of  M 
molecules;  and  and  ^  are  the  momentum 
partition  functions.  The  free-energy  components  of 


the  coupling  work  come  from  the  electrostatic  in¬ 
teraction  (AG,|),  cavitation  (AG^^v),  dispersion 
(AG,„g),  and  repulsion  (AG^,p). 

The  current  work  concentrates  on  the  AG,j,  AG^g^ 
and  AG^is/  as  these  are  the  major  components  of 
AGg^,.  The  other  components  are  generally  consid¬ 
ered  to  be  negligibly  small. 


ELECTROSTATIC  lIXTERACTION  ENERGY 

The  term  most  studied,  and  that  with  the  great¬ 
est  number  of  methods  available  for  its  calcula¬ 
tion,  is  the  electrostatic  interaction  term,  AG,,. 
Within  the  continuum  approach,  it  is  typically 
modeled  as  the  energy  of  building  up  a  charge  on 
a  cavity  surface,  in  the  presence  of  the  electric  field 
created  by  the  solute.  The  solvent  is  modeled  as  a 
(usually  isotropic)  dielectric  medium,  character¬ 
ized  by  its  dielectric  constant,  e.  For  cavities  of 
spherical  or  ellipsoidal  shape,  analytical  solutions 
exist  for  the  Poisson  equation: 


V20(r) 


47rp(r) 

6 


(3) 


where  </>  is  the  electrostatic  potential  and  p  is  the 
charge  density  at  the  point  r.  For  other  shapes  of 
cavities,  and  e  different  from  infinity,  one  has  to 
resort  to  numerical  methods  of  solving  the  electro¬ 
static  problem. 

The  Poisson  equation  for  a  point  dipole  in  a 
spherical  cavity  was  applied  to  the  phenonemon  of 
solvation  by  Onsager  in  the  1936  [23].  Inclusion  of 
the  respective  interaction  terms  into  a  quantum- 
chemical  Hamiltonian  was  first  proposed  by 
Hylton,  Christoffersen,  and  Hall  [24],  with  imple¬ 
mentations  following  shortly  thereafter  from  sev¬ 
eral  groups  [25,26].  In  the  current  study,  we  com¬ 
pared  the  models  employing  spherical  and  ellip¬ 
soidal  cavities.  Due  to  limitations  of  the  imple¬ 
mentation  available  to  us,  the  electric  field  created 
by  the  solute  was  expanded  up  to  hexadecapole 
moments  when  spherical  cavities  were  considered. 
For  ellipsoidal  cavities,  only  the  dipole-level  ex¬ 
pansion  was  available  and  studied.  Traditionally, 
this  class  of  methods  has  been  called  the  self-con¬ 
sistent  reaction  field  (SCRF)  [25,26]. 

There  are  two  approaches  for  inclusion  of  the 
solvent-solute  interactions  into  the  Hamiltonian, 
depending  on  the  assumptions  of  the  derivations 
[27].  The  terms  ''model  A"  and  "model  B"  have 
been  used  for  these  in  the  past.  Model  A  freely 
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varies  the  total  energy  of  the  system 


=  Equantum('P)  +  4 


(4) 


subject  to  the  normalization  of  the  wave  function 
'P,  «^|'I^>  =  1)  and  a  Hamiltonian  is  derived  of 
the  form 


Hn  +  2H,„ 


(5) 


and 


^  (6) 


The  factor  of  two  arises  because  of  the  quadratic 
dependence  of  the  interaction  energy, 
on  the  wave  function  '4'',  and  for  this  reason,  W 
must  be  corrected  for  the  loss  of  free  energy  that 
the  solvent  experiences  when  disturbed  by  the 
solute: 


lated  within  the  ZDO  approximation,  and  the  cor¬ 
responding  ZDO  point  charges  were  used  as  the 
remaining  part  of  the  expansion.  For  an  ab  initio 
implementation,  inclusion  of  the  two-center  dipole 
terms  would  be  required. 

A  method  which  can  be  viewed  as  positioned 
between  the  above  two  is  the  multicavity  (MCa) 
SCRF  approach,  proposed  by  Karelson,  Tamm,  and 
Zerner  in  1993  [37].  Here,  the  molecule  is  parti¬ 
tioned  into  rotationally  invariant  groups  (frag¬ 
ments),  each  embedded  in  a  apherical  cavity  with 
its  own  Onsager-type  reaction  field.  The  original 
formulation  included  additional  terms  for  inter¬ 
fragment  reaction  field  interactions.  The  approach 
was  revised  prior  to  the  current  study,  and  in  the 
latest  implementation,  the  interfragment  interac¬ 
tion  terms  have  been  omitted.  The  justification 
behind  this  change  will  be  published  elsewhere. 


p  =  wA  _  r 

■^univ  ^interaction* 


(7) 


CAVITATIOIN  ENERGY 


Model  B  introduces  a  second  Lagrange  con¬ 
straint  such  that  the  quantum  chemical  energy 
yields  directly,  with 

^  ^0  ^interaction 


SCRF  models  are  generally  of  type  A,  whereas 
direct  reaction  field  models  are  generally  of  type  B. 
For  more  details,  the  reader  is  referred  to  [27]. 
Both  approaches  were  included,  when  available,  in 
the  current  study. 

The  most  successful  and  widely  known  algo¬ 
rithm  for  solving  the  Poisson  equation  for  a  cavity 
of  a  general  shape  in  quantum  chemical  calcula¬ 
tions  is  likely  the  polarizable  continuum  model 
(PCM),  introduced  by  Miertus,  Scrocco,  and 
Tomasi  in  1981  [28].  In  our  implementation  of  the 
PCM  model  [29],  the  electric  field  of  the  solute  was 
derived  from  its  molecular  charge  distribution  ex¬ 
panded  as  a  set  of  point  charges  and  point  dipoles 
centered  at  the  nuclei.  While  less  rigorous  than  the 
direct  calculation  of  the  electrostatic  potential  at 
the  cavity  surface,  it  is  remarkably  faster  and  has 
been  shown  to  produce  results  of  comparable  ac¬ 
curacy  to  the  full  treatment  [30].  Different  ap¬ 
proaches  can  be  used  for  estimating  the  atomic 
point  charges,  of  which  the  Mulliken  and  ZDO 
charges  are  readily  available.  These  approxima¬ 
tions  bring  the  model  close  to  the  Generalized 
Born  model  [31-36].  In  the  case  where  atomic 
point  dipoles  were  also  included,  these  were  calcu- 


The  energy  associated  with  the  formation  of  a 
cavity  inside  the  solvent  is  usually  termed  the 
cavitation  energy,  The  most  widespread 

approach  toward  its  estimation  is  likely  the  scaled 
particle  theory,  originally  introduced  by  Pierotti 
[381. 

Another  widely  used  approach  is  based  on  the 
assumption  that  the  cavitation  energy  is  roughly 
proportional  to  the  surface  area  of  the  solute.  Vari¬ 
ous  solvent-accessible  surface  area  (SASA)-based 
approaches  are  known  from  the  literature  [39-41]. 
For  the  current  study,  a  straightforward  linear 
relationship 

AG,,,  =  crA,  (9) 

where  A  is  the  surface  area  and  <t  is  a  proportion¬ 
ality  factor,  was  adopted. 

A  more  sophisticated  elaboration  of  the  same 
approach  has  been  used  in  the  successful  SM1-SM5 
solvent  models  introduced  by  Cramer  and  co¬ 
workers  [14-18].  In  this  case,  proportionality  fac¬ 
tors  (named  ''surface  tensions"  by  the  authors)  are 
established  for  each  type  of  atom  in  the  molecule, 
where  "type"  denotes  either  a  chemical  element  or 
an  element  in  a  certain  molecular  environment.  In 
the  later  models,  hydrogen  atoms  are  not  assigned 
separate  surface  tensions,  but  those  of  the  bonded 
atoms  are  adjusted  according  to  the  number  of 
attached  hydrogens.  For  this  study,  the  most 
straightforward  approach  was  chosen,  where  the 
exposed  surface  area  for  each  different  element 
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was  multiplied  by  a  proportionality  factor: 

N 

Eo-.A.  (10) 

k=l 

where  N  is  the  number  of  atoms  in  the  molecule; 

the  exposed  surface  area  of  the  atom;  and  a 
proportionality  factor  (surface  tension).  In  this  case, 
there  are  different  values  for  ctj,.  for  each  different 
element,  independent  of  its  bonding  environment. 

DISPERSION  ENERGY 

For  the  current  study,  only  two  methods  of 
evaluating  the  dispersion  energy  were  at  our  dis¬ 
posal.  The  first  one  is  the  configuration  interaction 
(Cl)-based  approach  developed  by  Rosch  and 
Zerner  [42],  which  has  given  very  good  results  for 
describing  solvent  shifts  in  electronic  spectra.  The 
second  approach  is  based  on  the  experimentally 
known  fact  that  larger  molecules  have  higher  dis¬ 
persion  energies,  and  the  dispersion  is  roughly 
proportional  to  the  volume  of  the  molecule: 

=  qV,  (11) 


The  compounds  and  the  related  experimental  data 
are  assembled  in  Table  I.  Crystallographic  struc¬ 
tures  were  used,  and  no  geometry  optimization 
was  attempted  in  the  solution-phase  quantum 
chemical  calculations.  This  might  have  allowed  for 
a  more  complete  comparison  of  the  different  mod¬ 
els  of  solvation.  However,  we  felt  that  we  did  not 
want  to  compare,  at  least  not  yet,  errors  in  geomet¬ 
ric  predictions  using  a  model  only  of  modest  accu¬ 
racy  is  estimating  gas-phase  geometries  (lNDO/1). 
This  will  be  a  source  of  error,  however,  in  our 
comparisons  [43,44]. 

All  calculations  were  performed  using  the 
INDO/1  Hamiltonian  [45-47]  as  implemented  in 
the  ZINDO  [48]  program.  The  Cl  calculations  nec¬ 
essary  for  calculation  of  the  dispersion  energies 
were  performed  using  a  spherical  cavity. 

Cavity  sizes  for  spherical  cavities  were  calcu¬ 
lated  from  the  mass  density  (d),  molecular  weight 
(M),  and  Avogadro's  number  (N^): 


R  = 


10^^  •  M 
~d-N, 


1/3 


/M' 

0.732  —  |(A),  (12) 


where  V  is  molecular  volume,  and  p,  a  propor¬ 
tionality  factor. 

ENERGY  OF  HYDROGEN  BONDS 

Hydrogen  bonding  is  a  phenomenon  which  is 
traditionally  studied  outside  of  continuum  solva¬ 
tion  models,  due  to  its  high  specificity  and  direc¬ 
tionality,  caused  by  association  energies  consider¬ 
ably  greater  than  kT,  the  thermal  energy.  For  the 
pure  energetic  effect,  however,  it  was  found  im¬ 
portant  (and  sufficient  for  the  current  study)  to 
include  a  single  parameter,  ''hydrogen-bonding 
energy,"  which  would  be  set  at  a  fixed  value  for 
the  molecules  which  are  known  to  form  hydrogen 
bonds  and  to  zero  otherwise.  This  parameter  will 
be  called  the  "hydrogen-bonding  descriptor"  in 
the  subsequent  text. 


Calculations 

For  the  current  study,  a  set  of  30  small  organic 
molecules  was  assembled.  The  main  selection  cri¬ 
teria  were  a  diversity  of  functional  groups  and 
structures,  availability  of  experimental  geometries, 
densities,  and  free  energies  of  sovation  in  water. 


assuming  no  empty  space  between  the  molecules. 
For  the  elliptical  cavities,  the  same  volume  was 
preserved,  but  the  ratio  of  the  axes  of  the  ellipsoid 
was  adjusted  according  to  the  moments  of  inertia 
of  the  molecule.  For  purposes  of  calculating  these 
moments,  the  atomic  volumes,  not  masses,  were 
used. 

The  cavity  shapes  for  the  PCM  models  were 
obtained  from  the  GEPOL93  [49]  algorithm,  as  van 
der  Waals  surfaces,  using  radii  compiled  by  Bondi 
[50].  There  have  been  suggestions  in  the  literature 
to  multiply  Bondi's  radii  by  1.2  (see,  e.g.,  [2]  for  a 
discussion  and  further  references).  We  found  that 
such  multiplication  decreases  the  electrostatic  sol¬ 
vation  energies  on  the  average  by  1.73  =  1.2^  times, 
with  a  very  high  correlation  between  the  energies 
obtained  by  using  the  modified  and  unmodified 
radii  (R^  =  0.998).  Therefore,  we  dropped  the  mul¬ 
tiplier  and  assumed  that  any  necessary  adjust¬ 
ments  will  be  done  at  the  data  analysis  stage.  The 
GEPOL93  algorithm  also  supplied  us  with  the  to¬ 
tal  and  partial  surface  areas  for  cavitation  energy 
calculations,  as  well  as  another  set  of  molecular 
volumes.  The  latter  correlated  very  well  with  the 
mass  density  volumes  [eq.  (12)],  and  due  to  their 
general  availability,  the  GEPOL  volumes  were  used 
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TABLE  I _ 

The  solute  molecules  in  this  study. 


Name  of  molecule  Classification 


Methane 

1.93 

Ethane 

1.77 

Propane 

1.98 

2-Methylpropane 

2.28 

Cyclohexane 

1.24 

Octane 

2.89 

Benzene 

-0.88 

Toluene 

-0.77 

Ethylbenzene 

-0.60 

Acetic  acid 

-6.70 

Pyridine 

-4.70 

Chloromethane 

-0.56 

Chloroform 

-1.07 

Dichloromethane 

-1.41 

Acetone 

-3.85 

Methylamlne 

-4.56 

Nitroethane 

-3.71 

Acetonitrile 

-3.90 

Trimethylamine 

-3.24 

Aniline 

-4.90 

Nitrobenzene 

-4.12 

Methanol 

-5.10 

Ethanol 

-5.05 

1 -Propanol 

-4.83 

1 -Butanol 

-4.72 

tert- Butanol 

-4.51 

1-Hexanol 

-4.36 

Cyclohexanol 

-5.48 

Phenol 

-6.62 

Hydrocarbon  N 

Hydrocarbon  N 

Hydrocarbon  N 

Hydrocarbon  N 

Hydrocarbon  N 

Hydrocarbon  N 

Aromatic  N 

Aromatic  N 

Aromatic  N 

Acid  Y 

Heterocycle  Y 

Chlorinated  hydrocarbon  N 

Chlorinated  hydrocarbon  N 

Chlorinated  hydrocarbon  N 

Ketone  Y 

Amine  Y 

Nitro  compound  Y 

Nitrile  Y 

Amine  Y 

Aromatic  amine  Y 

Aromatic  nitro  Y 

Alcohol  Y 

Alcohol  Y 

Alcohol  Y 

Alcohol  Y 

Alcohol  Y 

Alcohol  Y 

Alcohol  Y 

Aromatic  alcohol  Y 


for  the  present  study.  The  experimental  AGg^^  of 
solvation  in  water  were  taken  from  the  compila¬ 
tion  by  Cabani  et  al.  [51]. 

The  results  were  analyzed  using  the  CODESSA 
[52]  statistical-analysis  package,  utilizing  the  corre¬ 
lation  and  heuristics  methods  available  therein.  In 
keeping  with  the  terminology  used  in  the  field  of 
Linear  Structure- Activity  Relationships,  the  vari¬ 
ous  calculated  values  from  the  various  models  will 
be  called  descriptors.  The  quality  of  the  fits  was 
judged  by  the  square  of  the  correlation  coefficient 
(R^),  by  the  sign  and  value  of  the  regression  coef¬ 
ficients  (multipliers  in  front  of  the  corresponding 
descriptor  values  in  the  regression  formula),  and 
by  the  cross-validated  R^.  The  latter  is  a  figure 
which  measures  the  predictive  power  of  a  fit  by 
leaving  out  the  data  points  one  by  one,  repeating 
the  correlation  for  the  rest,  predicting  the  left-out 
experimental  value  from  the  regression,  and  then 


correlating  predicted  values  with  the  actual  one. 
We  consider  and  R^(CV)  values  above  0.9 
satisfactory  for  predictions  in  complex  areas  like 
studies  of  solvation  energetics. 


Results  and  Discussion 

DEPENDENCE  OF  THE  COMPONENTS  OF 

SOLVATION  ENERGY  ON  CAVITY  SIZE 

The  only  solute-specific  parameter  that  enters 
the  equations  of  the  solvation  energy,  which  is  not 
directly  obtainable  from  quantum  chemistry,  is  the 
size  (and  shape,  for  PCM-type  models)  of  the  cav¬ 
ity.  To  investigate  this  dependence,  the  various 
components  for  the  simplest  spherical  cavity 
model,  along  with  cavitation  energy  by  Pierotti, 
and  dispersion  energy  by  Rosch  and  Zerner,  were 
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calculated  and  plotted  as  a  function  of  cavity  size. 
For  the  case  of  acetone,  the  plot  is  reproduced  in 
Figure  1. 

It  is  immediately  clear  that  all  components  of 
the  solvation  energy  become  more  positive  as  the 
size  of  the  cavity  increases.  The  dependence  is 
especially  sharp  for  the  sum  of  the  three  compo¬ 
nents,  which  can  be  used  as  the  estimate  for  the 
total  free  energy  of  solvation.  For  example,  when 
the  cavity  size  is  decreased  by  10%  from  the 
mass-density  value,  the  total  calculated  solvation 
energy  decreases  from  -3  to  -8  kcal/mol — far 
beyond  any  experimental  error  and  probably  be¬ 
yond  the  expected  error  of  the  calculation. 

Since  there  is  no  generally  approved  and  tested 
method  to  calculate  the  cavity  size,  it  is  often 
approached  as  a  parameter  of  the  model.  There  is 
also  no  proof  that  the  optimum  values  for  the  radii 
for  calculating  the  different  components  of  the 
solvation  energies  are  similar.  However,  in  the 
range  close  to  the  optimal  radius,  the  components 
of  the  solvation  energy  depend  monotonically  upon 
the  radius.  Therefore,  it  was  found  justified  to  use 
multilinear  regression  in  fitting  the  calculated 
components  of  the  solvation  energy  to  experimen¬ 
tal  data.  The  regression  coefficients  can  be  con¬ 
verted  back  into  coefficients  usable  for  scaling  the 
sizes  of  the  cavity  from  the  values  used  in  this 
study  into  a  single  optimal  scaling  factor  for  each 
model,  but  we  have  not  done  so  here. 


SINGLE-PARAMETER  CORRELATIONS 

In  some  early  studies,  the  electrostatic  interac¬ 
tion  energies  have  been  used  as  a  measure  of  the 
whole  solute-solvent  interaction,  without  addi¬ 
tional  corrections  for  the  cavitation  and  dispersion. 
According  to  the  results  of  the  current  study,  none 
of  the  available  electrostatic  interaction  energy 
models  yields  results  which  correlate  well  with  the 
experimental  data.  The  best  correlation  [  = 

0.4882,  R^(CV)  ==  0.4165,  Fig.  2]  was  achieved  for 
the  multicavity  SCRF  model: 

AQ,,,  =  0.52AGeiectrostatic  “  0-60.  (13) 

Surprisingly,  the  hydrogen-bonding  descriptor 
correlated  remarkably  well  with  the  experimental 
solvation  energies  [R^  =  0.8285,  R^(CV)  =  0.7997, 
Fig.  3)],  dividing  the  data  into  two  classes  of 
non-hydrogen-bonding  and  hydrogen-bonding 
molecules,  with  the  average  energy  of  the  hydro¬ 
gen  bond  estimated  at  -5.3  kcal/mol.  The  latter 
value  is  in  good  agreement  with  the  generally 
accepted  values  of  3-9  kcal/mol  for  a  hydrogen 
bond  [see,  e.g.,  [53],  p.  454].  Trying  to  anticipate 
how  many  hydrogen  bonds  a  compound  pos¬ 
sessed  did  not  improve  the  statistical  fit. 

Since  the  solvation  energies  appear  to  be  largely 
dominated  by  the  hydrogen  bonding,  the  com¬ 
pounds  were  subdivided  into  two  groups:  hydro- 


Acetone:  components  of  solvation  energy 


FIGURE  1 ,  Components  of  the  calculated  solvation  energy  of  acetone  in  water. 
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FIGURE  2.  Correlation  between  the  calculated 
multicavity  SCRF  and  the  experimental  solvation  energies 
of  all  30  compounds. 


gen-bonding  and  non-hydrogen-bonding.  There 
were  17  hydrogen-bonding  and  13  non-hydrogen- 
bonding  structures  in  the  set.  Most  of  the  remain¬ 
ing  correlations  were  performed  on  the  two  groups 
separately. 


FIGURE  3.  Correlation  between  the  hydrogen-bonding 
descriptor  and  the  experimental  solvation  energies  of  all 
30  compounds. 


I\0]N-HYDR0GEI\-B0]\DI1\G  COMPOUINDS 

For  this  group,  three  parameters  appeared  to  be 
the  minimum  amount  necessary  to  achieve  a  corre¬ 
lation  of  the  experimental  solvation  energies  with 
the  higher  than  0.9.  The  best  fit  was  obtained 
with  the  PCM  model  (electrostatic  potential  from 
Mulliken  charges),  surface  area,  and  volume,  as 
parameters  [  =  0.9513,  R^(CV)  =  0.9144,  Fig.  4)]: 

AG,,,,  =  0.63AGe,e,, +  0.27S  -  0.34^  -  1.7. 

(14) 

The  coefficient  0.63  in  front  of  the  calculated  elec¬ 
trostatic  contribution  would  be  brought  to  1.0  by 
using  a  scaling  factor  for  the  van  der  Waals  radii 
equal  to  1.17,  close  to  1.2  as  discussed  above. 
Replacing  the  PCM/Mulliken  electrostatic  interac¬ 
tion  descriptor  with  any  other  one  of  the  electro¬ 
static  descriptors  did  not  lead  to  a  much  worse 
correlation;  in  all  cases  was  above  0.87. 

Leaving  out  any  of  the  three  descriptors,  how¬ 
ever,  did  lead  to  a  significant  degradation  of  the 
fit,  with  typical  correlation  coefficients  as  low  as 
R^  =  0.5.  We  conclude  here  that  at  least  one  de¬ 
scriptor  for  each  of  cavitation,  dispersion,  and  elec¬ 
trostatic  components  of  the  solvation  energy  is 
required  to  describe  the  energetics  of  solvation  of 
the  non-hydrogen-bonding  compounds. 


FIGURE  4.  The  best  three-parameter  correlation  for  the 
non-hydrogen  bonding  compounds. 
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At  the  same  time,  there  is  a  little  dependence  of 
the  final  results  on  the  choice  of  the  particular 
models  chosen  for  either  of  the  components.  Also, 
the  more  complex  (Cl)  method  for  the  calculation 
of  dispersion  energy  [42]  does  not  appear  to  have 
advantages  over  the  far  more  trivial  molecular 
volume  model,  at  least  for  AGg^j  (R^  =  0.7777  for 
PCM  electrostatics,  surface  area,  and  Cl  disper¬ 
sion;  =  0.5993  if  Pierotti's  cavitation  energy  is 
substituted  for  the  surface  area). 

HYDROGErV-BOrVDIlVG  COMPOLIVDS 

Aside  from  the  overall  energy  of  a  hydrogen 
bond,  as  described  earlier,  there  appears  to  be  no 
single  good  descriptor  to  characterize  the  solvation 
energy  for  this  group  of  compounds.  Reasonably 
good  correlations  could  only  be  achieved  when  the 
partial  atomic  surface  area  descriptors  were  in¬ 
volved.  This  apparently  accounts  for  the  different 
ability  of  the  atoms  to  form  hydrogen  bonds  and  is 
at  the  heart  of  such  methods  as  described  by 
Cramer  and  Truhlar  [14-18].  These  are  features  of 
geometric  structure  rather  than  directly  dependent 
on  electronic  structure. 

The  best  correlation  results  from  inclusion  of 
the  partial  surface  areas,  along  with  spherical  cav¬ 
ity  dipole-level  SCRF  electronic  energies.  The  cor¬ 
relation  coefficient  is  still  not  very  good,  R~  — 
0.6296.  Addition  of  dispersion  term  raises  the  cor¬ 
relation  coefficient  to  R^  =  0.7812  (here  the  ellipti¬ 
cal  cavity  model  performed  better  than  the  spheri¬ 
cal),  but  the  R^(CV),  i.e.,  the  predictive  power  of 
the  model,  remains  extremely  low,  0.3952  (Fig.  5). 
The  plot  indicates  a  relatively  uniform  distribution 
of  errors,  unlikely  to  be  improved  by  adding  an¬ 
other  descriptor.  The  coefficient  in  front  of  the 
electrostatic  term,  -0.54,  is  indicative  of  the  low 
influence  of  the  electrostatic  energy  contribution 
into  the  total 

A  similar  set  of  parameters,  with  the  addition  of 
the  surface  area  of  chlorine  and  the  hydrogen- 
bonding  descriptor,  and  using  the  spherical  cavity 
electrostatic  contribution,  is  well  capable  of  pre¬ 
dicting  the  solvation  energies  in  the  whole  data 
set.  The  errors  are  all  below  1  kcal/mole,  with  a 
R^  =  0.9443  and  R\CV)  =  0.8226  (Fig.  6).  We  con¬ 
clude  that  the  approach  adopted  by  the  developers 
of  the  SM*  models  will  be  the  most  successful  one, 
applicable  to  both  non-hydrogen-bonding  and  hy¬ 
drogen-bonding  compounds. 

The  downside  of  this  multiparameter  approach, 
however,  is  the  fact  that  removal  of  the  electro- 


FIGURE  5.  The  best  six-parameter  correlation  for  the 
hydrogen-bonding  compounds. 


static  component  does  not  lower  of  the  quality  of 
the  fit  significantly  (R^  =  0.6296).  It  appears  that 
solvation  energy  can  be  described  by  descriptors 
devoid  of  quantum  theory.  The  influence  of  the 
solvent  on  the  electronic  structure,  as  well  as  elec¬ 
tronic  properties  of  the  solute  other  than  energy,  is 
not  expected,  however,  to  be  described  accurately 


FIGURE  6.  The  best  correlation  (8  parameters)  for  the 
entire  set  of  compounds. 
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without  quantum  chemistry.  It  is  also  a  reasonable 
conclusion  to  expect,  that  for  hydrogen-bonded 
systems,  supermolecule  calculations  are  required  if 
greater  accuracy  is  the  objective. 


Conclusions 

The  current  study  indicates  that  the  best  de¬ 
scriptors  for  free  energy  of  solvation, 
presently  come  from  models  that  do  not  require 
quantum  chemical  calculations.  While  the  impor¬ 
tance  of  various  continuum  models  in  the  descrip¬ 
tion  of  the  influence  of  solvents  on  the  electronic 
structure  of  the  solute  cannot  be  overestimated, 
the  property  most  often  sought  after  and  quoted, 
namely,  free  energy  of  solvation,  can  still  be 
best  described  by  simple  multilinear  correlation 
models. 

For  hydrogen  bonding,  there  appears  to  be  no 
general  quantum  mechanical  approach  aside  from 
supermolecule  calculations  or  mixed  quantum 
classical  models  that  include  strong  interactions 
specifically.  While  the  non-quantum-mechanical 
simple  descriptors  are  quite  capable  of  estimating 
the  free  energies  of  solvation,  these  models  have 
no  influence  whatsoever  on  the  electronic  structure 
resulting  from  single-molecule  quantum  chemistry 
calculations.  The  various  continuum  methods 
studied  all  give  results  of  similar  quality,  and  none 
of  them  can  be  singled  out  by  this  study  to  be  the 
"'best"  in  calculating  AGg^/^. 
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ABSTRACT _ 

Experimental  observations  of  the  photoinduced  excited-state  lifetime  of  c/s-stilbene  have 
shown  a  distinct  dependence  on  solvation  processes.  The  rate  of  decay,  dominated  by  a 
cis-trans  isomerization,  is  more  rapid  in  polar  solvents  than  in  nonpolar  solvents.  Linear 
solvation  energy  relationship  (LSER)  techniques  show  that  this  can  be  explained  in  terms 
of  polarity  and  polarizability  parameters  for  the  solvent.  Theoretical  linear  solvation 
energy  relationship  (TLSER)  analysis  shows  that  this  can  be  explained  in  terms  of  solvent 
polarizability  and  electrostatic  basicity.  New  TLSER  descriptors  based  on  calculated 
solvent  bond  diplole  parameters  are  also  successful  in  describing  this  solvent  dependence. 
Solvent-dependent  dipole  moments  are  calculated  for  an  approximate  stilbene 
transition-state  geometry  using  the  polarizable  continuum  model  (PCM),  which  suggests 
the  usefulness  of  a  more  detailed  study  of  this  photoisomerization  process  using  current 
solvation  theory  and  computational  techniques.  ©  1996  John  Wiley  &  Sons,  Inc. 
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Introduction 

Reaction  dynamics  in  solution  is  a  complex 
phenomenon  not  easily  explained  by  theoret¬ 
ical  or  experimental  models  [1].  The  photoinduced 
isomerization  of  stilbene  betw^een  cis  and  trans 
forms  is  a  well-studied  unimolecular  phenomenon 
[2,  3]  and  has  been  previously  treated  with  solvent 
viscosity  models  to  describe  solvent  influence  on 
the  internal  rotation  about  the  ethylenic  central 
bond.  These  models,  however,  do  not  seem  to 
account  for  some  experimental  observations,  par¬ 
ticularly  when  studied  in  dipolar  solvents  [1,  3]. 
Considerable  research  has  focused  on  the  mecha¬ 
nisms  that  contribute  to  this  phenomenon  using 
concepts  such  as  internal/ external  energy  reorga¬ 
nizations  and  time-dependent  solvent  '"friction"' 
[1]. 

This  manuscript  describes  the  results  of  regres¬ 
sion  studies  of  the  solvent  dependence  of  photoin¬ 
duced  excited-state  c/s-stilbene  relaxation  rates 
which  are  dominated  by  cis- trans  isomerization. 
Regression  techniques  have  a  long  history  of  suc¬ 
cessful  elucidation  of  basic  phenomena  affected  by 
solute/solvent  interactions  [4,  5].  Using  well- 
established  descriptors  associated  with  various 
models  for  solute/ solvent  interactions,  regression 
techniques  employ  statistical  analysis  to  identify 
important  solvent  and  solute  solubility  factors  as¬ 
sociated  with  variations  in  experimental  observa¬ 
tions.  These  calculations  are  fast  and  are  extremely 
useful  for  identifying  models  worthy  of  more  elab¬ 
orate  development  [6]. 

Previously,  Rice  and  Baronavski  [9]  and  McGill 
and  co-workers  [7]  modeled  the  effects  of  solvent 
on  the  excited-state  c/s-stilbene  lifetime.  The  ultra¬ 
fast  dynamics  of  czs-stilbene  in  solution  were  mon¬ 
itored  using  100  ns  laser  systems:  Experimental 
lifetime  data  in  a  wide  range  of  dipolar  and  non¬ 
polar  solvents  were  gathered  from  several  inde¬ 
pendent  laboratories  [8-14]. 

LSER  ANALYSIS 

The  solvent  dependence  of  these  lifetimes  were 
studied  using  the  techniques  of  linear  solvation 
energy  relationships  (LSER).  Multiple  regression 
analysis  of  log  isomerization  rates.  In  /c,  were  per¬ 
formed  based  on  variations  of  empirical  multipa¬ 


rameter  solvation  equations  developed  by  Koppel, 
Palm,  Makitra,  and  Pirig  [KPMP]  [15-18]  and 
Kamlet,  Abraham,  and  Taft  [KAT]  [19].  These 
equations  have  been  shown  to  be  capable  of  corre¬ 
lating,  predicting,  and  elucidating  himdreds  of 
physiochemical  properties  and  reactivity  para¬ 
meters  of  diverse  types  [4,  19,  20].  Both  the  KPMP 
and  KAT  equations  contain  solvent  descriptors  for 
solvent  polarity  such  as  or  f(s);  hydrogen¬ 
bonding  acidity  or  electrophilicity  E;  hydrogen¬ 
bonding  basicity  or  nucleophilicity  B;  and  cav¬ 
ity  formation  8^.  However,  in  contrast  to  the 
KPMP  solvation  equation,  the  KAT  equation  does 
not  contain  a  separate  parameter  that  describes  the 
solvent  molecular  polarizability.  To  include  such  a 
parameter,  the  refractive  index  function  f(n)  was 
used  to  replace  the  solvent  parameter  which  mod¬ 
els  the  polarizability  of  the  solvent  lone  pair  and  tt 
electrons  R^,  as  opposed  to  the  whole  molecule.  In 
addition  to  using  the  usual  solvent  dipolar  de¬ 
scriptors,^ /(^r)  and  TT^  for  the  KPMP  and  KAT 
solvation  equations,  the  molecular  dipole  moment, 
/X,  was  also  examined  as  an  alternative  to  these 
parameters.  The  three  primary  equations  utilized 
were 

ln(k)  ==  In(^o)  +  p  X/(n)i  -h  s  X  TTi 

-(-  X  b  X  -h  /z  X  (l) 

ln(k)  =  ln(/Co)  +  p  X  f(n)i  m  X  /jl 

-\-aXa^+bXp^-\-hX  8^  (2) 

ln(k)  =  InC/co)  -h  X  f(n)i  +  A2f(s)i 

-h  A5  X  E  -h  A4  X  B  -h  A3  X  6^ .  (3) 

TLSER  ANALYSIS 

The  work  reported  here  focuses  on  using  the 
theoretical  linear  solvation  energy  relationships 
technique  developed  by  Famini  and  co-workers  [6, 
20,  21].  They  developed  the  theoretical  linear  sol¬ 
vation  energy  relationship  (TLSER)  method  based 
on  a  systematic  study  of  this  problem.  The  TLSER 
formulation  of  the  regression  equations  is  given  by 
[6] 

log(P)  =  a  X  b  X  7ri-\-cXsQ-EdXq_ 

e  X  -\-  f  X  +  +  g.  (4) 

The  definitions  of  these  terms  are  given  in  Table  1. 
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TABLE  I 


Definitions  of  TLSER  descriptors. 

Symbol 

Name 

Definition 

Units 

Meaning 

Molecular  volume 

Molecular  volume 

100  A 

Cavity  /  steric 

Polarizability  index 

Polarizability  (Y^^) 

none 

Polarizability 

“Covalent”  HB 

basicity 

0.30  -  (I  (£(h)  -  E(lw))i/100) 

heV 

HBAB 

q  - 

“Electrostatic”  HB 

Maximum  |(-)|  charge  on 

acu 

HBAB 

basicity 

an  atom 

“Covalent”  HB  acidity 

0.30-  (I  (Ed)  -  E  (hw))|/100) 

heV 

HBDA 

Qh^ 

“Electrostatic”  HB 

Maximum  |(-l-)|  charge  on 

acu 

HBDA 

acidity 

an  H  atom 

A  =  angstrom;  heV  =  hectoelectron  volt;  acu  =  atomic  charge  unit,  HB  =  hydrogen  bond;  E(1)  =  LUMO  energy; 
E(h)  =  HOMO  energy;  E  (Iw)  and  E  (hw)  refer  to  the  E  (LUMO)  and  E  (HOMO)  for  water,  respectively:  I  I  indicates 
absolute  magnitudes;  HBAB  =  hydrogen  bond  acceptor  basicity;  HBDA  =  hydrogen-bond  donor  acidity. 


The  molecules  of  interest  for  these  studies  are 
relatively  large.  Therefore^  the  TLSER  method  re¬ 
lies  on  the  semiempirical  formulations  for  molecu¬ 
lar  orbital  calculations  [22,  23],  most  notably  the 
well-documented  and  extensive  functionalities 
available  in  the  MOP  AC  package  [24].  These  de¬ 
scriptors  have  been  successfully  used  to  character¬ 
ize  a  range  of  properties  ranging  from  charcoal 
absorption,  HPLC  retention  index,  electronic  ab¬ 
sorption  of  a  pyridinium  ylide,  octanol-water  par¬ 
tition  coefficients,  pK^  and  the  reaction  rate  con¬ 
stant  for  hydrolysis  of  organophosphonothioates  in 
different  solvents  [20,  25]. 

One  of  the  principal  reasons  for  this  study  was 
the  recent  development  of  an  empirical  model  by 
McGill  and  co-workers  [7]  which  characterizes  the 
excited-state  c/s-stilbene  decay  rate  as  a  process 
dependent  on  two  main  opposing  effects  con¬ 
trolled  by  the  solvent  molecular  polarity  and  po¬ 
larizability.  In  the  spirit  of  the  TLSER  philosophy. 
Eager  and  Famini  [26]  developed  proposed  de¬ 
scriptors  derived  from  bond  dipoles  as  calculated 
by  MOP  AC  [24].  Use  of  these  dipole  descriptors  is 
still  in  the  preliminary  development  stage.  The 
proposed  descriptors  are  the  total  molecular  dipole 
fjJ,  the  maximum  bond  dipole  /jl^,  and  the  aver¬ 
age  bond  dipole  /jl^.  The  bond  dipole  is  defined  as 
the  positive  difference  between  the  calculated 
atomic  charges  multiplied  by  the  bond  length  for 
the  two  atoms  participating  in  a  bond. 


Results 

The  TLSER  descriptors  for  the  various  solvents 
along  with  the  associated  experimental  lifetimes 
and  rates  are  given  in  Table  11.  Calculations  were 
made  for  some  hydrocarbons  which  were  not  part 
of  the  experimental  data  set  for  the  purpose  of 
seeing  if  the  regressions  predict  useful  values  for 
future  experiments.  Since  molecular  dipoles  are  of 
interest  in  this  study,  TLSER  descriptors  were  cal¬ 
culated  for  stilbene  geometries  optimized  with 
MNDO  using  a  two-state  configuration  interaction. 
Both  the  cis  and  trans  forms  were  optimized  and 
an  approximate  geometry  for  a  transition  state  was 
obtained  by  searching  the  ethylenic  torsion  coordi¬ 
nate  at  5°  intervals.  The  Cl-optimized  geometry 
holding  the  torsion  at  105.0°  exhibited  the  highest 
heat  of  formation.  The  customary  TLSER  descrip¬ 
tors  were  calculated  holding  these  three  geome¬ 
tries  fixed  [26].  These  geometries  are  shown  in 
Figure  1.  The  descriptors  for  the  cis,  trans,  and  the 
approximate  transition  state  are  given  in  Table  III. 
Since  the  TLSER  descriptors  are  often  not  greatly 
sensitive  to  molecular  conformation  [6],  the  ap¬ 
proximate  transition-state  geometry  is  a  good  rep¬ 
resentation  for  the  purposes  of  this  study.  This 
may  be  observed  in  Table  III  where  most  of  the 
TLSER  descriptors  vary  between  the  cis  and  trans 
conformations  by  1%  or  less.  The  calculated  de- 
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TABLE  II _ 

TLSER  descriptors  for  solvents. 


Experimental  Experimental 


Solvent® 

Volume 

Polarizability  Eb^ 

0  ~ 

Ea^ 

0  -f 

/ 

lifetime  (ps) 

In  k  Rate‘s 

Acetonitrile 

41.06 

0.1034 

0.1177 

0.1146 

0.1622 

0.0209 

0.7675 

0.3577 

0.1535 

0.38 

0.968 

Cycloheptane 

116.71 

0.1119 

0.1276 

0.0054 

0.1460 

0.0037 

0.1237 

0.0100 

0.0059 

1.52 

-0.419 

Cyclohexane 

99.97 

0.1116 

0.1286 

0.0092 

0.1461 

0.0050 

0.1840 

0.0158 

0.0102 

1.37 

-0.315 

Cyclononane 

150.94 

0.1117 

0.1287 

0.0040 

0.1462 

0.0065 

0.1078 

0.0115 

0.0040 

c 

c 

Cyclooctane 

137.53 

0.1094 

0.1292 

0.0029 

0.1457 

0.0032 

0.0765 

0.0068 

0.0032 

1.62 

-0.482 

Cyclopentane 

86.09 

0.1062 

0.1250 

0.0165 

0.1462 

0.0082 

0.2744 

0.0274 

0.0183 

1.25 

-0.223 

n-Decane 

184.44 

0.1056 

0.1275 

0.0218 

0.1466 

0.0045 

0.7295 

0.0787 

0.0235 

1.27 

-0.239 

1-Decanol 

184.76 

0.1092 

0.1328 

0.3236 

0.1466 

0.1805 

2.3157 

0.6742 

0.0724 

1.13 

-0.122 

Diethyl  ether 

87.02 

0.1034 

0.1365 

0.3423 

0.1456 

0.0071 

2.8536 

0.7162 

0.2038 

0.66 

0.416 

DMSO® 

68.54 

0.1101 

0.1475 

0.7204 

0.1734 

0.0500 

8.0154 

2.4601 

0.8906 

0.59 

0.528 

1 ,4-Dioxane 

83.48 

0.1075 

0.1391 

0.3230 

0.1478 

0.0327 

3.6174 

0.6384 

0.2584 

0.73 

0.315 

n-Dodecane 

224.52 

0.1037 

0.1281 

0.0218 

0.1470 

0.0045 

0.7887 

0.0787 

0.0213 

1.35 

-0.300 

Ethanol 

50.92 

0.0983 

0.1324 

0.3235 

0.1442 

0.1783 

1.7092 

0.6501 

0.2136 

0.62 

0.478 

Ethyl  acetate 

84.38 

0.1073 

0.1314 

0.3570 

0.1687 

0.0260 

3.7324 

0.9377 

0.2871 

0.54 

0.616 

n-Heptane 

133.64 

0.1032 

0.1258 

0.0217 

0.1455 

0.0045 

0.6390 

0.0786 

0.0290 

c 

c 

n-Hexadecane 

289.49 

0.1067 

0.1289 

0.0218 

0.1474 

0.0045 

0.9074 

0.0787 

0.0185 

1.59 

-0.464 

n-Hexane 

115.2 

0.1033 

0.1252 

0.0218 

0.1450 

0.0045 

0.6062 

0.0787 

0.0319 

1.05 

-0.049 

hmpa® 

177.11 

0.1153 

0.1451 

0.6559 

0.1874 

0.0111 

24.0137 

3.1901 

0.8576 

0.83 

0.186 

Methanol 

33.67 

0.0932 

0.1314 

0.3291 

0.1402 

0.1803 

1.9036 

0.7258 

0.3807 

0.50 

0.693 

mth-2-butn® 

95.89 

0.1040 

0.1255 

0.0666 

0.1453 

0.0126 

1 .0273 

0.1670 

0.0642 

1.00 

0.000 

mthcycihxn® 

118.85 

0.1096 

0.1293 

0.0560 

0.1465 

0.0110 

0.7224 

0.1523 

0.0344 

1.37 

-0.315 

n-Nonane 

169.3 

0.1039 

0.1270 

0.0218 

0.1463 

0.0045 

0.6997 

0.0787 

0.0250 

1.21 

-0.191 

n-Octane 

151.42 

0.1036 

0.1265 

0.0218 

0.1460 

0.0045 

0.6699 

0.0787 

0.0268 

1.16 

-0.148 

n-Pentadecane 

271.8 

0.1066 

0.1288 

0.0218 

0.1473 

0.0045 

0.8780 

0.0787 

0.0191 

c 

c 

n-Pentane 

97.4 

0.1028 

0.1240 

0.0217 

0.1446 

0.0045 

0.5809 

0.0786 

0.0363 

0.97 

0.030 

1-Pentanol 

101 .93 

0.1045 

0.1336 

0.3234 

0.1462 

0.1790 

2.2997 

0.6679 

0.1353 

0.94 

0.062 

2-Propanol 

68.04 

0.1010 

0.1335 

0.3200 

0.1450 

0.1781 

1.5821 

0.5893 

0.1438 

0.80 

0.223 

Propionitrile 

61.23 

0.1005 

0.1197 

0.1055 

0.1620 

0.0291 

0.5967 

0.2802 

0.0746 

0.46 

0.777 

n-Tetradecane 

247.04 

0.1096 

0.1286 

0.0218 

0.1472 

0.0045 

0.8475 

0.0787 

0.0197 

1.44 

-0.365 

n-Tridecane 

238.85 

0.1054 

0.1284 

0.0218 

0.1471 

0.0045 

0.8186 

0.0787 

0.0205 

1.37 

-0.315 

tm224pntne® 

145.9 

0.1065 

0.1293 

0.1045 

0.1461 

0.0057 

2.4571 

0.2421 

0.0983 

1.10 

-0.095 

n-Undecane 

202.35 

0.1057 

0.1278 

0.0218 

0.1468 

0.0045 

0.7592 

0.0787 

0.0223 

c 

c 

^DMSO  =  dimethyl  sulfoxide,  hmpa  =  hexamethylphosphoramide,  mth-2-butn  =  2-methylbutane,  mthcycihxn  ^  methylcyclohexane, 
tm224pntne  =  2,2,4-trimethylpentane. 

^Eb  =  electrostatic  basicity;  Ea  =  electrostatic  acidity. 

^Not  part  of  experimental  data  set;  calculations  included  for  illustrative  purposes. 


scriptors  for  the  approximate  transition  state  show 
variations  that  range  from  <1%  to  as  large  as 
11%.  The  changes  observed  in  the  calculated  dipole 
descriptors  in  Table  III  support  the  molecular 
dipole  model  for  the  solvent  dependence  of  this 
isomerization.  These  calculations  are  viewed  as 
probes  of  significant  trends  in  complex  processes 
and  are  meant  to  be  suggestive  rather  than  quanti¬ 
tative. 

The  TLSER  regressions  were  performed  using 
the  isomerization  rate  data  shown  in  Table  II. 
Because  of  the  limited  amount  of  experimental 
data,  and  because  of  large  cross  correlations  be¬ 
tween  descriptors  for  this  data  set,  it  was  essential 


to  search  for  the  minimum  set  of  descriptors  that 
appeared  significant.  Using  the  established  TLSER 
descriptors,  the  regression  analysis  demonstrated 
that  the  stilbene  isomerization  rate  data  were  best 
represented  by  the  molecular  polarizability  tt-  and 
the  electrostatic  basicity  q_.  The  regression  results 
are  given  in  Table  IV.  During  the  search  for  mini¬ 
mum  descriptors,  it  became  apparent  that  the  data 
for  acetonitrile,  propionitrile,  and  ethyl  acetate 
were  not  well  represented  by  the  TLSER  descrip¬ 
tors  and  they  were  removed  from  the  regression  as 
outliers  whose  residuals  rates  were  well  over  three 
standard  deviations  removed  from  the  rest  of  the 
data  (Fig.  2).  Nitrogen  compounds,  in  particular 
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Approximate  Transition  State 


FIGURE  1 .  MNDO  geometries  of  c/s-stilbene,  an 
approximate  transition  state  with  torsion  around  the 
ethyleneic  bond  of  105.0°  and  trans-stWbene  using  a 
two-state  configuration  interaction. 


acetonitrile,  have  often  proven  difficult  to  repre¬ 
sent  with  this  methodology,  perhaps  because  of 
well-known  difficulties  in  semiempirical  calcula¬ 
tions  for  these  molecules  [27].  At  this  time,  it  is  not 
understood  why  ethyl  acetate  is  not  well  repre¬ 
sented  by  TLSER  for  this  data  set. 

The  VIF  is  the  variable  inflation  factor  used  to 
measure  cross-correlation  among  descriptors  in  the 
regression  analysis.  It  is  defined  as 

VIF  -  1/(1  -  K^),  (5) 

where  R  is  the  correlation  coefficient  of  one  vari¬ 
able  against  the  others  in  the  regression  set.  In  the 
TLSER  methodology,  values  of  1  <  VIF  <  5  are 
considered  acceptable.  Because  the  proposed  dipole 
descriptors  are  not  yet  well  characterized,  and 
because  of  the  limited  amoimt  of  experimental 
data,  much  stricter  criteria  were  employed  here  in 
suggesting  the  significance  of  various  descriptors 
for  the  rate  of  isomerization.  For  example,  includ¬ 
ing  the  molecular  volume  descriptor  in  the  above 
regression  improves  the  value,  increasing  it  to 
0.941  and  reducing  the  RMS  residual  to  0.086. 
However,  the  two-tail  P  value  for  the  volume 
term  is  0.0005  and  the  VIF  factor  for  the  volume  is 
1.41.  While  these  are  acceptable  values,  and  the 
coefficient  for  the  volume  contribution  is  negative 
in  accord  with  the  proposed  model  for  isomeriza¬ 
tion,  the  changes  in  statistical  measures  suggest 
that  more  data  are  necessary  for  a  definitive  analy¬ 
sis.  Similarly,  a  regression  using  the  TLSER  polar¬ 
izability  and  covalent  basicity  yielded  goodness- 
of-fit  values  R  =  0.921  and  R^  =  0.848,  however, 
the  VIF  for  the  polarizability  vs.  the  electrostatic 
basicity  (Eb)  was  1.1. 

Regression  analysis  of  this  isomerization  data 
using  the  proposed  dipole  descriptors  was  greatly 
aided  by  the  VIF  analysis  of  cross-correlation.  There 
is  not  a  strictly  defined  direct  relationship  between 
the  TLSER  descriptors  and  these  proposed  dipole 
quantities,  even  though  they  are  all  strictly  defined 
in  terms  of  results  from  the  MOP  AC  calculations 


TABLE  III _ _ _ 

TLSER  descriptors  for  stilbene  MNDO  geometries  optimized  using  two-state  configuration  interaction. 


Stilbene 

configuration 

V  mol 

Pi-1 

Eb 

0  - 

Ea 

0/7  + 

Total 

Max 

Average 

c/s -Stilbene 

178.15 

0.1358 

0.1535 

0.0644 

0.1758 

0.0599 

1.7417 

0.1353 

0.0645 

frans-Stilbene 

177.00 

0.1415 

0.1535 

0.0645 

0.1764 

0.0602 

1 .7786 

0.1357 

0.0659 

TS  -  105° 

179.24 

0.1519 

0.1670 

0.0655 

0.1844 

0.0680 

1 .7983 

0.1442 

0.0666 
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TABLE  IV 


TLSER  regression  on  stilbene  isomerization  rates:  Ln(fc)  =  b  x 
(n  =  24,  f  =  0.945,  sd  =  0.114);  R  =  0.945,  =  0.892,  Adj  R2  = 

TTj  +  d  X  q  -  -1-  intercept 
:  0.882,  and  RMS  residual  =  0.114. 

b 

d 

Intercept 

Coefficient 

-38.800 

1.289 

3.850 

± 

5.022 

0.810 

0.531 

t  value 

-7.726 

1 1 .540 

7.248 

P  (two-tail) 

<  0.001 

<  0.001 

<  0.001 

VIF 

1.00 

1.00 

Std.  coefficient 

-0.542 

0.810 

3.850 

Solvent 

Experimental  ln(/f) 

Calculated  In(A') 

Residual 

Acetonitrile  (avg,  [11,  13]) 

.968 

-.014 

.981 

Cycloheptane  [8] 

-.419 

-.484 

.066 

Cyclohexane  [9] 

-.315 

-.468 

.153 

Cyclononane 

— 

-.478 

_ 

Cyclooctane  [8] 

-.482 

-.391 

-.092 

Cyclopentane  [8] 

-.223 

-.249 

.026 

n-Decane  [11] 

-.239 

-.219 

-.020 

1-Decanol  [8] 

-.122 

.030 

-.153 

Diethyl  ether  [8] 

.416 

.280 

.136 

DMSO  (avg,  [8,  11]) 

.528 

.507 

.021 

Dioxane 

.315 

.096 

.219 

n-Dodecane  [11] 

-.300 

-.145 

-.155 

Ethanol  [8] 

.478 

.453 

.025 

Ethyl  acetate  [8] 

.616 

.147 

.469 

Heptane 

— 

-.126 

— 

n-Hexadecane  [14] 

-.464 

-.262 

-.202 

n-Hexane  (avg,  [8,  10,  12-14]) 

-.049 

-.130 

.081 

hmpa  [8] 

.186 

.222 

-.036 

Methanol  (avg,  [9,  11,  12]) 

.693 

.658 

.035 

mth-2-butn  [14] 

.000 

-.099 

.099 

mthcycihxn  [8] 

-.315 

-.330 

.015 

n-Nonane  [13] 

-.191 

-.153 

-.038 

n-Octane  [13] 

-.148 

-.141 

-.007 

Pentadecane 

— 

-.258 

_ 

n-Pentane  [8] 

.030 

-.110 

.141 

Pentanol  [1 1  ] 

.062 

.213 

-.151 

Propan-2-ol  [11] 

.223 

.344 

-.121 

Propionitrile  [8] 

.777 

.087 

.690 

Tetradecane  [10] 

-.365 

-.374 

.009 

Tridecane 

— 

-.211 

_ 

tm224patne  [14] 

-.095 

-.147 

.052 

Undecane 

— 

-.223 

— 

See  footnote  a  to  Table  II  for  abbreviations. 


[24,  26].  Based  on  the  regression  results  above,  and 
the  model  proposed  by  McGill  and  co-workers  [7], 
the  polarizability  term  was  chosen  as  the  most 
significant  TLSER  descriptor.  In  the  process  of 
minimizing  additional  descriptors,  the  best  regres¬ 
sion  analysis  was  achieved  by  combining  the  po¬ 


larizability  and  the  average  dipole  descriptors 
These  regression  statistics  are  given  in  Table  V. 

Regression  using  the  polarizability  and  the  total 
dipole  descriptor,  ijJ,  yielded  goodness  of  fit 
values  R  =  0.787  and  =  0.64.  Using  the  maxi¬ 
mum  dipole  descriptor,  /x^,  and  the  polarizability 
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FIGURE  2.  Observed  vs.  TLSER  calculated  transition  rates  for  solvent-dependent  photoisonnerization  of  stilbene.  The 
regression  line  is  shown  for  the  final  regression  with  the  outliers  removed. 


yielded  goodness  of  fit  values  R  =  0.900  and  = 
0.810.  The  VIF  factor  for  the  polarizability  and  the 
fjL^  descriptor  is  1.09,  suggesting  a  stronger  cross¬ 
correlation  between  these  two  descriptors  as  per¬ 
haps  the  explanation  for  the  less  satisfactory  good- 

TABLE  V _ 

Regression  equation  and  statistics  using 
polarizabiiity  and  average  dipole  descriptors: 

Ln(fc)  =  b  X  TT,  -H  m3  X  -H  intercept 
(n  =  24,  r  =  0.913,  sd  =  0.141);  R  =  0.913, 

R2  =  0.833,  Adj  R2  =  0.818,  and  RMS 
residual  =  0.141. 


b 

m3 

Intercept 

Coefficient 

-44.827 

1.086 

4.544 

± 

6.326 

0.783 

0.667 

t  value 

-7.086 

8.858 

6.808 

P  (two-tail) 

<  0.001 

<  0.001 

<  0.001 

VIF 

1.03 

1.03 

Std. 

coefficient 

-0.626 

0.783 

4.544 

ness  of  fit.  The  volume  descriptor  is  found  to 
contribute  to  the  regression  analysis  in  a  manner 
similar  to  the  contribution  with  the  polarizability 
and  electrostatic  basicity.  Using  the  volume  de¬ 
scriptor  along  with  the  dipole  descriptor  in¬ 
creases  the  goodness  of  fit  to  yield  R  =  0.953  and 
R^  =  0.907,  but  the  VIF  for  the  volume  is  1.35  and 
the  VIF  for  the  AVG  descriptor  increases  to  1.20. 

Because  models  for  the  isomerization  process 
for  cis-transstilbene  need  not  necessarily  involve 
significant  hydrogen-bonding  or  cavity  term  ef¬ 
fects,  it  is  possible  to  simplify  Eqs.  (l)-(3)  by 
removal  of  the  last  three  terms.  The  resulting  re¬ 
gressions  for  all  the  data  shown  in  Table  II  are 

ln(k)  =  2.21  -  10.07  X  f(nh  +  1,06  X 

(n  =  30,  r  =  0.980,  sd  =  0.09)  (6) 
In(fc)  =  2.14  -  9.61  X/(n)i  +  0.24  X  fji 

in  -  30,  r  =  0.975,  sd  =  0.11)  (7) 
Inik)  =  1.33  -  8.32  X/(n)i  +  2.45  Xfis) 

in  =  30,  0.906,  sd  =  0.19),  (8) 
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From  these  equations,  it  is  clear  that  the  photoiso 
merization  of  cfs-stilbene  to  frans-stilbene  may  be 
explained  in  terms  of  a  polar  transition  state.  These 
equations  show  positive  contributions  from  terms 
representing  the  polarity  of  the  solvent  such  that 
more  polar  solvents  increase  the  isomerization  rate. 
Opposing  this  is  a  term  representing  induced 
molecular  polarizability  which  retards  the  isomer¬ 
ization  and  dominates  the  different  equations.  This 
suggests  that  polarizable  term  interactions,  be¬ 
tween  the  transition  state  and  polarizable  solvents, 
result  in  attractions  between  molecules.  Coupled 
with  polarizability  interactions  between  solvent 
molecules,  this  indicates  an  increased  '"sip-stick'"- 
type  friction  for  the  phenyl  group  as  it  rotates 
about  the  central  carbon-carbon  bond  toward 
frans-stilbene.  More  polarizable  solvents  thus  re¬ 
tard  the  isomerization  rate. 


Discussion 

The  results  of  the  MNDO-CI(2)  calculations  are 
consistent  with  other  theoretical  models  for  the 
isomerization  of  czs-stilbene  [3].  The  trans  form 
exhibits  a  slightly  lower  heat  of  formation  (60.7 
kcal)  than  does  the  cis  form  (61.1  kcal)  and  both 
forms  exhibit  similar  structures  around  the  eth- 
ylenic  bond  with  only  minor  variations  in  opti¬ 
mized  geometric  parameters.  The  approximate 
model  for  the  transition-state  geometry,  calculated 
for  ethylenic  torsion  fixed  at  lOS.O""  away  from  the 
cis  orientation,  shows  sharp  differences  in  opti¬ 
mized  geometric  parameters,  with  the  central  bond 
at  1.467  A  compared  to  1.345  A  for  the  initial  cis 
and  trans  forms.  The  C — C  bond  length  to  the 
phenyl  groups  is  approximately  the  same  as  the 
central  bond  in  the  twisted  form,  averaging  1.466 
A,  which  is  sightly  smaller  than  the  average  1.480 
A  found  in  the  planar  forms.  The  lengthening  of 
the  central  bond  is  consistent  with  results  from 
vibrational  analysis  [3]. 

EMPIRICAL  LSER  VS.  THEORETICAL  TLSER 

MODEL 

The  different  cis-trans-sti\bene  equations  devel¬ 
oped  by  multiple  linear  regression  of  empirical 
and  theoretically  derived  solubility  parameters 
show  very  similar  trends  and  similar  statistical 
quality.  However,  the  LSER  empirical  model  does 
provide  a  slightly  better  fit  to  the  data  and  can 
model  all  the  different  solvent  data,  whereas  for 


the  TLSER  model,  the  nitrogen-containing  solvents 
and  ethyl  acetate  had  to  be  removed  from  the  data 
set  as  outliers  from  the  model.  In  both  models, 
increasing  solvent  ''polarity"  advances  isomeriza¬ 
tion  rates  and  increasing  solvent  polarizability  re¬ 
tards  isomerization  rates,  and  the  latter  dominates 
the  process  even  for  the  most  dipolar  solvents 
studied.  Dipolar  solvents  interact  with  a  polar  stil- 
bene  transition  state,  providing  a  more  favorable 
reaction  pathway  via  a  lower-energy  transition 
state.  Polarizable  solvents  interact  with  stilbene 
and  surrounding  layers  of  solvent  molecules  to 
form  a  friction  barrier  to  rotation  of  the  phenyl 
group  from  the  transition  state  through  to  the 
trans  isomer.  Increasing  solvent  polarizability  leads 
to  larger  microscopic  friction. 

From  the  stilbene  equations  developed,  we  can 
now  use  them  to  predict  new  isomerization  rates 
and  evaluate  term  contributions  to  examine  which 
solvation  factors  dominate  the  process.  For  the 
empirically  developed  LSER  Eq.  (9)  and  the  theo¬ 
retically  developed  TLSER  Eq.  (10),  the  term 
contributions  are  listed  for  some  representative 
solvents. 

A  comparison  of  "polar"  terms  in  Table  VI(a) 
shows  that  there  is  very  good  agreement  in  the 
LSER  and  TLSER  term  contributions.  It  is  not  sur¬ 
prising  therefore  to  find  that  fx  and  q-  are  very 
well  correlated  [Eq.  (11)]  for  the  stilbene  solvent 
set  after  elimination  of  the  two  nitrile  containing 
solvents. 

Similarly,  a  comparison  of  "polarizability" 
terms  [Table  VI(b)]  also  shows  some  very  similar 
trends  for  homologs  and  between  different  fami¬ 
lies.  The  relatively  constant  difference  between  the 
magnitude  of  the  terms  for  solvents  can  be  traced 
to  the  difference  of  the  regression  constants  from 
the  two  LSER  and  TLSER  equations.  The  direct 
comparisons  are  shown  in  Table  VI(c).  A  plot  of 
fin)  vs.  TT^  for  all  the  solvents  in  Eq.  (12)  is  also 
quite  reasonable,  although  the  scatter  of  the  plot 
reduces  the  correlation  coefficient: 

In(fc)  =  2.14  -  9.61  ’fin)i  +  0.24  •  fx  (9) 

In(^)  =  3.85  -  38.8  •  tti  +  1.29  •  (10) 

/X  =  -0.13  +  5.39^j_  r  =  0.99  (11) 

f(n)i  =  -0.15  +  3.72  •  r  =  0.85.  (12) 

Also  in  this  work,  stilbene  was  studied  using 
the  polarizable  continuum  model  (PCM)  [28]  as 
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TABLE  VI _ 

Comparison  of  LSER  and  TLSER  polarity  terms,  polarizability  terms,  and  regressions. 


(a)  Comparison  of  LSER  and  TLSER  polarity  terms 

LSER  dipolar  term 

TLSER  dipolar  term 

Solvent 

0.24 

1.29  q  - 

n-Hexane 

0.00 

0.03 

n-Decane 

0.00 

0.03 

n-Tetradecane 

0.00 

0.03 

Cyclohexane 

0.00 

0.01 

Cyclooctane 

0.00 

0.00 

Methanol 

0.41 

0.42 

Pentanol 

0.40 

0.42 

Decanol 

0.39 

0.42 

Dimethyl  sulfoxide 

0.95 

0.93 

Ethyl  acetate 

0.43 

0.46 

Diethyl  ether 

0.28 

0.44 

(b)  Comparison  of  LSER  and  TLSER  polarizability  terms 


Solvent 

LSER  polarizability  term 
-9.61  fin) 

TLSER  polarizability  term 
-38.8  77 1 

n-Hexane 

-2.21 

-4.01 

n-Decane 

-2.40 

-4.10 

n-Tetradecane 

-2.50 

-4.25 

Cyclohexane 

-2.50 

-4.33 

Cyclooctane 

-2.59 

-4.24 

Methanol 

-1.92 

-3.62 

Pentanol 

-2.40 

-4.05 

Decanol 

-2.50 

-4.24 

Dimethyl  sulfoxide 

-2.69 

-4.27 

Ethyl  acetate 

-2.21 

-4.16 

Diethyl  ether 

-2.11 

-4.01 

(c)  Comparison  of  LSER  and  TLSER  regressions 


Solvent  Ln  obs  (1  / 1) 

In  ko 

0.24  mu 

-9.61  in) 

In  ko 

1 .29  q- 

1 

CO 

00 

bo 

n-Pentane 

0.000 

2.14 

0.00 

-2.11 

3.85 

0.03 

-3.99 

n-Hexane 

-0.100 

2.14 

0.00 

-2.21 

3.85 

0.03 

-4.01 

n-Octane 

-0.148 

2.14 

0.00 

-2.31 

3.85 

0.03 

-4.02 

n-Nonane 

-0.191 

2.14 

0.00 

-2.40 

3.85 

0.03 

-4.03 

n-Decane 

-0.239 

2.14 

0.00 

-2.40 

3.85 

0.03 

-4.10 

n-Dodecane 

-0.300 

2.14 

0.00 

-2.40 

3.85 

0.03 

-4.02 

n-Tetradecane 

-0.322 

2.14 

0.00 

-2.50 

3.85 

0.03 

-4.25 

Cyclopentane 

-0.223 

2.14 

0.00 

-2.40 

3.85 

0.02 

-4.12 

Cyclohexane 

-0.315 

2.14 

0.00 

-2.50 

3.85 

0.01 

-4.33 

Cycloheptane 

-0.419 

2.14 

0.00 

-2.59 

3.85 

0.00 

-4.34 

Cyclooctane 

-0.482 

2.14 

0.00 

-2.59 

3.85 

0.00 

-4.24 

Methylcyclohexane 

-0.315 

2.14 

0.00 

-2.40 

3.85 

0.07 

-4.25 

Methanol 

0.693 

2.14 

0.41 

-1.92 

3.85 

0.42 

-3.62 

Ethanol 

0.478 

2.14 

0.41 

-2.11 

3.85 

0.42 

-3.81 

Propan-2-ol 

0.223 

2.14 

0.40 

-2.21 

.85 

0.41 

-3.92 

Pentanol 

0.062 

2.14 

0.40 

-2.40 

.85 

0.42 

-4.05 

(Continued) 
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TABLE  VI 
(Continued) 


(c)  Comparison  of  LSER  and  TLSER  regressions 


Solvent 

Ln  obs  (1  It) 

In  /Cq 

0.24  mu 

-9.61  (n) 

In  /Co 

1.29  q  - 

—  38.87r.| 

Decano! 

-0.122 

2.14 

0.39 

-2.50 

.85 

0.42 

-4.24 

Acetonitrile 

0.968 

2.14 

0.94 

-2.02 

3.85 

0.15 

-4.00 

Propionithle 

0.777 

2.14 

0.97 

-2.11 

3.85 

0.14 

-3.90 

Dimethyl  sulfoxide 

0.598 

2.14 

0.95 

-2.69 

3.85 

0.93 

-4.27 

HMPA 

0.186 

2.14 

-2.59 

3.85 

0.85 

-4.47 

Ethyl  acetate 

0.616 

2.14 

0.43 

-2.21 

3.85 

0.46 

-4.16 

Diethyl  ether 

0.416 

2.14 

0.28 

-2.11 

3.85 

0.44 

-4.01 

implemented  in  the  ZINDO  program  [29-33].  The 
model  develops  a  realistic  shape  of  the  cavity 
around  the  solute  by  surrounding  each  atom  by 
intersecting  spheres  with  radii  equal  to  the  van  der 
Waals  radii.  The  surface  is  divided  into  patches 
(tesserae)  by  a  triangulation  technique  [34-36]. 
Each  tessera  is  represented  by  a  point  which  car¬ 
ries  the  entire  charge  of  that  patch. 

The  PCM  model  used  in  this  work  was  pre¬ 
sented  in  detail  elsewhere  [37,  38].  The  new  fea¬ 
tures  of  the  PCM  model  developed  include  several 
different  approaches  for  estimating  the  electro¬ 
static  potential  of  the  solute.  Charges  derived  from 
the  Mulliken  population  analysis,  from  the  zero 
differential  overlap  (ZDO)  density  matrix,  and 
from  the  ZDO  density  matrix  plus  point  atomic 
dipoles  were  implemented  to  estimate  the  electro¬ 
static  potential  on  the  tessera.  In  addition,  we 
made  algorithmic  improvements,  including  calcu¬ 
lating  the  charges  on  the  tessera  internal  to  the 
self-consistent  field  (SCF)  procedure,  that  speed  up 
the  calculations  significantly  and  lead  to  a  good 
agreement  between  the  calculated  and  experimen¬ 
tal  solvation  energies. 

The  results  obtained  from  preliminary  calcula¬ 
tions  [37,  38]  show  that  the  ''ZDO  charge  dipole" 
approximation  to  estimate  the  solution  electro¬ 
static  potential  appears  to  be  most  suitable  for  the 
description  of  the  solvent-solute  interactions. 
Therefore,  we  chose  this  latter  method  for  our 
studies  on  czs-stilbene. 

According  to  the  general  expression  (PCM), 

C7(r)  =  4^- Ar^  (,)  +  (13) 

47r^  ^  ^ 


the  charge  density  on  the  cavity  surface  cr(r)  is  a 
function  of  the  dielectric  constant  of  the  medium, 
e.  The  electrostatic  part  of  the  total  energy,  Epoiar' 
is  thus  screened  through  e  which  is  the  only 
solvent  characteristic  within  the  time-independent 
PCM  model. 

The  dipole  moments  of  cfs-stilbene  and  those 
for  the  geometry  calculated  as  an  approximation 
for  a  transition-state  structure  in  gas  phase  (twist 
angle  =  105.0°)  were  calculated  via  PCM  in  a  vari¬ 
ety  of  solvents.  The  results  are  given  in  Table  VII. 
These  dipole  moments  alone  can  reproduce  the 
experimental  decay  rates  with  a  statistical  data  fit 
of  =  0.774  with  an  rms  error  of  0.26.  A  regres¬ 
sion  with  these  dipoles  and  the  molecular  volume 
yields  a  statistical  data  fit  of  R  ^  0,84  and  an  rms 
error  of  0.23. 

A  significant  question  in  this  study  is  the  link 
between  different  computational  approaches  for 
obtaining  the  solvation  descriptors.  A  regression 
analysis  on  the  PCM  calculated  TS  dipoles  shows 
that  the  TLSER  q^  is  correlated  with  a  goodness  of 
fit  of  0.95  over  the  solvent  molecules  data  set 
without  the  outliers.  This  goodness  of  fit  does  not 
change  appreciably  when  the  solvent  molecules  in 
Table  II,  for  which  there  is  no  experimental  data, 
are  also  included  in  the  regression.  However,  the 
goodness  of  fit  deteriorates  dramatically  to  0.77 
when  the  outlier  molecules,  acetonitrile,  propio- 
nitrile,  and  ethyl  acetate,  are  included  in  the  re¬ 
gression.  This  suggests  that  current  advances  in 
solvation  theory  such  as  the  PCM  can  provide 
significant  insight  into  solvation  descriptors  such 
as  those  used  in  the  TLSER  and  in  the  LSER 
empirical  descriptor  set. 
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TABLE  VII _ 

PCM  dipole  moments  calculated  for  approximate  transition-state  (TS-105)  and  c/s-stllbene  geometries. 


Dielectric  Experimental 


Solvent 

Constant 

TS-105  (D) 

c/s-Stilbene  (D) 

lifetime  (ps) 

Gas  phase 

1.000 

0.590 

0.319 

0.32 

Acetonitrile 

37.500 

0.846 

0.548 

0.38 

Cycloheptane 

1.965 

0.680 

0.402 

1.52 

Cyclohexane 

2.023 

0.683 

0.405 

1.37 

Cyclononane® 

2.000 

0.682 

0.404 

— 

Cyclooctane® 

2.000 

0.682 

0.404 

1.37 

Cyclopentane 

1.965 

0.680 

0.402 

1.25 

Decane 

1.991 

0.682 

0.403 

1.27 

Decanol 

8.100 

0.803 

0.511 

1.26 

Diethyl  ether 

4.335 

0.763 

0.475 

0.66 

Dimethyl  sulfoxide 

4.700 

0.767 

0.480 

0.59 

Dioxane 

2.209 

0.695 

0.414 

0.73 

Dodecane 

2.015 

0.683 

0.405 

1.35 

Ethanol 

24.300 

0.839 

0.542 

0.62 

Ethyl  acetate 

6.020 

0.787 

0.496 

0.54 

Heptane 

1.920 

0.678 

0.399 

— 

Hexadecane® 

2.000 

0.682 

0.404 

1.59 

Hexane 

1.890 

0.675 

0.397 

1.05 

hmpa® 

2.000 

0.682 

0.404 

0.83 

Methanol 

32.630 

0.844 

0.546 

0.50 

2-Methylbutane 

1.843 

0.672 

0.394 

1.00 

Methylcyclohexane 

2.020 

0.683 

0.405 

1.37 

Nonane 

1.972 

0.681 

0.402 

1.21 

Octane 

1.948 

0.679 

0.401 

1.16 

Pentadecane® 

2.000 

0.682 

0,404 

— 

Pentane 

1.844 

0.672 

0.394 

0.97 

Pentanol 

13.900 

0.825 

0.530 

0.94 

Propan-2-ol 

18.300 

0.835 

0.537 

0.46 

Propionitrile^ 

37.50 

0.846 

0.548 

0.46 

Tetradecane® 

2.000 

0.682 

0.404 

1.37 

Tridecane® 

2.000 

0.682 

0.404 

1.44 

2,2,4-Trimethylpentane 

1.940 

0.679 

0.401 

1.10 

Undecane 

2.005 

0.682 

0.404 

— 

^hmpa  =  hexamethylphosphoramide.  Dielectric  constant  assumed  to  be  =  2.0. 
^Dielectric  constant  assumed  equal  to  that  of  acetonitrile. 
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ABSTRACT _ 

In  recent  experimental  studies  of  infrared  (IR)  spectra  of  hydrated  phenol  clusters,  the 
Mikami  and  Ebata  group  at  Tohoku  University  attributed  apparently  two  very  different 
spectra  to  Ph0H(H20)4.  The  first  spectrum  has  a  wide  transparent  region  between  3500 
and  3700  cm  which  they  named  a  window  region.  The  second  spectrum  has  several 
absorption  bands  in  this  window  region.  Our  previous  study  revealed  that  the  first 
spectrum  was  assigned  to  the  isomers  which  have  a  single-ring  structure  of  a  hydrogen¬ 
bonding  network  of  OH's  of  phenol  and  waters.  The  experimentalists  suggested  that  the 
second  spectrum  might  be  identified  to  a  proton-transferred  phenol-water  cluster.  In  the 
present  article,  the  geometrical  structures  of  proton-transferred  hydrated  phenol  clusters 
were  determined  with  the  ab  initio  molecular  orbital  method  and  their  IR  spectra  were 
calculated.  The  proton-transferred  phenol-water  cluster  is  at  a  stable  local  minimum,  but 
the  energy  is  much  higher  than  that  of  the  most  stable  nontransferred  cluster.  The 
calculated  IR  spectrum  has  a  still  wider  window  region  and  is  far  different  from  the 
experimental  spectrum  of  the  second  type.  ©  1996  John  Wiley  &  Sons,  Inc. 


Introduction 

The  proton-transfer  reaction  is  one  of  the  typical 
chemical  reactions.  As  described  even  in  high 
school  textbooks,  a  part  of  the  water  molecules  in 
pure  bulk  water  are  dissociated  to  the  oxionium 
ion  H30'^  and  the  hydroxyl  ion  OH”.  The  dissoci¬ 
ation  constant  of  pure  water  is  1.0  X  10”^^  at 
room  temperature.  Here  is  an  intriguing  question 
in  cluster  chemistry:  Does  the  proton-transfer  reac¬ 


tion  take  place  in  small  water  clusters?  Very  re¬ 
cently,  Lee  et  al.  theoretically  predicted  that  pro- 
ton-transferred  clusters  could  exist  in  such  small 
water  clusters  as  H30^0H”(H20)3  and 
H30^0H”(H20)6  [1].  They  are  much  less  stable 
than  are  the  well-investigated  normal  water  clus¬ 
ters  (H20)5  and  (H20)8  [2-12],  but  the  proton- 
transferred  water  clusters  are  at  a  stable  local  min¬ 
ima.  When  the  cluster  has  an  acidic  nature,  the 
proton-transfer  reaction  more  easily  takes  place. 
Recently,  we  reported  that  in  the  hydrated  singly 
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charged  boron  cation-water  clusters  B^(H20)„  the 
proton-transfer  reaction  proceeds  without  any  bar¬ 
rier  for  n  >  2  [13]: 

^  B0H(H30^)(H20)„_2.  (1) 

Phenol  is  one  of  the  fundamental  organic  acids. 
One  can  expect  that  the  possibility  of  the  proton- 
transfer  reaction  in  hydrated  phenol  clusters  is 
higher  than  in  pure  water  clusters.  This  possibility 
was  first  noticed  by  the  Mikami  group  at  Tohoku 
University,  when  they  studied  the  infra-red  (IR) 
spectra  of  phenol- water  clusters  PhOH(H20)^,  (for 
n  <  4),  [14,  15].  Up  to  n  =  3,  the  observed  spectra 
have  a  transparent  region  between  antisymmetric 
and  symmetric  stretching  modes  of  water 
molecules  around  3500-3700  cm”h  The  region  is 
named  the  '"window  region"  by  Mikami's  group 
[14-16].  Our  previous  theoretical  studies  revealed 
that  the  spectra  having  a  window  region  are  as¬ 
signed  to  the  isomers  of  a  single-ring  structure 
consisting  of  an  OH  of  phenol  and  an  OH  of  each 
of  n  water  [17].  In  their  "mass-selected"  experi¬ 
ments,  two  different  IR  spectra  were  attributed  to 
Ph0H(H20)4.  One  of  the  spectra  is  similar  to  those 
for  n  =  2  and  3  and  is  identified  to  the  isomers  of 
a  ring  structure.  On  the  other  hand,  the  other  has 


no  window  region  and  shows  several  absorption 
bands  in  this  region.  To  explain  this  second  spec¬ 
trum,  the  proton-transferred  phenol-water  clus¬ 
ters  were  suggested  [14,  15]. 

In  the  present  article,  the  geometric  structures 
of  proton-transferred  phenol-water  clusters,  which 
are  regarded  as  derivatives  of  proton-transferred 
water  clusters  reported  in  [1],  are  determined  with 
the  ab  initio  molecular  orbital  method.  The  IR 
spectra  of  the  proton-transferred  phenol-water 
clusters  as  well  as  pure  water  clusters  are  calcu¬ 
lated,  and  the  possibility  of  identification  of  the 
second  type  of  spectrum  in  [14,  15]  is  examined. 


Method 

The  closed-shell  self-consistent  field  (SCF) 
molecular  orbital  method  is  used  for  optimization 
of  the  geometries  and  for  evaluation  of  the  IR 
spectra.  The  basis  sets  used  are  6-31  -h  G*  for 
oxygen  atoms  and  the  6-31G*  set  for  the  rest  of 
atoms  (carbon  and  hydrogen).  It  is  important  to 
place  diffuse  functions  on  the  oxygen  for  describ¬ 
ing  the  hydroxyl  anion  OH“.  In  [1],  the  basis  set 
used  is  6-31  +  G**  for  all  atoms.  For  (H20)5  and 


FIGURE  1.  Structures  of  ring  type  (1)  of  PhOH(H20)4  cluster  and  proton-transferred  clusters  of  (2)  PhO^dHgO^) 
(H20)3  and  (3)  (H30^)Ph0H(H20)3(0H ").  The  geometric  parameters  were  determined  with  the  SCF/6-31  -r  G* 
method.  Units  are  given  in  A. 
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H30'^(H20)30H',  the  total  energies  are  recalcu¬ 
lated  with  the  frozen-core  second-order  Moller- 
Plesset  perturbation  (MP2)  method.  The  present 
basis  set  is  smaller  than  that  in  [1],  but  large 
enough  to  describe  the  dissociation  to  ion-pair 
clusters.  The  scaling  factor  for  harmonic  frequen¬ 
cies  is  0.8976,  which  is  fixed  by  adjusting  the 
symmetric  and  antisymmetric  stretching  modes  of 
a  free  water  molecule.  Programs  used  are  GAUSS¬ 
IAN  92  and  GAUSSIAN  94  packages  [18,  19]. 


Results  and  Discussion 

In  our  previous  work  [17],  the  most  stable  iso¬ 
mer  of  PhOH(H20)4  has  a  single-ring  structure 
consisting  of  the  hydrogen-bonding  network  of 
OH  groups  of  phenol  and  water  molecules.  Struc¬ 
ture  1  in  Figure  1  depicts  the  ring  structure 
of  the  Ph0H(H20)4  cluster.  Structures  2  and  3 
are  two  proton-transferred  phenol-water  clusters 
Ph0H(H20)4,  which  are  regarded  as  derivatives 
of  the  proton-transferred  pure  water  cluster 
(H30'^)(H20)3(0H  )  reported  in  [1].  Isomer  2  has 
the  structure  Ph0“(H20)3H30^,  in  which  a  pro¬ 
ton  of  phenol  is  transferred  to  an  oxionium  ion.  On 
the  other  hand,  isomer  3  has  the  structure 
H30^Ph0H(H20)20H“,  in  which  a  proton  of  wa¬ 
ter  is  transferred.  The  proton-transferred  cluster 
(HjO'^XHjOljCOH”)  has  a  C3  symmetry  and  the 
C3  axis  is  through  the  oxygen  atom  of  the  oxion¬ 
ium  ion  HjO"^  via  the  hydroxyl  ion  OH  .  In  struc¬ 
tures  2  and  3,  the  local  C3  symmetry  is  almost 
preserved,  although  an  OH”  ion  in  the  water 
clusters  is  replaced  with  a  PhO”  ion  in  the  phe¬ 
nol-water  clusters. 


The  relative  energies  of  proton-transferred  phe¬ 
nol-water  and  pure  water  clusters  are  summa¬ 
rized  in  Table  I.  Although  a  stable  local  minimum 
is  found  for  Ph0”(H20)3H30^  (structure  2),  the 
total  energy  is  138.43(144.49)  kj/mol  higher  than 
that  of  the  ring  Ph0H(H20)4  (structure  1)  without 
(with)  zero-point  vibration  correction.  Isomer  2  is 
more  stable  than  is  isomer  3,  as  is  expected  from 
chemical  common  sense  that  the  acidity  of  PhOH 
is  stronger  than  that  of  H2O.  The  energy  difference 
between  isomers  2  and  3  is  23.62  kJ/mol  without  a 
zero-point  vibration  collection  and  23.56  kJ/mol 
with  a  zero-point  vibration  collection.  The  energy 
difference  between  isomers  1  and  3  is  almost  the 
same  as  the  energy  difference  between  (H20)5  and 
H30'^(H20)30H“,  because  the  generated  ion  pair 
is  the  same.  The  energy  difference  between  (H20)5 
and  H30^(H20)30H”  with  the  SCF  approxima¬ 
tion  is  overestimated  compared  with  that  in  [1]. 
But  the  single-point  MP2  (frozen  core)  energy  is  in 
good  agreement  with  that  in  [1],  in  spite  of  the 
smaller  basis  set.  Thus,  if  we  assume  that  the 
electron  correlation  effect  on  the  proton-transfer 
reaction  is  similar  to  each  other  for  the  clusters 
(H20)5  and  Ph0H(H20)4,  we  may  be  able  to  dis¬ 
cuss  qualitatively  the  stability  of  the  PhOH(H20)4 
and  its  ion-pair  clusters. 

The  calculated  infrared  stick  spectra  of  the  clus¬ 
ters  in  the  — OH  frequency  region  are  shown  in 
Figure  2.  Their  numerals  are  summarized  in  Table 
II.  Only  dominant  components  are  given  for  as¬ 
signments  of  each  absorption  band  in  Figure  2  and 
Table  II.  Spectrum  A  in  Figure  2  shows  the  stick 
spectra  of  a  free  water  molecule,  an  oxionium 
ion  HjO^,  and  a  hydroxyl  ion  OH“  for  compari¬ 
son.  The  calculated  stick  spectra  of  proton-trans- 


Relative  energy  difference  without  zero-point  vibration  (- AE)  and  with  zero-point  vibration  (- AE^p^) 
correiation  of  the  proton-transferred  Ph0H(H20)4  and  (1120)5  clusters. 

—  AE^pv 

-AE  SCF  MP2//SCF 


PhOH(H20)4  (ring) 

(1) 

+0.00 

+0.00 

Ph0”(H20)3H30  + 

(2) 

+  138.43 

+  144.49 

H30-"Ph0H(H20)2(0H-) 

(3) 

+  162.05 

+  168.05 

(H20)5  (ring) 

+0.00 

+0.00 

H30'"(H20)30H- 

+  159.84 

+  168.69 

-tO.OO  (+0.00) 

+  100.74  (+92.05) 


For  -AE^  „  with  MP2  (frozen  core)  evaluation,  the  zero-point  vibration  energies  with  the  SCF  level  of  approximation  are  added  into 
the  total  energies  calculated  with  the  single-point  MP2  (frozen  core)  level  of  approximation.  Energy  standards  are  set  at  the  most 
stable  ring  isomers  of  PhOH(H20)4  and  (HaOlj.  The  values  in  parentheses  are  in  [1]  with  the  MP2/6-31  +  G**  level  of 
approximation.  Units  are  given  in  kJ  /  mol. 
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ferred  clusters  (HjO^XHjOljCOH"),  PhO^ 
(H20)3(H30+)  (structure  1),  and  (HjO-^lPhOH 
(H20)2(0H“)  (structure  3)  are  spectra  B,  C,  and  D, 
respectively. 

In  previous  theoretical  studies,  the  IR  spectra  of 
the  pure  water  cluster  (H20),^  [2,  3]  and  hydrated 
phenol  clusters  Ph0H(H20)„  [17],  both  of  which 
have  a  single-ring  structure,  had  similar  character¬ 


istics  in  the  spectra.  The  antisymmetric  stretching 
modes  of  water  molecules  do  not  move  much.  In 
contrast,  the  symmetric  stretching  modes  of  water 
molecules  show  large  shifts  toward  the  lower  en¬ 
ergy  from  a  free  water.  The  former  mode  is  more 
or  less  localized  in  nonhydrogen  bonding  OH's, 
and  the  latter  is  on  a  hydrogen-bonding  OH,  form¬ 
ing  a  ring.  In  phenol-water  clusters,  the  OH 


A.  Free  H2O  ,  OH  ,  and  (H3O+) 

Free  OH 


_ 

Free  H30‘^ 

- 

1  1 

1 

Free  H2O 

1 - 1 

1  1 

1 

1 

2800 

3000 

r  1 

3200 

T  *1 - ■— 1 

3400 

— , - . , - - — 1 

3600  3800 

B.  (H30+)(H20)3(OH-) 


HsO'*’  H2O  symm. 

I - Fj  I 


i 

1  HoO  anti.  OH 

- - r— 

J  _ 

1  (  1  I  ^  I  I  I  I  j - j-fci - j 

2800  3000  3200  3400  3600  3800 


c. 


Ph0-{H20)3(H30+) 


H2O  symm. 


n - 1 


D.  (H30+)Ph0H(H20)20H- 

H3O+ 


FIGURE  2.  Calculated  IR  spectra  of  proton-transferred  clusters  and  relative  speices:  (A)  water,  oxionium  ion  and 
hydroxyl  ion;  (B)  HgO-^lHsOgOH-;  (C)  PhO^lHgO^KHaOa;  (D)  (HgO-^lPhOHlHsOalOH-).  Only  dominant 
components  are  given  for  assignments. 
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stretching  mode  of  the  phenol  molecule  also  shifts 
toward  the  low  frequency.  The  low-frequency  shifts 
of  symmetric  stretching  modes  of  water  and  the 
OH  mode  of  phenol  become  larger  for  a  large 
cluster  size.  In  other  words,  the  window  region 
[14-16]  is  wider  for  a  larger  cluster  size  n  in 
(H20)„  and  Ph0H(H20)„  of  a  ring  structure.  For 
the  ring  structure  of  PhOH(H20)4,  the  experimen¬ 
tal  absorption  bands  of  the  OH  stretching  of  PhOH 
and  symmetric  stretching  of  water  appear  in  the 
region  about  3150-3450  cm"\  and  the  window 
region  is  around  3450-3700  cm^h 

But  as  shown  in  Figure  2,  the  window  region 
of  proton-transferred  clusters  is  much  wider  than 
that  of  the  nontransferred  ring  structure  of  PhOH 
(H20)4.  In  the  calculated  spectrum  B  for  (H30‘^) 
(H20)3(0H")  in  Figure  2,  the  window  region  is 
range  3050-3700  cm“h  The  window  region  of 
spectrum  D  (for  structure  3)  is  almost  as  large  as 
spectrum  B.  The  window  region  of  spectrum  C 
(for  structure  2)  is  3200-3700  cm“^,  a  little  nar¬ 
rower  than  that  of  spectrum  B.  It  is,  however,  still 
wider  than  of  the  window  region  of  Ph0H(H20)4, 
By  hydrogen-bonding  formation,  the  stretching 
modes  of  water  and  the  OH  stretching  of  phenol  as 
well  as  the  degenerate  OH  modes  of  oxionium  ion 
H30^  shift  substantially  toward  the  low  fre¬ 
quency. 

According  to  the  analysis  by  Schibberg  and  Luck 
[20],  the  band  shifts  of  water  molecules  are  related 
to  the  strength  of  the  basicity  of  the  proton-accept¬ 
ing  molecule.  When  the  strong  base  molecule  is 
bonded  to  a  water  as  a  proton  acceptor,  the  base 
molecule  attracts  strongly  a  hydrogen  atom  of  a 
water  molecule.  Thus,  the  OH  bond  of  the  water 
becomes  weak  and  the  frequency  of  OH  stretching 
becomes  lower.  The  very  wide  window  regions  of 
proton-transferred  clusters  are  due  to  the  much 
strong  basicity  of  the  hydroxyl  ion  OH“.  Because 
the  basicity  of  PhO~  is  weaker  than  that  of  OH“, 
the  window  region  in  spectrum  C  for  PhO" 
(H30^)(H20)3  is  somewhat  narrower  than  in 
spectrum  B.  The  width  of  the  window  region  of 
spectrum  D  is  almost  same  as  that  of  spectrum  B 
because  the  anion  in  (H30^)Ph0H(H20)2(0H“) 
is  a  hydroxyl  ion,  the  same  as  in  (H30'^)(H20)3 
(OH). 

In  contrast,  the  high-frequency  modes  of  water 
molecules  in  the  cluster  do  not  move  much;  the 
parentage  of  these  modes  is  the  antisymmetric 
mode  of  a  water  molecules.  When  two  OH  bonds 
are  not  equivalent  by  hydrogen  bonding,  this  mode 


becomes  localized  on  a  free  (nonhydrogen  bond¬ 
ing)  OH.  At  the  limit  of  complete  decoupling  of 
two  OH  modes,  the  frequency  of  the  free  OH 
mode  is  given  as  [20] 


where  and  are  the  frequencies  of  symmetric 
and  antisymmetric  modes  of  a  free  water.  In  the 
present  case,  =  3653  and  1^3  =  3750,  and,  thus, 
^lim  =  (*^1  +  *^3)72  ==  3702  [cm~M.  In  these  pro¬ 
ton-transferred  clusters,  the  frequencies  of  ''un¬ 
shifted"  modes  are  about  3715  cm“\  very  close  to 
3702  cm  "7  In  other  words,  two  OH  modes  of  a 
water  in  clusters  are  decoupled,  which  is  evidence 
that  water  molecules  in  proton-transferred  clusters 
are  bonding  to  a  strong  base. 


TABLE  II _ _ _ 

Calculated  harmonic  frequencies  (cm  and 
iR  intensities  (km /mol)  of  proton-transferred 
clusters  and  several  associated  moieties  in  the 
OH  frequency  region;  oniy  dominant  components 
are  written  for  assignments. 


Frequency 

Intensity 

Assignment 

H2O 

3653 

23.4 

sym. 

3750 

87.5 

anti. 

H3O+ 

3385 

62.6 

3481 

598.6 

3481 

598.6 

OH- 

3561 

80.4 

PhO-(H20)3(H30^) 

2832 

836.4 

H3O" 

2867 

876.4 

H3O-" 

2993 

68.9 

— CH  str. 

2996 

5.2 

— CH  str. 

3004 

1140.2 

H30^ 

3010 

10.5 

— CH  str. 

3015 

88.4 

— CH  str. 

3033 

31.4 

— CH  str. 

3127 

1259.5 

H2O 

3145 

1324.3 

H2O 

3205 

2436.5 

H2O 

3714 

156.6 

H2O 

3716 

115.9 

H2O 

3718 

90.0 

(Continued) 

H2O 
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TABLE  II  _ 
(continued) 


Frequency 

Intensity 

Assignment 

(H30"-)(H20)30H  + 

2761 

212.7 

H3O+ 

2761 

212.7 

H30+ 

2958 

2398.4 

H2O 

2958 

2398.4 

H2O 

2961 

1619.0 

3041 

888.6 

H2O 

3716 

142.1 

H2O 

3716 

142.1 

H2O 

3718 

4.4 

H2O 

3723 

12.7 

OH  -  str. 

(H30^)Ph0H(H20)g(0H-) 


2738 

516.0 

2746 

271.0 

OH  of  PhOH 

2888 

2908.4 

H3O  + 

2974 

2003.1 

H2O 

3012 

9.7 

— CH  str. 

3020 

9.4 

— CH  str. 

3032 

59.2 

— CH  str. 

3039 

56.0 

— CH  str. 

3044 

470.6 

— CH  str. 

3048 

1942.5 

HaO^ 

3052 

465.2 

H2O 

3716 

145.3 

H2O 

3717 

63.9 

H2O 

3721 

17.0 

OH"  str. 

Conclusion 

The  IR  spectra  of  proton-transferred  clusters 
have  a  very  wide  window  region  because  of  the 
strong  basicity  of  OH^  and  PhO“.  The  possibility 
of  the  identification  of  the  second  spectrum  of 
Ph0H(H20)4  to  such  proton-transferred  clusters  is 
ruled  out.  An  alternative  species  for  the  second 
spectrum  is  a  larger  size  of  clusters  than 
Ph0H(H20)4.  In  [17],  we  demonstrated  that  sev¬ 
eral  IR  bands  are  found  in  region  3500-3700  cm“\ 
if  the  cluster  has  a  water  molecule  with  both  of  the 
hydrogen  atoms  of  the  molecule  bonded  to  other 
water  molecules.  The  water  molecule  in  which  two 
hydrogens  are  bonded  to  base  molecules  was 
named  the  "double  proton  donor"  by  Jordan  and 
Zwier's  group  [21,  22].  Some  isomers  of 
Ph0H(H20)5  contain  two  such  double  proton 
donor  waters.  The  second  spectrum  could  be  as¬ 
signed  to  one  of  those  isomers.  It  might  be  difficult 


to  experimentally  find  the  proton-transferred  clus¬ 
ters  in  such  small  clusters  as  Ph0H(H20)4  and 
(H20)5  because  of  their  instability.  But  one  could 
obtain  the  proton-transferred  clusters  in  larger  sizes 
by  matrix  isolation  techniques.  By  measuring  the 
IR  spectra,  the  wide  window  region  could  be  a  key 
of  detecting  the  proton-transferred  clusters. 
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ABSTRACT _ _ _ _ _ 

We  present  an  assessment  on  the  accuracy  of  a  dielectric  continuum  solvation  model,  the 
generalized  conductor-like  screening  model  (GCOSMO),  for  predicting  hydration  free 
energies,  tautomeric  equilibria,  and  reaction  profiles  in  solution.  ©  1996  John  Wiley  & 

Sons,  Inc. 


Introduction 

For  the  past  half  century,  quantum  chemistry 
has  made  significant  progress  in  the  develop¬ 
ment  of  theory  as  well  as  its  applications  to  predict 
physical  properties  of  gas-phase  processes.  Re¬ 
cently,  theoretical  efforts  have  been  shifting  focus 
toward  solution  chemistry.  In  particular,  many 
theoretical  approaches  have  been  proposed  for 
modeling  solvation.  The  dielectric  continuum  ap- 

*  To  whom  correspondence  should  be  addressed. 

^  On  leave  from  the  Institute  of  Chemical  Physics,  Univer¬ 
sity  of  Latvia,  19  Rainis,  Blvd.,  Riga,  LV  1586,  Latvia. 


proach  [1-3],  however,  offers  the  simplest  method¬ 
ology  for  incorporating  solvent  effects  into  a  wide 
range  of  ab  initio  molecular  orbital  (MO)  and  den¬ 
sity  functional  theory  (DFT)  levels  of  theory.  In  the 
dielectric  continuum  approach,  the  solvated  sys¬ 
tem  is  modeled  as  the  solute  inside  a  cavity  sur¬ 
rounded  by  a  dielectric  continuum  medium  char¬ 
acterized  by  the  dielectric  constant  e.  Models  based 
on  molecular-shape  cavities  have  been  shown  to 
provide  a  more  accurate  description  of  solvent-so¬ 
lute  interactions  than  those  based  on  simple  spher¬ 
ical  or  ellipsoidal  cavity  [1,  3].  For  molecular-shape 
cavities,  several  promising  approaches  have  been 
introduced.  In  particular,  the  SMx  models  [1], 
which  are  based  on  the  generalized  Bom  formal- 
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ism  within  the  semiempirical  molecular  orbital 
theory,  are  quite  successful  in  calculating  free  en- 
ergy  of  solvation.  However,  the  deficiency  of  the 
semiempirical  molecular  orbital  wave  function 
makes  them  less  useful  for  modeling  reactions  in 
solution.  Thus,  to  model  reactions  in  solution,  more 
accurate  representation  of  the  solute  is  required 
such  as  using  DFT  or  ab  initio  molecular  orbital 
theories.  In  this  case,  one  can  use  the  DFT/PB 
model  [4-7],  which  combines  DFT  for  solute  den¬ 
sity  distribution  and  the  Poisson-Boltzmann  (PB) 
method  for  interaction  with  the  dielectric  contin¬ 
uum  medium.  Alternatively,  the  polarizable  con¬ 
tinuum  model  (PCM)  [3,  8]  can  be  employed.  This 
model  based  on  the  Poisson  boundary  condition 
has  been  implemented  at  the  DFT  and  ab  initio 
MO  levels  of  theory.  Although  both  the  DFT/PB 
and  PCM  models  can  provide  accurate  description 
of  the  solute  in  the  dielectric  cavity,  their  use  is 
computationally  rather  expensive.  Several  approxi¬ 
mations  have  also  been  developed.  In  the  reaction 
field  factors  approach  [9],  the  PCM  equations  are 
simplified  by  using  multipole  expansion  for  the 
solute  density.  We  recently  suggested  the  general¬ 
ized  conductor-like  screening  model  (GCOSMO), 
which  is  based  on  the  COSMO  model  originally 
proposed  by  Klamt  and  Schriiumann  within  the 
semiempirical  MO  formalism  [10].  In  GCOSMO, 
accurate  ab  initio  electronic  wave  function  can  be 
used  for  solute  electrons  and  nonelectrostatic  con¬ 
tributions  are  added  to  the  free  energy  of  solvation 
[11-14].  GCOSMO  approximate  boundary  condi¬ 
tion  allows  significant  reduction  in  computational 
demand  as  compared  to  the  DFT/PB  and  PCM- 
based  models  while  maintaining  comparable  accu¬ 
racy.  Since  DFT/PB,  PCM,  and  GCOSMO  models 
were  implemented  within  the  DFT  and/or  ab  ini¬ 
tio  MO  theory,  they  can  be  employed  for  quantita¬ 
tive  modeling  of  reactions  in  solution.  However, 
before 

extensive  use  of  these  models  to  real  systems, 
thorough  analyses  of  their  accuracy  should  be 
performed. 

Accuracy  of  continuum  solvation  models  de¬ 
pends  on  (1)  boundary  conditions  on  the  surface  of 
the  cavity,  (2)  the  atomic  radii  used  in  determining 
the  cavity  size,  and  (3)  the  validity  of  the  contin¬ 
uum  approximation  itself.  In  this  study,  we  focus 
our  attention  on  how  these  three  factors  affect  the 
accuracy  of  the  GCOSMO  model. 

The  GCOSMO  model  uses  the  boundary  condi¬ 
tion  from  a  cavity  in  a  conductor  and  then  scales 


the  surface  charges  for  a  dielectric  medium.  This  is 
fundamentally  different  from  the  exact  boundary 
condition  used  in  the  PCM  model.  In  our  previous 
study  [11],  we  have  shown  a  good  agreement 
between  GCOSMO  and  PCM  hydration  energies 
for  a  large  set  of  molecules  and  ions  when  the 
same  atomic  radii  were  used.  This  shows  that  the 
GCOSMO  approximation  is  quite  accurate  for  sol¬ 
vents  with  high  dielectric  constants  such  as  water. 
However,  in  principle,  the  GCOSMO  scaling  pro¬ 
cedure  may  yield  errors  for  solvents  with  low 
dielectric  constants,  though  no  systematic  analysis 
has  been  done.  To  estimate  the  magnitude  of  these 
errors,  here  we  compare  results  from  GCOSMO 
and  PCM  calculations  with  experimental  data  for 
the  tautomeric  equilibria  between  2-hydroxypyri- 
dine  and  2-pyridone  in  several  solvents  having  a 
wide  range  of  dielectric  constants. 

In  our  previous  work  [11],  we  optimized  a  set  of 
atomic  radii  for  construction  of  molecular  cavities 
for  both  GCOSMO  and  PCM  calculations.  With 
these  atomic  radii,  GCOSMO  model  reproduces 
experimental  hydration  energies  with  an  average 
unsigned  difference  of  about  1  kcal/mol  for  neu¬ 
tral  solutes  and  of  order  2—4  kcal/mol  for  ions  that 
were  used  in  the  fit.  It  is  desirable  to  test  applica¬ 
bility  of  these  radii  for  a  larger  set  of  molecules 
and  ions  that  were  not  included  in  the  fit.  In  this 
study,  we  calculated  hydration  free  energies  for  39 
molecules  and  5  ions  playing  an  important  role  in 
protein  structure  and  function. 

Finally,  in  this  study,  we  perform  a  GCOSMO 
study  on  hydration  effects  on  the  potential  surface 
for  the  proton  transfer  reaction  in  the 
[H3N-H-NH3]'^  system.  Comparison  of  these  re¬ 
sults  with  previous  QM/MM  Monte  Carlo  simula¬ 
tions  [15]  provides  a  test  of  the  applicability  of  the 
GCOSMO  model  for  modeling  of  reaction  profiles 
in  solutions  and  also  illustrates  the  use  of 
GCOSMO  gradient  in  determining  geometries  of 
stationary  points  on  the  free  energy  surface. 

This  study  is  organized  as  follows.  In  the  theory 
section  we  present  a  brief  overview  of  the 
GCOSMO  model.  The  application  section  is  di¬ 
vided  into  three  parts:  the  first  examines  solvent 
effects  on  the  tautomeric  equilibria  of  2-hydroxy- 
pyridine/2-pyridone  in  different  solvents.  The  sec¬ 
ond  provides  a  test  of  fitted  atomic  radii  by  calcu¬ 
lating  free  energies  of  solvation  for  a  large  number 
of  molecules  and  ions  which  were  not  studied  by 
the  GCOSMO  earlier.  The  third  discusses  hydra¬ 
tion  effects  on  the  potential  surface  the  proton 
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transfer  reaction  in  the  [H3N-H-NH3]‘^  system. 
Finally,  conclusions  and  future  directions  are  given. 


Theory 

GENERALIZED  CONDLCTOR-LIKE  SCREENING 
MODEL  (GCOSMO) 


The  GCOSMO  model  is  a  generalization  of  the 
semiempirical  COSMO  method  [10]  to  have  an 
accurate  ab  initio  description  for  the  solute  charge 
density  and  also  to  include  dispersion,  repulsion, 
and  cavity  formation  contributions  to  the  free  en¬ 
ergy.  The  GCOSMO  model  was  incorporated  into 
classical  ab  initio  MO  and  DFT  frameworks.  The 
central  approximation  of  this  model  is  to  scale  the 
screening  conductor  surface  charge  by  a  factor  of 
f(e)  =  (e  ~  D/e  to  satisfy  the  Gauss  theorem  for 
the  total  surface  charge  for  a  solute  in  a  dielectric 
medium  specified  by  the  dielectric  constant  e.  The 
screening  conductor  surface  charge  can  be  ob¬ 
tained  directly  from  the  boundary  condition  that 
the  total  potential  on  the  surface  of  the  cavity  in  a 
conductor  is  zero: 


L 

i 


2,- 

IR  -  R,l 


r  p(r') 
K  Ir  -  r1 


d^r' 


r  O-(r') 

“^Islr-r'l 


=  0, 


(1) 


where  z-  and  R-  are  values  and  positions  of  N 
nuclear  charges,  p(r)  is  the  solute  electronic  den¬ 
sity  and  the  solvent  reaction  field  is  represented  by 
the  surface  charge  density  o-(r). 

Within  the  boundary  element  approach,  a  cavity 
boundary  is  defined  by  M  surface  elements  with 
areas  {S„}.  The  surface  charge  density  at  each  sur¬ 
face  element  is  approximated  as  a  point  charge, 
{qj,  located  at  the  center  of  that  element,  {tj.  The 
surface  charge  distribution  is  then  given  by 


q=  -/(e)A-HBz4-c),  (2) 

where  A,  B,  and  c  are  M  X  M,  M  X  N,  and  M  X  1 
matrices,  respectively,  with  matrix  elements  de¬ 
fined  by  [10] 


A 


uv 


for  u  ¥=  V  and 


A 


U  U 


B...  = 


it„-R,r 


hr-  t„i  ' 


(4) 
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and  z  is  the  vector  of  N  nuclear  charges.  The  total 
free  energy  of  the  whole  system  (solute  +  surface 
charges)  is  then  given  by 


-  l/(e)z"B"A-iBz  +  (9) 

where  is  the  solute  nuclear-nuclear  repulsion. 
The  solvent  contributions  to  the  one-  and  two-elec¬ 
tron  terms  of  the  Fock  matrix  elements  and 

respectively)  are  expressed  as 

-/(e)ztB*A-iL^,,  (10) 

G;,=  -/(e)c^A-iL^,,  (11) 

where 


and  is  the  density  matrix  element;  is 

the  non-electrostatic  part  of  the  free  energy  of 
solvation  that  includes  the  dispersion,  repulsion, 
and  cavity  formation  contributions.  For  the  disper¬ 
sion  and  short-range  repulsion  contributions,  we 
adopted  Floris  et  al.  method  [16].  For  the  cavity 
formation  term,  we  employed  the  scaled  particle 
fluid  theory  of  Pierotti  [17],  which  was  trans¬ 
formed  by  Huron  and  Claverie  [18]  into  an  atom- 
molecule-type  formalism.  Other  non-elecrostatic 
contributions — such  as  an  entropy  change  due  to 
the  solvent  reorganization,  an  enthalpy  change  due 
to  changes  in  the  solute  vibrational  and  rotational 
degrees  of  freedom,  a  charge  transfer  to  the  sol¬ 
vent,  non-electrostatic  components  of  the 
solute-solvent  hydrogen  bonding,  etc. — are  effec¬ 
tively  included  in  the  atomic  radii  defining  the 
cavity  boundary.  The  cavity  boundary  was  defined 
by  the  solvent-excluding  surface  (also  known  as 
molecular  surface)  proposed  by  Richard  [19].  Note 
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that  in  some  cases  when  the  solute  can  make 
strong  hydrogen  bonds  with  solvent  molecules, 
the  GCOSMO  approach  can  be  improved  by  in¬ 
cluding  several  explicit  solvent  molecules.  This 
will  be  discussed  more  details  in  a  separate  report. 

We  have  implemented  the  above  formalism  into 
the  HF,  MP2,  CIS,  and  local  and  non-local  DFT 
theories.  Our  results  indicate  that  GCOSMO  solva¬ 
tion  energy  gradient  and  Hessian  calculations  on 
the  average  require  only  about  10-20%  more  com¬ 
putational  time  as  compared  to  corresponding 
gas-phase  calculations.  This  is  noticeably  faster 
than  other  existing  ab  initio  MO  or  DFT  dielectric 
continuum  models  based  on  a  general  cavity. 


Applications 

TAUTOMERIC  EQUILIBRIA  OF 
2-HYDROXYPYRIDIi\E/  2-PYRIDOI\E  IN 
SOLUTIONS 

Tautomeric  equilibria  of  heterocyclic  com¬ 
pounds  in  solution  are  very  sensitive  to  the  sol¬ 
vent-solute  interaction,  thus  providing  excellent 
tests  for  accuracy  of  different  solvation  models. 
These  properties  have  also  been  the  subjects  of 
recent  reviews  [1,  2,  20]  due  to  their  importance  in 
biological  systems.  In  this  study,  we  consider  the 
tautomeric  equilibria  of  2-hydroxypyridine/2- 
pyridone  in  different  solvents  as  shown  below: 


H 


Previous  studies  showed  that  almost  all  dielectric 
continuum  models  correctly  predict  the  differen¬ 
tial  free  energies  of  solvation  for  this  system.  We 
use  this  well-behaved  system  here  mainly  to  test 
the  accuracy  of  GCOSMO  boundary  condition  and 
optimized  atomic  radii  for  solvents  other  than  wa¬ 
ter  by  comparing  our  results  with  experimental 
data  and  with  PCM  calculations  which  employs  an 
exact  Poisson  dielectric  continuum  boundary  con¬ 
dition.  We  refer  readers  to  an  excellent  review  by 
Cramer  and  Truhlar  for  more  discussion  on  the 
accuracy  of  other  continuum  solvation  models  for 
this  system  [2]. 

Gas-phase  geometries  for  both  2-hydroxypyri- 
dine  and  2-pyridone  were  optimized  at  the  sec¬ 


ond-order  Moller-Plesset  perturbation  theory  level 
using  the  6-31G**  basis  set.  These  structures  were 
used  in  solvation  calculations  for  solvents  with 
dielectric  constants  e  equal  to  2,  5,  36,  and  78, 
which  correspond  to  cyclohexane,  dimethyl  ether, 
acetonitrile,  and  water,  respectively.  Free  energies 
of  solvation  were  calculated  by  using  both  PCM 
and  GCOSMO  models  with  our  atomic  radii.  The 
van  der  Waals  surface  was  used  for  the  PCM 
model  as  implemented  in  the  Gaussian  94  pro¬ 
gram.  We  found  that  the  non-electrostatic  terms 
contribute  only  about  0.1  kcal/mol  in  the  calcu¬ 
lated  differential  free  energies  of  hydration,  thus 
they  were  not  included  here. 

Calculated  differential  free  energies  of  solvation 
are  listed  in  Table  I  along  with  experimental  data 

[21] .  For  6  =  5,  the  PCM  surface  charges  do  not 
converge  for  2-hydroxypyridine,  This  convergent 
problem  is  due  to  the  iterative  procedure  for  calcu¬ 
lating  surface  charges.  Recent  closure  relation  and 
matrix  inversion  formulations  of  the  PCM  model 

[22]  may  elevate  this  problem.  For  aqueous  sol¬ 
vent,  GCOSMO  yields  the  error  of  0.8  kcal/mol 
compared  to  the  experiment.  As  expected,  the 
GCOSMO  surface  charge  scaling  approximation 
produces  larger  errors  for  less  polar  solvents.  It 
should  be  noted  that  both  2-hydroxypyridine  and 
2-pyridone  have  hydrogen  bonding  groups  which 
would  have  strong  interactions  with  first-solva- 
tion-shell  waters.  A  good  agreement  with  experi¬ 
mental  data  may  result  from  cancelation  of  errors 
from  specific  hydrogen  bond  effects  in  both  tau¬ 
tomers  that  were  not  included  in  the  present  and 
previous  continuum  calculations.  A  detailed  anal¬ 
ysis  of  modeling  hydrogen  bonding  effects  in  con¬ 
junction  with  the  dielectric  continuum  approach 
will  be  presented  in  a  forthcoming  study.  The 
agreement  between  GCOSMO  and  PCM  results  for 


TABLE  I _ 

Difference  between  free  energies  of  solvation 
(kcal  /  mol)  for  2-hydroxypyridine  and  2-pyridone  in 
solutes  with  different  dielectric  constant  e. 


Model  /  Hamiltonian  /  Basis 

e  =  2 

€  =  5 

6  =  36 

fr\ 

II 

CXI 

GCOSMO /MP2/ 6-31 

2.0 

3.8 

4.9 

5.1 

PCM /MP2/ 6-31 G** 

1.6 

NC*" 

4.5 

4.7 

Experiment® 

1.1 

1.8 

3.8 

4.3 

^Ref.  [21]. 

^  Not  converged. 
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a  wide  range  of  solvents  not  only  validates  the 
boundary  approximation  used  in  GCOSMO  but 
also  opens  the  possibility  for  more  quantitative 
quantum  continuum  solvation  studies. 

HYDRATION  FREE  ENERGIES 

Using  our  recently  optimized  atomic  radii  (H: 
1.172;  C,:  2.096;  C,,:  1.635;  N,,:  1.738;  2.126; 

O:  1.576;  F:  1.28;  P:  2.279;  S:  2.023;  Cl:  1.75)  [11],  we 
have  calculated  hydration  free  energies  for  a  set  of 
39  neutral  molecules  and  5  ions  which  are  of 
interest  in  biochemistry.  GCOSMO  calculations 
were  done  at  the  HF/6-31G*  level  using  the  gas- 
phase  optimized  geometries  at  the  same  level  of 
theory.  For  dispersion  and  repulsion  interactions 
between  the  solute  with  solvent  water  hydrogen 
and  oxygen  atoms,  coefficients  are  taken  from  the 
OPLS  force  field  [23].  Recall  that  in  our  previous 
study  [11]  we  found  rather  small  differences  be¬ 
tween  hydration  free  energies  calculated  at  the  HF, 
DFT,  and  MP2  levels.  Thus,  we  expect  that  DFT 
and  MP2  theories  would  show  the  same  trend  as 
the  present  HF  results.  The  calculated  HF/6-31G* 
hydration  free  energies  are  listed  in  Table  II  along 
with  available  experimental  data.  We  found  that 
GCOSMO  yields  hydration  free  energies  for  neu¬ 
tral  nonaromatic  molecules  with  a  RMS  (root-mean 
square)  difference  with  experiments  of  about  1.3 
kcal/mol.  This  is  consistent  with  our  previous 
finding.  An  interesting  new  result  is  that  for  aro¬ 
matic  molecules,  GCOSMO  tends  to  overestimate 
the  hydration  free  energies  with  a  RMS  error  of 
about  2.6  kcal/mol.  A  careful  examination  of  the 
fitting  procedure  [11]  for  atomic  radii  shows  that 
the  sp^  carbon  radius  (1.635  A)  attempts  to  ac¬ 
count  for  both  hydrogen  bonding  effects  in  acids, 
acetones,  and  amides  as  well  as  resonance  effects 
in  aromatic  compounds.  However,  the  sp^  carbon 
radius  is  biased  toward  the  former  since  it  has 
more  experimental  values  used  in  the  fit.  Conse¬ 
quently,  the  sp^  carbon  radius  is  too  small  and 
effectively  overestimates  the  resonance  effects  for 
aromatic  compounds.  For  ions,  the  RMS  error  is 
about  5  kcal/mol,  which  is  about  the  same  range 
as  in  our  previous  study.  Note  that  the  wave 
functions  for  anions  are  quite  diffuse.  As  a  result, 
there  is  a  non-negligible  portion  of  the  solute  elec¬ 
tron  density  outside  of  the  cavity;  this  yields  con¬ 
sistently  smaller  hydration  free  energies  for  anions 
as  seen  in  Table  II.  Therefore,  more  accurate  fitting 
of  atomic  radii  is  required  with  the  use  of  carefully 
selected  set  of  target  species. 


TABLE  II _ 

Calculated  and  experimental  hydration  free 
energies  (in  kilocalories  per  mole). 


Solutes 

AO  exp 
^^hyd 

Neutral  (nonaromatic) 

Ethanol 

-4.90 

-4.9 

1 -Propanol 

-4.39 

-4.8 

Isopropanol 

-4.46 

-4.8 

Dimethylamine 

-1.77 

-4.3 

Ethylthlol 

-1.32 

-1.3 

Methyl-ethylsulfide 

-1.88 

-1.5 

DImethylsulfide 

-0.94 

-1.4 

Diethylsulfide 

0.25 

-1.3 

Butanone 

-3.89 

-3.6 

2-Pentanone 

-3.70 

-3.5 

3-Pentanone 

-2.82 

-3.4 

Acetaldehyde 

-4.58 

-3.5 

Propionic  acid 

-7.78 

-6.5 

acetamide 

- 1 1 .36 

-9.7 

Propionamide 

-10.35 

-9.4 

A/-methylacetamide 

-8.97 

-10.0 

A/-p-Guanldine 

- 1 1 .61 

-11.0 

Trimethylamine 

0.19 

-3.2 

1 -Propylamine 

-2.89 

-4.4 

2"Methoxyethanol 

-6.70 

-6.8 

2-Methoxypropane 

-0.19 

-2.0 

3-pentanone 

-3.64 

-3.4 

Ethylacetate 

-4.27 

-3.1 

Methyl-butanoate 

-3.39 

-2.8 

Methyl-propanoate 

-4.02 

-2.9 

Methylformate 

-5.40 

-2.8 

N-methylacetamide 

-9.04 

-10.0 

Piperazine 

-5.77 

-7.4 

Pro-2-en-1-ol 

-4.64 

-5.0 

RMS  error 

1.27 

Neutral  (aromatic) 

Benzene 

-2.95 

-0.9 

Toluene 

-2.01 

-0.9 

4-Methylpyridine 

-2.76 

-4.9 

P-cresole 

-7.91 

-6.1 

2-Methylphenol 

-7.72 

-5.9 

Methylindole 

-7.47 

-5.9 

Methylimidazole 

- 1 1 .42 

-10.0 

Aniline 

-7.28 

-4.9 

Acetophenone 

-8.47 

-4.6 

Benzaldehyde 

-9.29 

-4.0 

RMS  error 

2.64 

Ions 

A/-Butylammonium 

-63.6 

-69 

Methyl-thiol  ion 

-72.4 

-74 

Propionate  ion 

-73.4 

-79 

P-cresole  ion 

-66.5 

-75 

Methyllmidazolium 

-66.9 

-64 

RMS  error 

5.4 
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SOLVENT  EFFECTS  ON  POTENTIAL  ENERGY 
SERFAGE  FOR  THE  PROTON  TRANSFER  IN 

HsN..HNH3+ 

Proton  transfer  reactions  are  fundamentally  im¬ 
portant  in  many  chemical  and  biological  processes 
and  are  the  subjects  of  numerous  theoretical  and 
experimental  studies.  Since  proton  transfer  in  bio¬ 
logical  systems  occurs  under  the  influence  of  the 
protein  and  aqueous  environments,  environmental 
effects  are  of  profound  importance  for  modeling  its 
reaction  mechanisms.  In  this  study,  we  provide  an 
analysis  of  the  potential  energy  surface  of  the 
proton  transfer  in  the  H3N...HNHJ  system  in 
aqueous  solution,  which  is  often  being  used  as  a 
model  for  proton  transfer  in  biological  systems.  In 
particular,  McCammon  and  co-workers  [24]  have 
constructed  an  analytical  potential  function  for  the 
gas-phase  reaction  based  on  the  MP2/6-31G'^  ener¬ 
gies  and  pointed  out  that  the  N-N  vibration  of  the 
donor  and  acceptor  fragments  has  a  noticeable 
effect  on  the  barrier  to  proton  transfer.  Gao  [15] 
has  performed  hybrid  QM /MM  simulations  based 
on  the  AMI  semiempirical  Hamiltonian  for  the 
quantum  region  to  obtain  a  two-dimensional  po¬ 
tential  energy  surface  for  this  reaction  and  noted 
that  hydration  effects  further  raise  the  barrier  at 
large  N-N  distances. 

Geometries  of  the  ion-dipole  complex  and  sad¬ 
dle  point  were  fully  optimized  at  the  MP2/6-31G*'^ 
level  of  theory  in  both  the  gas  phase  and  in  aque¬ 
ous  solution.  Optimized  geometrical  parameters 
and  barrier  heights  are  listed  in  Table  III.  We 
found  that  the  aqueous  environment  has  a  signifi¬ 
cant  effect  on  the  geometry  of  the  ion- dipole  com¬ 
plex  due  to  the  large  total  dipole  but  has  a  little 
effect  on  the  structure  of  the  zero-dipole  transition 
state,  even  though  the  difference  in  hydration  en¬ 
ergies  of  the  complex  and  transition  state  is  only 
1.5  kcal/mol.  In  particular,  the  NH^  bond  distance 
(R^)  of  the  ammonium  shortens  from  1.134  to 
1.058  A  while  the  hydrogen  bond  (R2)  elongates 
from  1.533  to  1.801  A  when  transferring  from  the 
gas  phase  to  aqueous  solution.  This  is  consistent 
with  previous  finding  on  the  similar 
[H20..H,.0H2]^  system  using  the  PCM  solva¬ 
tion  model  [25].  As  pointed  out  in  previous  studies 
[15,  25],  the  charge  distribution  is  more  delocalized 
at  the  transition  state;  consequently  the  transition 
state  is  less  solvated  compared  to  that  of  the 
ion-dipole  complex.  This  leads  to  an  increase  in 
the  proton  transfer  barrier  by  1.5  kcal/mol  from  its 
values  in  the  gas  phase  of  0.45  kcal/mol.  Similar 


TABLE  III _ 

Geometrical  parameters  (bond  lengths  in 
angstroms  and  angles  in  degrees)  of  stationary 
points  and  barrier  heights  (kilocaiories  per 
mole)  to  proton  transfer  in  both  the  gas 
phase  and  aqueous  soiution. 


f5 

H  • 

R2 

A 

H 

H 

Gas  phase 

Solution 

Complex 

TS 

Complex 

TS 

1.134 

1.298 

1.058 

1.296 

FI2 

1.533 

1.801 

a 

110.7 

111.8 

109.9 

111.7 

p 

112.7 

112.8 

AEorAAG  0.0 

0.45 

0.0 

1.94 

^^solv 

70.4 

68.9 

increase  of  1.24  kcal/mol  was  found  for  the  barrier 
of  the  proton  transfer  in  the  [H2O . .  H . .  OH2]^ 
system  using  the  PCM  solvation  model  [25].  How¬ 
ever,  our  result  is  a  factor  of  2  smaller  than  Gao's 
prediction  of  3  kcal/mol  using 
QM/MM-AMl /OPLS  approach  [15].  This  differ¬ 
ence  could  result  from  several  factors.  One  is  the 
solvent-solute  hydrogen  bond  effects  which  is 
treated  in  an  average  manner  in  our  continuum 
calculations.  But  including  specific  first-solvation- 
shell  hydrogen  bond  effects  in  a  discrete  contin¬ 
uum  model  was  found  to  lower  the  proton  transfer 
barrier  for  the  [H2O . .  H . .  OH2]^  system  [25],  The 
other  factor  is  the  difference  in  the  solute  elec¬ 
tronic  wave  functions,  MP2  in  our  calculations  as 
compared  to  AMI  in  Gao's  simulations  [15].  How¬ 
ever,  as  mentioned  earlier,  our  previous  study  [12] 
showed  that  solvation  energy  is  not  very  sensitive 
to  the  quality  of  the  solute  wave  function  (though 
only  HP,  MP2,  and  DPT  levels  were  considered). 
More  study  is  certainly  needed  to  sort  out  the 
origins  of  this  discrepancy. 

To  provide  a  quantitative  understanding  of  the 
hydration  effects  on  the  potential  energy  surface  of 
this  reaction  particularly  along  the  N-N  stretching 
mode,  we  have  calculated  the  potential  curves  for 
proton  transfer  between  two  nitrogen  atoms  sepa¬ 
rated  by  distances  Rjvjn  ==  2.5,  2.75,  and  3  A,  re¬ 
spectively,  both  in  the  gas  phase  and  in  aqueous 
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solution.  Since  the  spectator  bond  distances 
and  HNN  angles  remain  nearly  constant  about 
1.02  A  and  111.6°,  respectively,  at  the  reactant  and 
transition  state,  they  are  assumed  to  be  fixed. 
These  potential  curves  plotted  versus  the  reaction 
coordinate  are  shown  in  Figure  1.  The  reaction 
coordinate  used  here  is  defined  as  the  difference  in 
the  distances  from  the  transferring  proton  to  the 
acceptor  and  the  donor  nitrogens.  In  general,  our 
results  agree  well  with  those  obtained  from 
QM/MM  Monte  Carlo  simulations  by  Gao.  In  par¬ 
ticular,  at  Kjvjv  ^  A,  the  free  energy  surface  is 
barrierless  to  proton  transfer.  However,  as  the  N-N 
distance  increases  the  barrier  to  proton  transfer 
also  increases  but  at  a  faster  rate  in  the  presence  of 
aqueous  solvent.  Furthermore,  the  ion- dipole 
complex  is  more  stabilized  by  the  aqueous  solvent 
as  the  N-N  distance  increases  making  the  complex 
minimum  flatten  out  relative  to  that  of  the  gas- 
phase  minimum.  These  features  have  been  dis¬ 
cussed  in  details  by  Gao.  The  general  agreement 
between  our  results  and  those  from  Gao's 
QM/MM  simulations  [15]  leads  to  an  important 
conclusion  that  the  simple  and  efficient  GCOSMO 
model  can  be  used  effectively  for  reaction  profile 
calculations  instead  of  expensive  simulations.  This 
allows  one  to  use  more  sophisticated  quantum 
mechanical  methods  that  is  essential  for  quantita- 


FIGURE  1. 


five  description  of  bond-breaking  and  -forming 
processes  in  solution  and  to  consider  realistic  re¬ 
acting  systems  with  reasonable  computational  cost. 
In  addition,  the  availability  of  GCOSMO  first  and 
second  energy  derivatives  allows  one  to  study 
solvent  effects  on  the  reaction  coordinates  which 
have  not  been  addressed  in  the  past.  Such  effects 
will  be  discussed  in  a  forthcoming  study. 


Conclusion 

The  GCOSMO  continuum  solvation  model  has 
shown  considerable  promise  in  modeling  equilib¬ 
ria,  structure,  and  reactivity  of  reactions  in  solu¬ 
tions.  In  many  cases,  like  tautomeric  equilibrium 
in  2-hydroxypyridine/2-pyridone  and  proton 
transfer  reaction  in  the  H3N-H-HN/  system 
studied  here,  the  use  of  the  dielectric  continuum 
GCOSMO  model  is  justified,  as  shown  by  compar¬ 
ison  with  more  rigorous  PCM  and  QM/MM  re¬ 
sults.  In  such  cases,  the  main  advantage  of  the 
GCOSMO  model  is  the  computational  efficiency  in 
calculating  solvation  free  energy  and  its  deriva¬ 
tives.  The  accuracy  of  about  1  kcal/mol  for  hydra¬ 
tion  free  energies  of  neutral  nonaromatic  solutes 
and  slightly  larger  for  aromatic  solutes  and  ions  is 
comparable  to  the  more  rigorous  PCM  model.  More 
work  is  needed  to  improve  the  atomic  radii  for 
defining  the  cavity  and  to  include  specific  hydro¬ 
gen  bonding  effects.  Extending  this  model  to  study 
solvent  effects  on  electronic  absortion  and  vibra¬ 
tional  spectra  is  currently  under  way  in  our  labora¬ 
tory. 
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ABSTRACT  _ _ 

Multireference  configuration  interaction  wave  functions  with  single  and  double 
excitations  were  calculated  for  the  ground  state  of  the  C2  molecule  and  the  excited 
states  of  C2  with  symmetries  +  ,  and  ^11^.  The  corresponding  7r„, 

and  77^  valence  Dyson  orbitals  were  calculated.  Most  of  the  density  due  to  the  valence 
electrons  is  accounted  for  by  three  one  and  one  degenerate  pair  of  7r„  Dyson 
orbitals.  Electron  correlation  plays  an  important  role  in  the  bond  strength  of  C2  by 
increasing  the  occupation  of  the  valence  orbitals  and  decreasing  the  occupation  of  the 
and  77^  valence  orbitals.  ©  1996  John  Wiley  &  Sons,  Inc. 


Introduction 

A  number  of  experimental  [1]  and  theoretical 
[2-11]  studies  of  C2  were  conducted  to 
determine  potential  energy  curves,  spectro¬ 
scopic  constants,  electron  affinity,  and  ionization 
potentials  and  to  identify  the  energy  positions  of 
several  of  its  excited  states.  The  ^2^  ground 
state,  KK(2(T^y(2aJ^(l7Tj'^,  is  only  slightly 
below,  0.09  eV,  the  ^11  excited  state, 
KK(2a^maJ\l7Tj\3a^y  [12]. 


The  experimental  bond  energy,  D^,  for  C2  is 
0,234  au  [12],  but  the  Hartree-Fock  value  is  only 
0.029  au  [10].  The  near  degeneracy  of  the  2cr^^,  3 
iTTy,  and  Itt^  orbitals  and  the  large  spatial  overlap 
of  the  2a-„  and  3o-^  orbitals  are  thought  to  con¬ 
tribute  to  the  stability  of  C2  [2].  The  conventional 
bond-order  analysis  of  the  self-consistent-field 
(SCF)  single-configuration  wave  function 

leads  to  the  prediction  that  C2  has  a  double  77 
bond.  Configuration  interaction  (Cl)  wave  func¬ 
tions  for  C2  typically  have  a  coefficient  of  about 
0.88  for  the  primary  configuration  and  a  coeffi¬ 
cient  of  about  0.32  for  the  configuration 
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KK(2c7-^)^(l7rj^)^(3a^)^.  One  effect  of  the  correlated 
wave  function  is  to  increase  the  amount  of  cr 
bonding  in  C2. 

Dyson  orbitals  (DOs),  also  called  generalized 
overlap  amplitudes  and  Feynman-Dyson  ampli¬ 
tudes,  were  used  for  a  qualitative  analysis  of 
chemical  bonding  in  oligosilanes  [13]  and  carbon 
clusters  [14].  In  this  article,  we  examine  the  chemi¬ 
cal  bonding  in  C2  using  DOs. 

Dyson  spin  orbitals  (DSOs)  can  be  defined  as 
[15,  16] 

g,(x)  = 

^  2  •  •  •  ^  \  /  Cl) 

where  represents  the  spatial  and  spin  coordi¬ 
nates  of  the  ith  electron.  The  serve  as  nor¬ 

malization  constants  in  Eq.  (1)  and  can  be  inter¬ 
preted  as  occupation  numbers  when  expanding 
the  density  in  terms  of  gf. 

pix)  =  I3OT,|g,(x)|^.  (2) 

i 

The  DSO  occupation  numbers,  my,  sum  to  N,  the 
number  of  electrons.  In  general,  they  are  not  equal 
to  the  natural  spin  orbital  (NSO)  occupation  num¬ 
bers. 

The  DSOs  can  be  written  as  the  product  of  a 
spatial  part  times  a  spin  part  gy(x)  ^  S',(r)cry.  For 
the  singlet  state  considered  here,  gy(r)  is  the  same 
for  both  a  and  ^  spin  parts.  We  refer  to  the  ^^(r) 
as  the  DOs  and  will  refer  to  DSOs,  the  ^y(x), 
mainly  with  reference  to  discussing  their  occupa¬ 
tion  numbers. 

The  N-electron  wave  function  can  be  expanded 
in  terms  of  the  DOs  and  the  wave  functions  for  the 
(N  -  l)-electron  states: 

(3) 

The  gy(r)  are  the  orbitals  occupied  by  one  electron 
if  the  others  occupy  an  eigenfunction  of  the  ionic 
system  of  (N  -  1)  electrons.  The  DOs  are  physi¬ 
cally  meaningful,  since  the  square  of  the  Fourier 
transform  into  momentum  space  of  individual  or¬ 
bitals  can  be  measured  by  electron  momentum 
spectroscopy  [17]. 


Although  certain  DOs  with  the  same  spin  are 
predicted  to  have  the  same  long-range  decay  [18, 
19]  related  to  the  lowest  ionization  potential  (IP), 
calculations  on  the  Li  atom  suggest  that  they  may 
have  intermediate-range  decays  which  depend  on 
their  associated  IPs  [20].  DOs  with  higher  IPs  tend 
to  decay  more  rapidly  at  intermediate  distances. 
The  most  diffuse  DOs  appear  to  be  associated  with 
the  lower  states  of  the  ion.  DOs  associated  with 
higher  excited  states  of  the  ion,  higher  IPs  of  the 
N-electron  system,  appear  to  be  more  compact,  to 
have  fewer  nodes  and  to  decay  more  rapidly.  DOs 
associated  with  higher  IPs  for  H2  contribute  to 
building  up  the  electron  density  at  the  nucleus  and 
in  the  bonding  region  [21]. 


Calculations  and  Results 

Multireference  configuration  interaction  with 
single  and  double  excitations  (MRSDCI)  wave 
functions  were  calculated  for  C2  and  for  the  ex¬ 
cited  states  of  C2  having  symmetries  ^2^1,  ^2”/ 
^rii/,  and  Calculations  were  performed  at  the 
C2  equilibrium  internuclear  distance  =  2.348 
bohr  using  the  MELD  [22,  23]  programs  which 
include  a  program  for  calculating  DOs.  The  basis 
set  consisted  of  14s,  12p,  3d,  2/,  and  Ig  Cartesian 
Gaussian  functions.  All  Cartesian  components  were 
kept.  The  14s,  12  p,  and  3d  orbitals  are  taken  from 
Wang's  228-CGTO-2  basis  set  [24]  and  the  2/  and 
Ig  functions  are  the  correlated  orbitals  for  carbon 
from  Dunning  [25].  The  SCF  energy  in  this  basis 
set  is  —  75.4065  au. 

Configurations  in  the  MRSDCI  wave  functions 
were  dropped  if  their  second-order  perturbation 
theory  contributions  were  less  than  1  X  10“^.  This 
resulted  in  51,105  configurations  for  the  C2  ground 
state,  33,595  configurations  for  the  ^2^  states  of 
CJ ,  42,934  configurations  for  the  ^2”  states  of  C J , 
35,044  configurations  for  the  ^Uu  states  of  CJ ,  and 
57,222  configurations  for  the  ^11^  states  of  CJ 
using  D2},  symmetry.  The  total  computed  Cl  en¬ 
ergy  of  C,  is  —75.83235  au,  which  accounts  for 
82%  of  the  correlation  energy.  (The  exact  total 
energy  is  estimated  to  be  —  75.9255  au  using 
—  37.8450  au  for  the  total  nonrelativistic  energy  of 
the  carbon  atom  [26]  and  the  experimental  dissoci¬ 
ation  energy  D^,  =  147.8  kcal/mol  [27].) 

The  highest  NSO  [28]  occupation  numbers  for 
the  ground  state  of  C2  are  listed  in  Table  I.  The 
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TABLE  I _ 

NSO  occupation  numbers. 


o-g 

1 

0.9993 

0.9680 

0.9993 

0.0251 

2 

0.9875 

0.0031 

0.8527 

0.0009 

3 

0.1296 

0.0012 

0.0053 

4 

0.0031 

0.0014 

5 

0.0031 

0.0008 

6 

0.0019 

2*  sum® 

4.2688 

3.8892^’ 

3.7186 

0.1040'" 

®The  factor  of  2  accounts  for  both  a  and  (B  spin  types. 
^The  extra  factor  of  2  accounts  for  the  spatial  degeneracy  of 
the  77  orbitals. 


large  amount  of  correlation  between  the 
KUla-^niajKlTT.y  and  KK(2crp2(l7r„A3orp2 
configurations  is  illustrated  by  the  high  occupation 
number  of  0.1396  for  the  3cr^  NSO  and  the  low 
occupation  number  of  0.8527  for  the  2a;^  NSO. 
These  occupation  numbers  would  be  expected  to 
be  near  0  and  1.0,  respectively,  for  a  wave  function 
that  had  little  correlation.  The  sum  of  the  spin- 
traced  occupation  numbers  is  about  4.3  for  the 
natural  orbitals  (NOs)  and  about  3.7  for  the  o-y 
NOs.  The  occupation  of  the  7r„  NSO,  while  down 
slightly,  is  not  reduced  nearly  as  much  as  the  2cr^ 
NSO.  Electron  correlation  has  the  effect  of  con¬ 
tributing  to  the  stability  of  the  molecule  by  reduc¬ 
ing  the  density  contribution  of  the  antibonding 
orbitals. 


The  number  of  Dyson  orbitals  that  could  be 
determined  for  each  symmetry  of  the  C2  ion  is  the 
same  as  the  number  of  configurations  in  the  wave 
function,  many  more  than  the  number  of  SCF 
orbitals  in  a  finite  basis  set.  We  label  DOs  by  the 
symbol  nXm,  where  X  is  the  symmetry  label,  n  is 
the  principal  SCF  quantum  number  within  a  given 
symmetry  type,  and  m  is  an  index  which  orders 
DOs  according  to  their  IP  within  a  given  n  and  X. 
The  first  o*^  DO  is  3o-^l,  the  second  is  3<t^2,  the 
third  is  2cr^l,  and  so  on. 

The  first  13  IPs  and  DSO  occupation  numbers 
are  listed  in  Table  II  for  the  cr^,  and  7r„ 

orbital  symmetries.  All  the  7T^^  DOs  listed  in  Table 
II  are  l7r„m,  all  listed  are  and  all  listed 

77^  are  iTr^m.  DOs  corresponding  to  the  core  or¬ 
bitals  lo"^  and  IcTj^  were  not  calculated  because 
they  correspond  to  excited  states  of  the  ion  that 
were  higher  than  we  were  able  to  compute.  The  six 
DOs  that  make  significant  contributions  to  the 
valence  density  are  the  3o-^2,  2cr^3,  and  2c7-„l 

and  the  two  Itt^I  DOs. 

If  the  wave  functions  had  coefficients  near  1  for 
the  dominant  configurations  and  coefficients  near 
0  for  the  others,  the  occupation  numbers  for  both 
the  NSOs  and  the  DSOs  would  be  near  0  and  1. 
The  largest  occupation  number  for  the  DSO  is 
0.9237,  near  1,  while  the  largest  occupation  num¬ 
ber  for  the  2a^  DSOs  is  0.8076.  Three  of  the 
valence  DSOs  which  have  significant  occupancy 
are  the  3or^2  DSO  with  an  occupation  number 


TABLE  II _ _ _ 

IPs  and  DSO  generalized  overlap  amplitudes  (GOA)  occupation  numbers  for  C2  tor  R  =  2.348  au. _ 

DO  IP(o-g)  m(o-g)  IP(o-J  m(aj _ IPUJ _ _ IPUg)  ming) 


1 

0.5911 

0.0009 

2 

0.8519 

0.1335 

3 

1.0593 

0.2179 

4 

1.1756 

0.0199 

5 

1 .2287 

0.4261 

6 

1 .2599 

0.0084 

7 

1 .2972 

0.0285 

8 

1 .3372 

0.0110 

9 

1.3516 

0.0657 

10 

1 .3687 

0.0043 

11 

1.3812 

0.0022 

12 

1 .4030 

0.0004 

13 

1 .4340 

0.0002 

Lnij 

0.9190 

0.5867 

0.8076 

0.4932 

0.8371 

0.0041 

0.7459 

1.1032 

<  10"^ 

0.8487 

1.1922 

0.0008 

0.9919 

1.2127 

0.0034 

1 .0370 

1 .2937 

0.0005 

1 .0606 

1 .3091 

0.0010 

1.1346 

1.3647 

0.001 1 

1.1910 

1 .3725 

<  10“^ 

1.2024 

1 .3774 

<  10"^ 

1.2310 

1 .3864 

0.0002 

1 .2538 

1.4198 

<  lO”'^ 

1 .2837 

1 .4253 

0.0009 

0.8196 

1 .3086 

0.9237 

0.5677 

0.0038 

0.0001 

0.7372 

0.0013 

0.0002 

0.9107 

0.0013 

0.0003 

0.9574 

0.0003 

0.0019 

1.1457 

0.0051 

<  10“^ 

1.1907 

0.0002 

0.0001 

1.2560 

<  10"^ 

0.0002 

1.2665 

0.0004 

<  10“^ 

1 .2685 

<  10”" 

<  10~^ 

1.2863 

<  10"^ 

0.0009 

1 .2957 

0.0001 

0.001 1 

1.3012 

<  10"^ 

0.0023 

1.3167 

0.0003 

0.9308 

0.0128 
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0.1335,  the  2cr^l  DSO  with  an  occupation  number 
of  0.2179,  and  the  7a*^3  DSO  with  an  occupation 
number  of  0.4261.  The  other  valence-shell  DSOs 
have  occupation  numbers  less  than  0.1. 

The  perspective  plots  of  the  3(t^2,  and 

2<t^,3  DOs  are  shown  in  Figures  1-3,  respectively. 
The  3cr^2  DO  is  similar  to  the  3o-^,  SCF  virtual 
orbital.  The  2cr^l  DO  and  2cr^3  DO  are  similar  to 
the  2(t^  SCF  orbital.  The  3cr^2,  and  2(7^3  DOs 
have  amplitudes  of  0.4187,  0.3945,  and  0.3647,  re¬ 
spectively,  at  the  bond  midpoint  compared  to  val¬ 
ues  of  0.3705  and  0.2816  for  the  2a-^  and  3o-^  SCF 
orbitals.  The  3o^,2  DO  bond-midpoint  amplitude  is 
significantly  higher  than  is  the  bond-midpoint  am¬ 
plitude  for  the  3cr^  SCF  orbital,  an  indication  that 
occupation  of  the  3  0-^,2  DO  would  contribute  to  the 
bond  strength  more  than  would  that  of  the  3  a 
SCF  orbital. 

The  overlaps  of  the  valence  DOs  with  the  se¬ 
lected  SCF  orbitals  are  given  in  Table  III.  The  3cr^l 
DO  is  a  mixture  of  -0.2598  2a^  and  0.8846  3a^ 
SCF  orbitals  and  contains  significant  contributions, 
>  0.1,  from  higher  SCF  virtual  orbitals.  The  3cr^^2 
DO  is  a  mixture  of  0.4645  2a^  and  0.8707  3cr^  SCF 
orbitals.  The  2cr^4  and  2cr^3  DOs  are  primarily  the 
2(t^  SCF  orbital.  The  2a-„l  DO  is  essentially  the  2cr„ 
SCF  orbital  and  the  Itt^^I  DO  is  essentially  the  Itt^^ 
SCF  orbital.  The  a^  DOs  gain  increasing  amounts 
of  2cr,  SCF  character  going  across  Table  III  from 
3cr,l  to  2a^3. 

The  sums  of  the  first  13  DO  occupation  num¬ 
bers  for  each  symmetry  are  0.9308  for  the  7r„  DOs, 
0.9190  for  the  cr^  DOs,  0.8196  for  the  a^^  DOs,  and 
0.0128  for  the  DOs.  These  sums  are  for  only  a 
very  small  fraction  of  the  DOs,  but  they  are  consis¬ 
tent  with  the  increased  a^  NO  occupancy  and 


FIGURE  2.  Perspective  plot  of  the  2 0-^1  DO. 


decreased  NO  occupancy  compared  to  the  SCF 
orbitals. 

One  indication  of  an  orbital's  contribution  to 
bonding  is  the  fraction  of  its  density  that  lies  in  the 
bond.  We  calculated  partially  integrated  orbital 
densities  by  integrating  the  orbital  density  over 
the  X  and  y  coordinates,  taking  the  internuclear 
axis  to  be  along  the  z  axis.  This  gives  the  total 
density  in  the  xy  plane  at  each  point  along  the 
internuclear  axis. 

The  partially  integrated  densities  of  the  30-^2, 
2cr^l,  and  2o-^3  DOs  are  shown  in  Figure  4  and  the 
partially  integrated  densities  of  the  2a;^l  and 
DOs  are  shown  in  Figure  5.  There  are  small  peaks 
at  the  nuclear  positions  for  the  2o-^l  DO,  the  2cr^3 
DO,  and  the  2a;^l  DOs.  The  highest  peaks  of  the 
partially  integrated  densities  are  in  the  bonding 
region  for  the  a^  and  orbitals  and  are  outside 
the  bonding  region  for  the  antibonding  2a^^l  DO. 


FIGURE  1.  Perspective  plot  of  the  3  0-^2  DO.  FIGURES.  Perspective  plot  of  the  2  0-^3  DO. 
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TABLE  III _ 

Overlap  of  the  DOs  with  the  SCF  orbitals. 


4> 

3o-g1  DO 

3o-g2  DO 

2o-g1  DO 

2<Tg2  DO 

2o-g3  DO 

m 

0.0009 

0.1335 

0.2179 

0.0199 

0.4261 

<1crgSCF|<;>> 

0.0029 

-0.0034 

-0.0016 

0.9  X  10”^ 

0.0020 

<2o-gSCF|<^> 

-0.2598 

0.4645 

0.9983 

0.9996 

0.9998 

<3(7gSCF|<^> 

0.8846 

0.8707 

0.0323 

-0.0169 

0.0123 

</> 

DO 

m 

0.9237 

<l77-uSCF|<^> 

0.9999 

</> 

2o-J  DO 

m 

0.8076 

(2a,SCF\<l>) 

0.9993 

In  Eq.  (2X  the  total  density  is  obtained  by  multi¬ 
plying  the  DO  densities  by  their  respective  occu¬ 
pation  numbers  and  summing.  The  partially  inte¬ 
grated  DO  densities  times  their  occupation 
numbers  are  plotted  in  Figure  6  for  the  3cr^2, 
and  20-^3  and  in  Figure  7  for  the  2a-j^l  and 
DOs.  All  these  orbitals  make  significant  contribu¬ 
tions  to  the  electron  density  and  the  and  Itt^ 
DOs  contribute  to  building  up  the  density  in  be¬ 
tween  the  two  atoms.  The  3(t^2  DO  is  not  expected 


to  contribute  to  the  bond  strength  as  much  as  do 
the  two  2<t^  Dyson  orbitals  because  it  contains  a 
large  component  of  the  3a^  SCF  orbital,  but  it  has 
a  smaller  occupation  number  and  makes  a  smaller 
contribution  to  the  total  density  than  that  of  the 
other  two  Dyson  orbitals. 

Bader  et  al.  [29]  analyzed  the  SCF  density  of  C2 
and  other  diatomic  molecules  in  terms  of  Berlin's 
[30]  binding  and  antibinding  regions.  Electron 
density  in  the  binding  region  produces  attractive 


FIGURE  4.  Partially  integrated  densities  of  (dashed  line)  the  3 0-^2  DO,  (solid  line)  the  2ag\  DO,  and  (dashed-dotted 
line)  the  20-^3  DO.  Vertical  dotted  lines  indicate  atomic  positions. 
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z 


FIGURE  5.  Partially  Integrated  densities  of  (dashed  line)  the  2o-J  DO,  and  (solid  line)  the  I77J  DO.  Vertical  dotted 
lines  indicate  atomic  positions. 


forces  on  the  nuclei,  pulling  them  together,  whereas 
electron  density  in  the  antibinding  region  pushes 
the  nuclei  apart.  The  calculated  fraction  of  the 
density  in  the  binding  region  for  the  SCF  density 
of  C2  is  0.60  [29].  The  SCF  binding  density  fraction 


computed  with  the  current  basis  is  0.602  and  the 
binding  density  fraction  computed  from  the  Cl 
density  is  0.598  for  C2.  They  are  essentially  the 
same,  indicating  that  the  additional  bond  strength 
determined  by  the  Cl  calculation  does  not  come 


o' -  ■  +  -  ■  —  I — - 1 - i -r  —  - 1 _ I 

—6  —4  —2  0  2  4  6 


z 


FIGURE  6.  Partially  integrated  densities  times  their  occupation  numbers  for  (dashed  line)  the  3cTg2  DO,  (solid  line)  the 
2crg1  DO,  and  (dashed-dotted  line)  the  2(7^3  DO.  Vertical  dotted  lines  indicate  atomic  positions. 
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FIGURE  7.  Partially  integrated  densities  times  their  occupation  numbers  for  (dashed  line)  the  1cr„1  DO  and  (solid  line) 
the  iTrJ  DO.  Vertical  dotted  lines  indicate  atomic  positions. 


from  transfer  of  electron  density  from  the  antibind¬ 
ing  region  to  the  binding  region.  The  difference 
between  the  Cl  density  and  the  SCF  density  is 
shown  in  Figure  8.  The  Cl  density  is  more  con¬ 
tracted  than  is  the  SCF  density  all  along  the  bond 


axis  except  at  the  nuclei,  where  it  is  reduced.  The 
additional  bond  strength  associated  with  the  Cl 
density  appears  to  come  from  a  higher  density 
along  the  bond  axis  rather  than  from  a  higher 
density  in  the  Berlin  binding  region. 


FIGURE  8.  Density  difference  map  for  p(CI)-p(SCF)  for  Cg. 
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TABLE  IV 

Binding  density  fractions  for  Dyson  and  SCF  orbitals. 

Orbital 

Fraction 

3ag2  DO 

0.5988 

2ag\  DO 

0.8712 

20^3  DO 

0.8571 

2  0-^1  DO 

0.2267 

177^1  DO 

0.7388 

10-g  SCF 

0.5078 

1o-„  SCF 

0.5074 

2(Tg  SCF 

0.8498 

2o-„  SCF 

0.2488 

3o-g  SCF 

0.2710 

l77-„  SCF 

0.7499 

The  binding  density  fractions  for  the  SCF  or¬ 
bitals  and  selected  DOs  are  listed  in  Table  IV.  The 
lo-g  and  lo;,  SCF  orbitals  have  binding  density 
fractions  of  about  0.5,  consistent  with  their  non¬ 
bonding  character.  The  SCF  orbital  has  a  bind¬ 
ing  density  fraction  of  0.8498,  the  antibonding  2cr„ 
SCF  orbital  has  a  binding  density  fraction  of  0.2488, 
and  the  3cr^  SCF  orbital  has  a  binding  density 
fraction  of  only  0.2710.  The  equilibrium  bond  dis¬ 
tance  of  the  'S;,  state 

of  C2  is  2.396  bohr,  about  0.048  bohr  greater  than 
the  equilibrium  bond  distance  of  the  ground  state 
of  C2.  This  is  consistent  with  the  low  binding 
density  fraction  of  the  3o-^  SCF  orbital  and  is  an 
indication  that  occupation  of  this  orbital  weakens 
the  bond.  The  3o-^2,  the  2cr^l,  and  20-^3  DOs  have 
binding  density  fractions  of  0.5988,  0.8712,  and 
0.8571,  respectively,  and  occupation  of  these  or- 
bitals  is  expected  to  contribute  to  the  bond  strength. 


Conclusions 

Electron  correlation  plays  a  significant  role  in 
chemical  bonding  in  the  C2  molecule.  An  exami¬ 
nation  of  the  NSO  occupation  numbers  shows  an 
increase  in  the  occupation  of  the  NSOs  and  a 
decrease  in  the  occupation  of  the  a;,  and  77^^  NSOs. 
The  net  effect  is  to  increase  the  density  along  the 
bond  axis. 

The  occupation  of  the  valence  DOs  is  spread 
over  three  main  DOs,  alfof  which  contribute  to 
building  up  the  electron  density  in  the  binding 


region.  Occupation  of  the  3o-^l  DO  places  in¬ 
creased  density  at  the  bond  midpoint.  Occupation 
of  DOs  which  build  up  electron  density  in  the 
binding  region  and  at  the  bond  midpoint  goes 
hand  in  hand  with  increased  bond  strength  due  to 
electron  correlation. 
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ABSTRACT _ 

Ab  initio  electron-correlated  calculations  of  the  equilibrium  geometries,  dipole  moments, 
and  static  dipole  polarizabilities  are  reported  for  benzene  and  12  heteroaromatic  six- 
membered  rings  obtained  from  it  by  aza-substitution.  Our  geometries  and  dipole  moments 
agree  well  with  available  experimental  microwave  determinations.  The  polarizabilities 
are  in  reasonable  agreement  with  the  fragmentary  experimental  data  available.  Uncoupled 
Hartree-Fock  calculations  indicate  that  as  much  as  half  the  polarizability  comes  from  the 
cr-electrons.  Simple  empirical  formulas  based  on  atom-  and  bond-additive  models 
correlate  the  calculated  polarizabilities  of  33  five-  and  six-membered  heteroaromatic  rings 
(10  azoles,  10  oxazoles,  13  azines)  quite  well.  The  correlation  improves  significantly  if 
systematic  data  of  uniform  quality  are  used.  ©  1996  John  Wiley  &  Sons,  Inc. 


Introduction 

Polarizabilities  determine  long-range  inter- 
molecular  induction  and  dispersion  forces, 
low-energy  electron-molecule  scattering  cross  sec¬ 
tions,  Langevin  capture  cross  sections  in  ion-neu¬ 
tral  collisions,  and  various  phenomena  such  as 
collision-induced  spectral  line  shifts  [1,  2].  We  have 
begun  a  systematic  study  of  the  polarizabilities  of 
heteroaromatic  molecules  [3-6]  to  build  a  sizable 
body  of  accurate  polarizability  data  for  such 
molecules,  to  examine  the  utility  and  limits  of 
simple  models  for  these  polarizabilities,  and  to 

"^To  whom  correspondence  should  be  addressed. 


study  the  connection  between  polarizability  and 
aromaticity.  We  have  previously  studied  four  fam¬ 
ilies  of  heteroaromatic  molecules:  azines  [3],  azoles 
[4],  oxopurines  [5],  and  oxazoles  [6].  However,  our 
study  of  the  azines  (aza-benzenes)  [3]  differs  from 
our  subsequent  ones  [4-6]  in  that  (i)  a  mix  of 
experimental  and  calculated  geometries  was  used 
rather  than  a  consistent  set  of  calculated  geome¬ 
tries,  (ii)  a  poorer  basis  set  was  used  for  some  of 
the  polarizability  calculations,  (hi)  5  of  the  12  pos¬ 
sible  aza-substitutions  were  not  considered,  (iv) 
additive  models  were  not  reported,  and  (v)  no 
attempt  was  made  to  separate  the  o*-  and  7r-elec- 
tron  contributions  to  the  polarizability. 

The  purpose  of  this  study  is  to  revisit  the  azines 
and  remove  these  shortcomings.  We  report  sys¬ 
tematic,  electron-correlated,  ab  initio  calculations 
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of  the  equilibrium  geometries,  dipole  moments, 
and  static  dipole  polarizabilities  of  benzene  and  all 
12  aza-benzenes.  Uncoupled  Hartree-Fock  calcula¬ 
tions  are  used  to  determine  the  fraction  of  the 
mean  polarizability  that  arises  from  the  77-elec¬ 
trons.  Additive  and  other  simple  models  for  the 
polarizabilities  of  the  13  azines,  10  azoles,  and  10 
oxazoles  are  discussed  and  empirical  formulas  are 
given  that  fit  the  polarizabilities  of  these  33 
molecules  quite  well. 


The  Kerr  anisotropy  and  the  in-plane  anisotropy, 
respectively,  are 


A2a  = 


(ttj  -  a2)^  +  (a2  -  +  (ot^  “  otif' 


2  1V2 


(3) 


Ajtt  =  [(A2a)^  -  (Ajaf]  ''  =  -^(03  - 


v/3 


(4) 


Methods  and  Deflnitions 

Only  a  concise  summary  of  our  methods  is 
given  here  because  full  details  were  given  in  pre¬ 
vious  work  [4]  which  used  the  same  computa¬ 
tional  methods.  Equilibrium  geometries  for  all  13 
azines  were  computed  using  all-electron,  second- 
order  Moller-Plesset  (MP2)  perturbation  theory  [7, 
8]  and  the  6-31G*  basis  set  [9].  The  dipole  mo¬ 
ments  and  polarizabilities  were  calculated  by  the 
finite-field  method  [10,  11]  using  self-consistent- 
field  (SCF)  and  valence  MP2  field-dependent  ener¬ 
gies  computed  with  GAUSSIAN-90  [12].  The  fi¬ 
nite-field  SCF  polarizabilities  are  equivalent  to 
coupled  Hartree-Fock  (CHF)  ones  [11,  13].  We 
used  a  [5s3p2d/3s2p]  basis  set,  denoted  C  in  Ref. 
[4],  of  contracted  Gaussian-tjq^e  functions  (GTF) 
for  the  finite-field  calculations.  It  consists  of  a 
[4s2p/2s]  double-zeta  substrate  [14]  augmented 
by  [1  sip /Is]  diffuse  GTF,  and  [2d/2p]  polariza¬ 
tion  GTF — one  optimized  for  electron  correlation 
[15]  and  the  other  for  polarizabilities  [4].  We  com¬ 
pute  uncoupled  Hartree-Fock  (UCHF)  polarizabil¬ 
ities  [16]  to  estimate  the  relative  contribution  of 
the  77-electrons. 

The  molecules  were  placed  on  the  yz  plane  to 
block  diagonalize  the  polarizability  tensor  whose 
eigenvalues  <  a2  <  a3  determine  the 

mean: 

a  =  =  1(oli  +  a2  +  a3)  (1) 

and  the  difference  between  the  mean  in-plane  and 
out-of-plane  components: 

A, a  =  a,!  - 

=  +  a^)  -  ttj.  (2) 


Atomic  units  are  used  in  this  work  for  polarizabili¬ 
ties. 


Equilibrium  Geometries 

Figure  1  is  a  scale  drawing  of  our  MP2/6-31G* 
equilibrium  geometries  for  the  13  azines.  The 
MP2 /6-31G*  bond  lengths  were  remarkably  simi¬ 
lar  in  all  the  azines;  CC  bonds  were  between  138.5 
and  139.6  pm,  CN  and  NN  bonds  between  133.5 
and  134.7  pm,  and  CH  bonds  between  108.5  and 
108.8  pm.  A  comparison  of  our  structures  with 
experimental  [17-25]  and  other  ab  initio  calcula¬ 
tions  [26-35]  follows. 

Seven  of  our  geometries  are  not  tabulated  be¬ 
cause  they  are  very  similar  to  previously  reported 
calculations.  MP2/6-31G*  geometries  for  benzene, 
pentazine,  and  hexazine  were  reported  previously 
[26,  27,  29].  Raman  [17]  and  infrared  (IR)  [25] 
spectroscopy  leads  to  a  similar  geometry  for  ben¬ 
zene.  There  are  no  published  experimental  geome¬ 
tries  for  pentazine  and  hexazine  because  they  have 
not  yet  been  isolated  [27,  29].  Our  1,3,5-triazine 
geometry  coincides  exactly  with  an  MP2/6-31G** 
calculation  of  Creuzet  and  Langlet  [31].  They  dis¬ 
cussed  the  good  agreement  with  an  experimental 
Raman  structure  [18].  There  is  even  better  agree¬ 
ment  with  the  combined  electron  diffraction  (ED), 
Raman,  and  calculated  geometry  of  1,3,5-tri¬ 
azine  [22]  which  differs  from  ours  by  less  than  0.1 
pm  and  0.1°  except  that  their  CH  bond  is  1.8  pm 
longer.  Our  unlisted  MP2/6-31G*  geometries  for 
1,2,3-triazine  and  1,4-diazine  are  within  0.2  pm 
and  0.1°  of  their  frozen  core  counterparts  [32,  33]. 
Fischer  et  al.  [32]  found  fair  agreement  between 
their  MP2  calculation  and  an  X-ray  structure  [21] 
for  1,2,3-triazine.  Our  geometry  for  1,4-diazine  dif- 
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FIGURE  1.  The  azines  at  their  MP2/6-31G*  optimized  geometry:  benzene  (1),  pyridine  (2),  1,2~diazine  or  pyridazine 
(3),  1,4-diazine  or  pyrazine  (4),  1,3-diazine  or  pyrimidine  (5),  1 ,2,3-trlazine  (6),  1,2,4-triazine  (7),  1 ,3,5-trlazine  or 
s-triazine  (8),  1 ,2,3,4-tetrazine  (9),  1 ,2,4,5-tetrazine  or  s-tetrazine  (10),  1,2,3,5-tetrazine  (11),  pentazine  (12),  and 
hexazine  (13).  MP2/  C  dipole  moments  are  shown  to  scale,  with  the  arrow  head  pointing  to  the  negative  end.  The  solid 
line  is  the  a  axis  of  inertia  and  the  dotted  line  is  the  axis  of  polarizability  (not  specified  for  symmetric  tops). 
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fers  by  about  0,5  pm  and  <  1.1°  from  the 
structure  obtained  by  Cradock  et  al.  [23]  using 
electron  diffraction  and  liquid-crystal  (LC)  nuclear 
magnetic  resonance  (NMR)  data.  Our  unreported 
geometry  for  l,2A5-tetrazine  is  very  similar  to  an 
MP2/6-311G**  calculation  [35]  except  that  our 
NN  bond  is  0.4  pm  longer.  The  same  work  [35] 
included  a  CASPT2/ANO  geometry  in  closer 
agreement  to  the  microwave  (MW)  structure 
[20].  However,  the  experimental  parameters  have 
rather  large  uncertainties;  for  example,  the  NN 
bond  length  of  132.56  ±1.6  pm  [20]  does  not  rule 
out  our  MP2/6-31G*  value  of  133.9  pm. 

Table  I  lists  our  calculated  geometries  for  the 
other  six  azines.  We  are  unaware  of  previous  elec¬ 
tron-correlated  geometries  for  pyridine  and  1,3- 
diazine,  although  SCF  geometries  have  been  pub¬ 
lished  in  6-31G*  and  6-31G**  basis  sets  [36,  37] 
among  others.  Our  MP2/6-31G*  geometry  for 
1,2-diazine  has  bond  lengths  that  differ  from  their 


MP2/6-311G**  counterparts  [30]  by  up  to  1.7  pm. 
Table  I  shows  that  our  pyridine  geometry  agrees 
with  the  microwave  substitution  structure  [19] 
within  0.6  pm  and  0.3°,  Our  geometries  for  1,2- 
and  1,3-diazine  are  in  reasonable  agreement  with 
the  structures  obtained  by  Cradock  et  al.  [23, 
24]  by  combining  ED,  MW,  and  LC-NMR  data. 
Table  I  shows  that  the  average  absolute  discrepan¬ 
cies  are  0.7  pm  and  0.5°,  but  the  largest  bond 
length  differences  are  1.3  pm  for  CN  in  1,3-diazine 
and  1.5  pm  for  C4H  in  1,2-diazine. 

Table  I  compares  our  geometries  for  the  other 
three  azines  with  previous  calculations  because  we 
are  unaware  of  any  experimental  geometries.  The 
recent  MP2/TZVP  geometry  [34]  for  1,2,4-triazine 
has  virtually  identical  bond  angles  but  longer  ring 
bonds  and  shorter  CH  bonds  with  an  average 
difference  of  0.5  pm.  Table  I  shows  that  our  bond 
lengths  differ  from  CISD/DZP  ones  [28]  by  an 
average  and  maximum  of  1.6  and  3.8  pm  for 


TABLE  I _ _ _ 

Comparison  of  MP2/6-31G*  geometries  with  others.  Bond  lengths  in  picometers  and  bond  angles  in  degrees. 


Pyridine 

1,2-diazine 

1,3-diazine 

1 ,2,4-triazine 

1 ,2,3, 4-1 

tetrazine 

1 ,2,3,5Tetrazine 

MP2 

Exp.^ 

MP2 

m 

X 

p 

’  IT 

MP2 

Exp.'" 

MP2 

MPa'" 

MP2 

CISD" 

MP2 

CISD^ 

1  -2 

134.4 

133.8 

134.7 

133.7 

134.1 

132.8 

134.7 

134.1 

134.6 

133.0 

133.8 

131.1 

CO 

1 

CM 

139.4 

139.4 

134.3 

133.8 

134.0 

133.6 

133.5 

129.7 

1 

CO 

139.3 

139.2 

139.6 

140.0 

134.3 

135.0 

134.6 

134.2 

134.3 

133.3 

4-5 

138.5 

138.5 

139.1 

139.3 

133.5 

133.2 

133.7 

131.4 

133.6 

132.3 

5-6 

139.6 

139.4 

139.0 

139.6 

6-1 

134.0 

133.6 

2-H 

108.8 

108.7 

108.7 

108.2 

3-H 

108.6 

108.3 

108.7 

107.9 

108.6 

107.9 

4-H 

108.7 

108.2 

108.6 

107.1 

108.8 

107.9 

108.7 

108.1 

5-H 

108.5 

108.7 

108.8 

108.1 

108.6 

108.0 

6-H 

108.6 

108.0 

612 

116.8 

116.9 

119.0 

119.4 

115.6 

116.0 

117.6 

117.5 

117.8 

118.3 

118.2 

118.2 

123 

123.8 

123.8 

127.4 

128.0 

118.0 

118.1 

121.7 

121.9 

119.8 

120.5 

234 

118.7 

118.5 

124.1 

123.8 

127.4 

127.5 

345 

118.4 

118.4 

116.9 

122.3 

121.2 

114.1 

113.9 

125.6 

125.3 

456 

116.9 

117.8 

120.7 

120.7 

120.6 

119.8 

112.6 

112.5 

561 

122.3 

122.3 

H23 

120.5 

120.2 

H34 

121.2 

121.4 

121.4 

121.3 

117.1 

117.2 

H45 

122.2 

122.5 

121.5 

120.9 

118.1 

118.5 

H56 

121.9 

121.7 

123.1 

123,0 

H61 

115.6 

115.7 

^MW  Tg  structure,  Ref.  [19]. 

‘"ED,  MW,  LC-NMR  structure,  Refs.  [23,  24]. 
^TZVP  basis,  Ref.  [34]. 

^DZP  basis,  Ref.  [28]. 
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1,2,3,4-tetrazine  and  1.1  and  2.7  pm  for  1,2,3,5- 
tetrazine.  All  but  the  CC  bond  are  shorter.  Angles 
differ  by  as  much  as  0.8°. 


Dipole  Moments 

Five  of  the  molecules  have  a  zero  dipole  mo¬ 
ment;  Figure  1  shows,  to  scale,  the  magnitudes, 
orientations,  and  polarities  of  the  dipole  moments 
of  the  other  eight  azines.  Figure  1  also  shows  the 
inertial  a  axis  corresponding  to  the  smallest  princi¬ 
pal  moment  of  inertia.  The  orientation  of  the  dipole 
moment  is  determined  by  symmetry  except  in 

I, 2,4-triazine  whose  MP2/C  dipole  moment  makes 
an  angle  of  45.19°  with  the  a  axis.  In  each  azine, 
the  negative  end  of  the  dipole  moment  points  to 
the  region  with  the  most  nitrogens. 

Our  MP2/C  dipole  moments  are  listed  in  Table 

II.  Half  are  higher  and  half  lower  than  the  unlisted 
SCF/C  dipole  moments  by  an  average  of  1.5%  and 
a  maximum  of  3%.  The  dipole  moments  obtained 
by  microwave  Stark  spectroscopy,  2.215D  for  pyri¬ 
dine  [38],  4.22D  for  1,2-diazine  [39],  and  2.334D  for 
1,3-diazine  [40],  are  0.13-0.16D  (or  4-6%)  lower 
than  our  MP2/C  values.  Somewhat  better  agree¬ 
ment  with  experiment  was  obtained  by  Palmer  et 
al.  who  reported  MRCI/TZVP  dipole  moments  of 
2.275D,  4.343D,  and  2.383D  for  pyridine  [41],  1,2- 
diazine  [42],  and  1,3-diazine  [43],  respectively.  For 
the  other  azines,  there  are  no  experimental  dipole 


moments  available,  and  our  calculations  are  more 
accurate  than  previous  ones.  Palmer  et  al.  [44] 
reported  minimal  basis  SCF  dipole  moments  (at  ad 
hoc  geometries)  for  all  eight  azines  which  are  on 
average  16%  lower  than  our  SCF /C  values.  Mo  et 
al.  reported  [45]  SCF/6-31GV/SCF/6-31G  dipole 
moments  for  seven  azines;  they  differ  from  our 
SCF/C  values  by  an  average  of  1.6%  and  a  maxi¬ 
mum  of  4.4%. 


'  Polarizabilities 

Figure  1  shows  the  MP2/C  principal  axis  of 
highest  polarizability — the  0^3  axis.  It  lies  on  a 
symmetry  axis  except  in  1,2,4-triazine  where  it 
makes  a  86.31°  angle  with  the  inertial  a  axis.  The 
a3  axis  is  generally  more  distant  from  the  less 
polarizable  nitrogen  atoms  than  the  a 2  axis.  The 
three  symmetric  tops  have  an  isotropic  in-plane 
polarizability,  and  a  nonunique  a3  axis,  which  is 
therefore  not  shown  in  Figure  1. 

Our  MP2/C  polarizabilities  are  listed  in  Table 
II.  Figures  2  and  3  show  that  MP2/C  polarizabili¬ 
ties  are  higher  than  the  CHF/C  values  by  an 
average  of  4.2%  and  7%  for  a  and  A^a,  respec¬ 
tively,  with  maximum  differences  of  7.2  and  12% 
for  s-triazine.  The  UCHF/C  a  and  A^a,  respec¬ 
tively,  are  3  and  10%  different  from  the  CHF/C 
results  on  average,  but  are  12  and  26%  higher  for 
the  worst-case  hexazine. 


TABLE  II _ _ 

MP2/C  dipole  moments  in  Debyes  and  polarizabilities  in  atomic  units  at  the  MP2/6-31G*  geometries. 
The  7T-fractions  f  of  the  UCHF  polarizabilities  are  given  as  percentages. 


0^1 

0^2 

^3 

a 

A^a 

A  2  Ol. 

Agtt 

f(±  ,  tt) 

fill  77) 

Benzene 

45.17 

81.33 

81.33 

69.28 

36.16 

36.16 

0.00 

62.0 

56.1 

Pyridine 

2.36 

41.08 

73.23 

77.55 

63.95 

34.31 

34.51 

3.74 

59.2 

54.8 

1 ,2-Diazine 

4.38 

37.81 

69.70 

71.38 

59.63 

32.73 

32.76 

1.46 

56.8 

54.3 

1 ,4-Diazine 

37.34 

65.50 

74.36 

59.06 

32.59 

33.48 

7.67 

56.7 

54.8 

1 ,3-Diazine 

2.46 

37.44 

67.78 

70.75 

58.65 

31.82 

31.93 

2.57 

55.9 

52.7 

1 ,2,3-Triazine 

5.20 

34.98 

64.02 

64.56 

54.52 

29.31 

29.31 

0.47 

54.2 

53.5 

1,2,4-Triazine 

2.72 

34.60 

60.90 

67.16 

54.22 

29.43 

29.93 

5.42 

53.7 

53.4 

1,3,5-Triazine 

34.20 

62.89 

62.89 

53.33 

28.69 

28.69 

0.00 

51.9 

49.9 

1 ,2,3,4-Tetrazine 

4.71 

32.34 

57.33 

60.64 

50.11 

26.65 

26.80 

2.86 

51.6 

53.2 

1 ,2,4,5-Tetrazine 

32.50 

55.88 

61.52 

49.97 

26.20 

26.65 

4.88 

51.3 

54.0 

1 ,2,3,5-Tetrazine 

2.73 

32.23 

56.41 

60.85 

49.83 

26.40 

26.68 

3.85 

50.5 

51.8 

Pentazine 

2.78 

30.49 

52.52 

55.39 

46.13 

23.47 

23.60 

2.49 

48.8 

52.9 

Hexazine 

28.98 

49.16 

49.16 

42.44 

20.18 

20.18 

0.00 

46.8 

53.2 
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FIGURE  2.  Percent  differences  between  the  mean  polarizability  computed  by  a  given  method  and  its  counterpart 
calculated  at  the  CHF/C  level.  The  numbering  system  for  the  azines  Is  defined  in  Figure  1. 


A  comparison  with  previous  CHF  calculations 
is  also  made  in  Figures  2  and  3.  Hinchliffe  et  al/s 
[37,  46]  CHF  calculations  using  a  6~311G  +  3d,3p 
basis  set,  denoted  CHF/G,  are  consistently  3  and 
5%  below  our  CHF/C  results  for  a  and  A^a, 
respectively,  because  their  basis  set  is  less  appro¬ 


priate  for  polarizabilities.  By  contrast,  CHF  results 
[3,  47]  obtained  using  Sadlej's  basis  [48]  designed 
specifically  for  polarizabilities,  denoted  CHF/S, 
are  very  close  to  our  CHF/C  values;  the  differ¬ 
ences  average  only  0.6%  for  a  and  1.4%  for  A^a. 
Even  these  small  differences  are  chiefly  due  to 


FIGURE  3.  Percent  differences  between  the  polarizability  anisotropy  Aia  computed  by  a  given  method  and  its 
counterpart  calculated  at  the  CHF/C  level.  The  numbering  system  for  the  azines  is  defined  in  Figure  1. 
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small  differences  in  geometry.  To  confirm  this,  we 
calculated  UCHF/C  polarizabilities  at  the  geome¬ 
tries  (denoted  //exp)  used  in  Refs.  [3,  47].  The 
(UCHF/C//MP2/6-31G*-UCHF/C//exp)  differ¬ 
ences  always  agreed  in  sign  and  rough  magnitude 
with  the  (CHF/C//MP2/6-31G*-CHF/S//exp) 
differences. 

A  comparison  with  previous  correlated  calcula¬ 
tions  is  also  made  in  Figures  2  and  3.  Some  older 
calculations  discussed  previously  [3]  are  not  men¬ 
tioned  here.  The  MCLR  a  of  Knuts  et  al.  [49]  are 
significantly  lower  (4.5%  on  average)  than  even 
our  CHF/C  values.  Moreover,  their  MCLR  results 
are  lower  than  their  own  RPA  values  whereas  all 
other  calculations  predict  that  correlation  raises  a 
for  the  azines.  Their  MCLR  values,  obtained  at  the 
geometry  of  benzene  for  all  the  azines,  are  proba¬ 
bly  unreliable.  By  contrast,  the  MP2/S(adlej)  po¬ 
larizabilities  [3]  for  three  azines  differ  from  the 
MP2/C  values  by  only  1  and  2%  for  a  and  A^a, 
respectively.  Again  the  small  differences  are  due 
to  small  geometrical  differences.  We  found  the 
MP2/C  values  for  s-tetrazine  using  the  same  ge¬ 
ometry  [20]  as  in  Ref.  [3]  to  be  49.44,  25.66,  and 
26.07  for  a,  A^a,  and  A2OL,  respectively,  which  is 
within  0.1  of  the  MP2/S  result  [3].  Schiitz  et  al. 
[35]  reported  very  similar  CASPT2/ 
S//CASPT2/ANO  values  of  49.2,  25,  and  25.3  a.u. 
for  a,  A^a,  and  A2a,  respectively.  Archibong  and 
Thakkar  [3]  reported  hybrid  polarizabilities  calcu¬ 
lated  by  combining  CHF/S  or  MP2/S  values  with 
higher  order  correlation  corrections  computed  with 
Spackman's  smaller  basis  set  [50].  Figures  2  and  3 
show  that  there  is  no  uniformity  to  the  differences, 
averaging  1.2  and  3.6%  for  a  and  A^a,  respec¬ 
tively,  between  their  hybrid  results  and  our 
MP2/C  values.  Our  imiform  MP2/C  results  are 
likely  to  be  more  reliable. 

There  have  been  many  experimental  measure¬ 
ments  of  the  polarizability  of  benzene  but  only 
two  in  the  gas  phase.  Alms  et  al.  [51]  extrapolated 
refractivity  data  and  Rayleigh  scattering  depolar¬ 
ization  ratios  to  zero  frequency  to  find  a  =  67.48 
and  A^a  =  35,0.  Gentle  and  Ritchie  [52]  used  the 
temperature  dependence  of  electro-optical  Kerr  ef¬ 
fect  measurements,  and  Rayleigh  scattering  depo¬ 
larization  ratios  at  optical  frequencies,  to  find  a 
zero  frequency  value  of  A^a  =  38.5  ±  1.2  after  cor¬ 
recting  for  vibronic  effects.  These  measured  values 
are  in  good  agreement  with  our  MP2 /C  values  of 
a  =  69.3  and  =  36.2  given  that  we  have  omit¬ 
ted  the  effects  of  zero-point  vibrational  motion. 


There  has  been  much  less  experimental  work  on 
the  polarizabilities  of  the  other  azines.  The  only 
gas-phase  result  we  are  aware  of  is  a  value  of 
A^a  =  5.4  for  s-tetrazine  obtained  by  laser  Stark 
spectroscopy  [53];  however,  this  value  is  almost 
five  times  as  small  as  theoretical  calculations  [3, 
35,  53]  which  are  all  close  to  our  MP2/C  value  of 
26.2.  This  huge  discrepancy  has  not  yet  been  re¬ 
solved.  Battaglia  and  Ritchie  [54]  extracted  polariz¬ 
abilities  from  dipole  moments,  molar  refractions, 
and  molar  Kerr  and  Cotton-Mouton  constants 
measured  at  633  nm  for  benzene  and  five  azines 
dissolved  in  dioxane.  They  had  to  assume  A3a  =  0 
to  carry  out  the  analysis;  Table  II  shows  that  this 
assumption  is  most  nearly  valid  in  1,2-diazine  and 
least  valid  in  1,4-diazine.  They  determined  a  to  be 
70.4,  64.3,  56.7,  and  50.4  for  benzene,  pyridine,  the 
diazines,  and  s-triazine,  respectively.  These  values 
differ  from  our  MP2/C  results  by  an  average  of 
3.4%  and  a  maximum  of  6.2%.  Battaglia  and  Ritchie 
[54]  found  A^a  «  A2a  ^  25.4,  25.5,  26.4,  27.3,  29.2, 
and  29.1  for  benzene,  pyridine,  1,2-diazine,  1,4- 
diazine,  1,3-diazine,  and  s-triazine,  respectively. 
These  anisotropies  are  lower  than  our  MP2/C  val¬ 
ues  by  amounts  ranging  from  30  to  9%  except  for 
s-triazine  where  their  value  is  only  1.5%  higher 
than  ours.  Given  that  Battaglia  and  Ritchie's  polar¬ 
izabilities  are  for  a  wavelength  of  633  nm  in  diox¬ 
ane  solution  and  based  on  a  questionable  assump¬ 
tion,  the  agreement  with  our  infinite  wavelength 
calculations  for  nonvibrating  isolated  molecules  is 
as  good  as  can  be  hoped  for. 

Finally,  note  from  Table  II  that  the  UCHF  tt- 
fraction  of  the  transverse  polarizability  decreases 
with  aza-substitution  from  62%  in  benzene  to  47% 
in  hexazine.  The  UCHF  rr-fraction  of  the  mean 
in-plane  polarizability  varies  more  erratically  be¬ 
tween  50  and  56%.  These  fractions  are  very  similar 
to  those  obtained  from  a  nonempirical  Unsold  ap¬ 
proximation  by  Mulder  et  al.  [55]. 


Polarizability  Models 

Molecular  polarizabilities  have  long  been  mod¬ 
eled  as  a  sum  of  contributions  from  the  constituent 
atoms  [56,  57].  Such  models  work  rather  well  for 
ab  initio  polarizabilities  of  many  organic  [58]  and 
heteroaromatic  molecules  [4,  6].  Here  we  examine 
models  that  can  describe  simultaneously  the  polar¬ 
izabilities  of  10  azoles  [4],  10  oxazoles  [6],  and  the 
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13  azines.  The  simplest  is  the  additive  atom  model: 

in  which  is  the  number  of  atoms  of  type  i.  The 
parameters,  obtained  by  linear  regression  of  our 
1MP2/C  polarizabilities  for  33  molecules,  are  listed 
in  Table  III  with  the  modeTs  average  absolute 
error  8^  and  maximum  error  Model  5  predicts 
the  average  polarizability  of  33  heteroaromatic 
molecules  with  8^  =  1.1%  and  8„^  -  3%.  The  coef¬ 
ficient  of  the  term  can  be  interpreted  as  the 
polarizability  of  an  atom  of  type  i  in  a  planar 
heteroaromatic  molecule;  Table  III  shows  that  these 
coefficients  are  smaller  than  free  atom  polarizabili¬ 
ties  [2].  When  model  5  is  applied  to  the  anisotropy 
the  errors  are  tripled.  Addition  of  a  molecular 
''volume"  term,  taken  to  be  as  in  Ref.  [6], 

where  A  is  the  area  enclosed  by  the  ring,  yields 
the  model 

a  -  +  ^4^0  +  (6) 

Table  III  shows  that  this  model  improves  8^^  to 
0.66  and  3%  for  a  and  A^a,  respectively.  Unfortu¬ 
nately,  the  coefficients  of  the  terms  no  longer 
have  a  simple  interpretation. 

A  more  elaborate  model  uses  additive  contribu¬ 
tions  from  connections  [4] 


where  is  the  number  of  connections  (or  bonds, 
making  no  distinction  between  single  and  multiple 
bonds)  between  atoms  of  types  i  and  /,  For  our  33 
molecules,  there  are  two  constraints  upon  the 

^  ^CN  +  ^co  number  of  in¬ 
traring  connections  =  6  —  -h  +  ^co^- 

Thus  Eq.  (7)  reduces  to 

a  ~  Cl  -h  +  Q^cc 

+  ^6^CO* 

Table  III  shows  that  model  8  is  a  slight  improve¬ 
ment  over  model  6.  Adding  a  A^^^  term  to  model 
8  does  not  improve  it.  Numerical  experiments,  and 
previous  correlations  between  polarizability  and 
electronegativity  [59],  led  us  to  a  term  involving 
the  energies  of  the  frontier  orbitals: 

a  —  Cl  -h  C2n^^  +  +  Q^cc  Q^no 

T  Q^co  ^7/ X 

in  which  =  (^lumo  “  ^homo')/^  can  be  thought 
of  as  an  approximation  to  the  hardness.  Table  III 
shows  that  model  9  gives  an  excellent  prediction 
of  a  =  0.36%,  =  1,2%),  and  a  reasonable 

one  of  Aitt  (5^  =  2%,  =  6%), 

The  accuracy  of  our  simple  models  is  somewhat 
higher  than  might  have  been  expected.  We  think 
this  is  so  partly  because  the  33  heteroaromatic 
molecules  are  closely  related  and  partly  because 
our  polarizability  database  is  of  uniform  quality. 


TABLE  III _ 

Parameters  (in  atomic  units)  and  errors  of  various  polarizability  models.  Model  numbers  refer  to  equations 
in  the  main  text  and  an  asterisk  Indicates  that  a  nonuniform  data  base  was  used. 


a 

Model 

A^a 

Model 

5 

6 

8* 

8 

9* 

9 

5 

6 

8 

9 

e. 

1.1 

11.5 

0 

0 

0 

0 

-2.87 

1.93 

0 

0 

62 

10.3 

-6.56 

0 

0 

0 

0 

8.82 

1.1 

0 

0 

63 

6.86 

1.14 

0 

0 

0 

0 

3.74 

1.113 

0 

0 

64 

2.8 

7.5 

0 

0 

0 

0 

-1.7 

0.5 

0 

0 

65 

0 

0.52 

0 

0 

0 

0 

0 

0.24 

0 

0 

Ci 

0 

0 

8.6 

7.1 

-3.33 

-2.45 

0 

0 

-16.89 

0.1 

c, 

0 

0 

5.5 

5.78 

6.5 

6.56 

0 

0 

6.5 

5.1 

C3 

0 

0 

7.43 

7.75 

8.22 

8.38 

0 

0 

7.684 

6,6 

C4 

0 

0 

10.05 

10.4 

10.58 

10.8 

0 

0 

8.8 

8 

Cs 

0 

0 

3.1 

3.4 

4.6 

4.6 

0 

0 

5.8 

3.6 

Ce 

0 

0 

4.8 

5 

5.7 

5.8 

0 

0 

5.81 

4.45 

Cy 

0 

0 

0 

0 

1.604 

1.29 

0 

0 

0 

-2.3 

S,(%) 

1.1 

0.66 

0.75 

0.59 

0.45 

0.36 

3.4 

3.0 

2.7 

2.0 

sj%) 

3.4 

2.6 

2.8 

2.0 

1.9 

1.2 

11. 

13. 

10. 

6. 
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To  demonstrate  this  latter  point,  we  include  in 
Table  111  models  8*  and  9*  obtained  by  fitting  Eqs. 
(8)  and  (9)  to  our  polarizability  database  with  eight 
azine  polarizabilities  of  this  work  replaced  by  the 
hybrid  results  of  Ref.  [3].  Table  III  shows  clearly 
that  the  error  measures  for  models  8*  and  9*  are 
noticeably  larger  than  those  for  models  8  and  9. 

These  models  utilize  only  isotropic  information 
and  therefore  work  better  for  the  mean  polarizabil¬ 
ity  a  than  for  the  anisotropy  A^a.  The  Kerr 
anisotropy  Ajct  can  be  described  by  these  models 
with  an  accuracy  slightly  worse  than  that  obtained 
for  Aja.  However,  an  accurate  model  for  the  in¬ 
plane  anisotropy  AjU  requires  anisotropic  infor¬ 
mation.  We  are  trying  to  develop  such  models. 
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ABSTRACT - - 

Aspects  of  the  influence  of  a  strong  Debye  plasma  environment  on  the  negative  hydrogen 
ion  and  the  neutral  helium  atom  have  been  studied.  Contrary  to  earlier  work,  in  the 
present  calculation  all  interactions  have  been  screened.  This  increases  the  stability  of  the 
systems  to  the  extent  that  neither  the  H'  ion  nor  the  ground  state  of  helium  will  lose  an 
electron  by  pressure  ionization.  It  has  been  found  that  the  charge  distribution  of  H 
remains  remarkably  constant  over  a  vast  range  of  values  of  the  Debye  parameter  D. 

©  1996  John  Wiley  &  Sons,  Inc. 


Introduction 

In  the  investigation  of  plasmas  consisting  of 
atomic  ions  and  electrons  a  good  understand¬ 
ing  of  the  interaction  potential  among  those  species 
is  essential.  Compared  to  the  free-space  case  the 
force  between  the  charges  is  modified  due  to 
short-range  order  and  static  and  dynamical  screen¬ 
ing  effects  introduced  by  neighboring  ions  and  fast 
electrons,  respectively.  These  modifications  effect 
one-electron  properties  (e.g.,  spectral  lines  which 
serve  as  key  c^^uantities  in  plasma  diagnostics)  as 
well  as  properties  involving  more  than  one  elec- 

*To  whom  correspondence  should  be  addressed. 


tron  (e.g.,  dielectronic  recombination  rates  which 
account  for  plasma  losses).  Extending  the  usual 
Debye-Hiickel  treatment  of  screening  to  encom¬ 
pass  also  dynamic  screening  effects  due  to  ion 
motions  is  an  important  research  area  in  its  own 
right.  Such  research  is  expected  to  be  noticeably 
facilitated  by  the  incorporation  of  the  static  screen¬ 
ing  effects  into  the  calculation  of  atomic  properties 
and  processes  right  from  scratch.  Doing  so,  one 
can  then  formulate  a  procedure  which  allows  the 
simultaneous  evaluation  of  level  broadening  and 
of  the  lowering  of  the  continuum  threshold — both 
on  the  same  footing. 

In  a  recent  study  [1]  the  numerical  pair  fimction 
method  of  our  earlier  studies  of  the  Debye  poten¬ 
tial  [2]  has  been  replaced  by  a  variational  calcula¬ 
tion  employing  correlated  wave  functions.  This 
change  was  needed  in  order  to  achieve  high  accu- 
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racy  in  calculations  of  not  only  the  ground-state 
energies  of  the  negative  hydrogen  ion  and  the 
helium  atom  but  also  of  some  excited  sttaes  of  the 
latter  system,  as  well  as  the  radical  one-electron 
density  function  of  such  systems  under  various 
plasma  conditions.  In  this  investigation  the  attrac¬ 
tive  Coulomb  potential  between  the  electron  and 
the  nucleus  has  been  replaced  by  a  screened  poten¬ 
tial  of  the  Debye  type  Z  exp(-r/D)/r.  Varying 
the  Debye  length  D  from  infinity—which  corre¬ 
sponds  to  no  screening— to  small  values,  different 
plasma  conditions  can  be  simulated.  Since  the  De¬ 
bye  parameter  is  a  funtion  of  both  the  electron 
density  and  the  electron  temperature  T^,  a  par¬ 
ticular  value  of  D  corresponds  to  a  range  of  plasma 
conditions.  It  is,  however,  meaningful  to  associate 
smaller  values  of  D  with  stronger  screening.  The 
earlier  studies,  not  including  screening  of  the  elec¬ 
tron-electron  interaction,  indicated  that  the  as- 
sumedly  very  fragile  H"  ion  may  in  reality  be 
more  stable  in  weak-to-moderate  plasma  condi¬ 
tions  than  initially  assumed.  The  inclusion  of  De¬ 
bye  screening  into  the  electron  interaction  in  the 
present  work  corroborates  these  findings  strongly. 
As  expected,  this  modification  leads  to  a  further 
increase  of  stability  for  a  H“  ion  embedded  in  a 
Debye  plasma.  Since  screening  in  one  form  or  the 
other  is  a  fact  on  plasmas,  the  gross  features  of  the 
present  calculation,  which  is  strictly  based  on  the 
validity  of  the  Debye  model,  may  well  serve  as  a 
general  outline  of  trends  under  more  general  con¬ 
ditions.  The  detailed  interpretation  of  the  numeri¬ 
cal  results,  however,  requires  caution:  The  Debye 
potential  may  not  always  be  appropriate  to  model 
a  real  plasma.  In  particular,  in  the  range  of  small 
values  of  the  Debye  parameter  D,  close  to  (or  even 
smaller  than)  the  average  interparticle  distance, 
the  interpretation  of  the  present  results  in  terms  of 
a  plasma  is  not  reasonable.  However,  even  in  those 
regions  the  results  represent  properties  of  the  De¬ 
bye  potential  which  have  not  been  studied  before. 
In  particular,  by  extending  a  similar  study  by 
Rogers  and  co-workers  [3]  of  the  hydrogen  atom  to 
two-electron  systems,  it  was  possible  to  extract 
accurate  ionization  energies  for  the  helium  atom 
and,  above  all,  detachment  energies  for  H*.  Over 
most  of  the  range  of  D  values  covered  in  the 
present  work,  the  results  are  indeed  representative 
for  typical  effects  of  screening  on  the  binding  en¬ 
ergy  and  the  electron  distribution  of  small  atoms. 


The  Calculations 

The  nonrelativistic  Hamiltonian 
H=  - 

n.  2  ''l  2^2  - 

h 

Z  exp(  -  r2/D)  exp(  - 

- ^ -  +  - - - ,  (1) 

describing  two-electron  systems  in  a  Debye  plasma 
characterized  by  the  parameter  D,  has  been  diago¬ 
nalized  in  a  basis  of  correlated  functions  of  the 
following  type: 

(2) 

The  variables  and  /'2  are  the  radial  coordinates 
of  the  two  electrons  and  the  variable  r,2  stands  for 
the  distance  between  them,  while  the  quantity 
^kim  is  a  normalization  factor.  Expansions  of  this 
type  have  previously  been  shown  to  be  very  effi¬ 
cient  in  obtaining  highly  accurate  variational  wave 
functions  for  S  states  of  two-electron  atoms  [4]. 
The  two  signs  correspond  to  singlet  and  triplet 
states,  respectively.  The  nonlinear  parameters  a^., 
/3;,  and  y,,,  have  been  preselected  so  as  to  approxi¬ 
mate  a  definite  integral  optimally  as  described  in 
the  earlier  reference  which  introduced  these  func¬ 
tions  as  a  particular  type  of  integral  transform 
wave  functions.  The  linear  parameters  are  deter¬ 
mined  in  the  usual  variational  procedure  by  solv¬ 
ing  the  generalized  eigenvalue  equation 

(H-S£)4>  =  0.  (3) 

Here  H  and  S  stand  for  the  Hamiltonian  and 
overlap  matrices,  respectively.  Since  the  main  goal 
of  this  study  has  been  to  evaluate  all  of  the  bind¬ 
ing  energy  of  the  H~  ion  embedded  in  a  pure 
Debye  plasma  without  overestimating  the  effect,  a 
comparatively  large  basis  set  of  220  functions  of 
the  type  given  in  Eq.  (2)  has  been  used.  The  use  of 
correlated  wave  functions  containing  the  interelec- 
tronic  distance  r-i2  as  a  dynamical  variable  in  con¬ 
nection  with  the  Debye  potential  is  very  conve¬ 
nient  because  all  integrals  can  be  expressed  in 
closed  form  modifying  the  closed-form  expressions 
given  in  [4]. 

The  radial  one-electron  density  function  is  ob¬ 
tained  by  integrating  the  two-electron  density  ex- 
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pr6Ssion  over  the  coordinate  of  one  of  the  particles 
as  well  as  over  the  angular  variables  of  the  other, 
which  in  the  present  case  of  spherically  symmetric 
states  reduces  to  applying  a  factor  of  47r; 

Pfr^)  =  47rJ  (i^r2<I>*(rir2)<l>(rir2).  (4) 

In  the  present  study  we  focus  on  the  lowest  bound 
state  to  obtain  information  about  the  spatial  exten¬ 
sion  of  the  systems  in  various  plasma  environ¬ 
ments.  Hence,  in  Eq.  (4)  the  quantity  O’  stands  for 
the  ground-state  wave  function  but  could  be  re¬ 
placed  by  the  eigenfunction  of  any  excited  state  to 
evaluate  the  density  of  that  state. 

Although  an  extension  of  the  correlated  wave 
function  approach  to  atoms  with  more  than  two 
electrons  may  not  be  practical,  the  treatment  of 
systems  of  particles  interacting  via  Yukawa  forces 
by  an  appropriate  modification  of  more  standard 
methods  of  quantum  chemistry  or  atomic  theory  is 
possible.  In  fact,  systems  of  particles  interacting 
via  Yukawa  forces  have  been  studied  early  on  in 
nuclear  physics,  e.g.,  by  Swiatecki  [5].  This  refer¬ 
ence  contains  also  the  formulas  for  the  interaction 
integrals  of  particles  described  by  wave  functions 


of  Gaussian  type,  i.e.,  exp(-ar^),  or  interactmg 
via  Yukawa  forces.  The  adaptation  to  systems  with 
both  attractive  and  repulsive  screened  interactions 
and  the  interpretation  of  the  results  as  meaningful 
for  ions  embedded  in  a  plasma,  however,  is  new 
and  promises  interesting  results,  in  particular,  for 
the  further  sudy  of  atomic  processes  in  plasmas. 

All  calculations  have  been  performed  in  ex¬ 
tended  precision  on  the  Hewlett-Packard  9000 
735 /125  computer  system  of  the  Physics  Depart¬ 
ment  of  tlie  University  of  Nevada  at  Reno. 


Results 

RESULTS  FOR  THE  ENERGIES 

The  results  for  the  ground-state  energies  of  both 
the  negative  hydrogen  ion  and  the  neutral  hydro¬ 
gen  atom,  which  have  been  reported  before  [1],  are 
summarized  in  Figure  1.  Although  the  results  for 
large  values  of  the  Debye  parameter  D  are  not 
displayed  here  for  the  sake  of  readability  in  the 
low  D  region,  the  results  for  large  but  finite  values 
of  D  are  represented  directly  as  detachment  ener¬ 
gies  in  Figure  2.  For  D  the  energy  curves  for 


FIfSURE  1  The  enerqv  of  the  negative  hydrogen  ion  vs.  the  screening  parameter  D  compared  to  the  ground  s 
elfrgy  of  the  hydrogen  ^ori  unde?  the  saL  t^asma  conditions.  The  ionic  system  is  bound  over  the  whole  range  of  D 
values  in  which  the  neutral  system  is  also  capable  of  supporting  at  least  one  bound  sta  . 
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FIGURE  2.  Detachment  energy  for  the  negative  hydrogen  ion  vs.  the  Debye  length  D. 


H  and  H  approach  smoothly  the  corresponding 
vacuum  values  of  -0.52775  and  -0.5  a.u.,  respec- 
tively. 

In  fact,  our  findings  indicate  that  the  negative 
hydrogen  ions  survives  increasing  plasma  condi¬ 
tions  as  well  as  neutral  hydrogen  does  and  that 
both  species  lose  their  binding  property  simultane¬ 
ously  as  a  function  of  decreasing  values  of  D.  The 
smallest  value  of  D  for  which  binding  of  the 
two-electron  system  has  still  been  obtained  (al¬ 
though  with  a  tiny  value  of  the  detachment  energy 
of  0.68  X  10  ^  a.u.)  is  D  =  0.860  a.u.  This  value  is 
so  close  to  the  value  obtained  by  Rogers  and 
co-workers  [3]  for  the  limit  beyond  which  not  even 
neutral  hydrogen  is  able  to  support  a  bound  state 
that  it  is  safe  to  state  as  our  main  finding  the  fact 
that,  in  a  pure  Debye  potential,  there  is  no  way  for 
the  negative  ion  to  lose  its  second  electron  due  to 
pressure  ionization.  Any  detachment  that  occurs  is 
due  to  the  action  of  one-particle  operators  such  as 
collisions  with  other  particles  or  photons.  As  a 
function  of  decreasing  values  of  D  the  detachment 
energy  of  decreases  initially  very  slowly  from 
its  vacuum  value  of  0.0277  a.u.  and  holds  still  a 
sizable  0.0234  a.u.  at  D  =  6.  From  then  on  its 
drops  rapidly. 

The  corresponding  results  for  the  helium  atom 
in  comparison  to  the  ground-state  energy  of  the 


helium  ion  are  presented  in  Figure  3.  Again  it  has 
been  found  that  both  systems  lose  their  capability 
to  support  bound  states  at  the  same  value  of  D. 
More  precisely,  we  calculated  this  to  happen  at 
^  0.4311  a.u.  for  the  helium  ion.  The  critical 

point  of  no  binding  for  the  first  two  excited  singlet 
S  states  of  helium  is  noticeably  different  from  this 
value:  D  =  0.460  a.u.  for  the  (ls2s)  state  and  D  = 
0.4957  a.u.  for  the  (ls3s)  state.  Since  the  excited 
states  of  helium  are  pushed  into  the  continuum  in 
a  region  of  D  values  where  the  positive  helium 
ion  still  carries  a  bound  ground-state— though 
barely  so— the  corresponding  curves  in  Figure  3 
do  intersect.  For  higher  excited  states  this  crossing 
will  occur  at  values  of  D  which  still  represent 
plasma  situations.  In  these  regions,  pressure  ion¬ 
ization  of  one  Rydberg  electron  is  possible  even  in 
a  real  Debye  plasma.  In  Figure  4  the  ionization 
energies  of  the  neutral  helium  atom  are  plotted 
versus  the  Debye  length  D. 

RESULTS  FOR  THE  ONE-ELECTRON 

RADIAL  DENSITY 

The  one-electron  radial  density  has  been  calcu¬ 
lated  for  the  negative  hydrogen  ion  under  various 
plasma  conditions  mainly  in  order  to  study  the 
spatial  extension  of  this  system  as  it  becomes  less 
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FIGURE  3  The  energy  of  the  two  lowest 'S  states  of  the  helium  atom  vs.  the  screening  parameter  D 
a  SinSe  energy  S^he  positive  helium  ion  under  the  same  plasma  conditions.  While  the  ground  state  of  Hej 
?rnd  ovfr  the  3e  range  o^  D  values  for  which  there  is  a  bound  state  of  the  ion.  the  curve  of  the  exerted  He  state 
crosses  the  ionic  curve,  although  this  is  not  noticeable  in  the  graph. 


FIGURE  4.  Ionization  energy  for  the  neutral  helium  atom  vs.  the  Debye  length  D. 
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and  less  bound  with  decreasing  value  of  D.  This 
effect  is  frequently  addressed  as  “lowering  of  the 
continuum  threshold."  The  expectation  is  that 
increases  in  spatial  extension  with  the  reduction  of 
the  binding  energy.  This  is  indeed  the  case  as 
shown  in  Figure  5.  However,  this  increase  in  size 
occurs  in  a  region  of  small  D  values  which  is  not 
representative  of  a  plasma.  The  most  prominent 
feature  of  these  results  is  rather  the  high  degree  of 
stability  the  ion  exhibits  in  various  plasma  condi¬ 
tions.  There  is  very  little  change  in  the  density 
from  D  =  20,000  a.u.  to  D  =  4  a.u.  A  small  shift  of 
electron  density  from  the  irmer  regions  toward  the 
tail  of  the  distribution  is  noticeable.  The  most 
striking  feature,  however,  is  the  fact  that  we  do 
not  see  more  of  it.  Even  at  D  =  2  a.u.  the  curve 
retains  its  maximum  at  roughly  the  same  position. 
From  then  on,  however,  the  charge  distribution 
changes  noticeably.  The  lowest  curve,  correspond- 
ing  to  D  =  0.860  a.u.,  is  so  extended  that  within  a 
region  of  r  <  80  a.u.  not  more  than  25%  of  the 
total  charge  is  collected.  The  relative  constancy  of 
both  the  detachment  energy  and  the  electron  dis¬ 
tribution  of  H  accounts  well  for  its  property  to 


enhance  the  opacity  of  plasmas.  The  unexpectedly 
high  degree  of  stability  of  the  ion  points  toward  its 
presence  even  in  high-density  laboratory  plasmas 
as  opposed  to  just  in  astrophysical  plasmas. 

SOME  SPECULATIVE  RESULTS 

While  the  Debye  theory  postulates  a  uniform 
screening  potential  between  two  charges,  it  seems 
plausible  to  assume  that  local  fluctuations  of  the 
screening  are  not  only  possible  but  rather  the  rule. 
If  that  is  so,  it  may  be  argued  that  the  electrons, 
being  light  particles,  can  be  more  efficient  in  tak¬ 
ing  advantage  of  such  local  fluctuations  than  an 
electron— ion  pair  which  has  one  less  mobile  part¬ 
ner.  In  particular,  it  may  be  argued  that  electron 
pairs  will  preferably  move  into  regions  with  higher 
screening  potential  (i.e.,  lower  values  of  D)  be¬ 
cause  this  will  lower  the  energy  of  the  system  and 
that  electron  pairs  are  more  efficient  to  do  so  than 
electron-ion  pairs.  In  order  to  explore  the  rudi¬ 
ments  of  an  emerging  theory  of  such  ideas,  we 
have  studied  cases  in  which  the  values  of  the 
Debye  parameter  for  the  electron-electron  interac- 


r,',n  radial  one-electron  density  of  the  negative  hydrogen  ion  in  various  plasma  environments  The  too 
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tion  was  chosen  different  from  those  of  the  elec¬ 
tron-nucleus  potential.  In  practice,  we  have  aug¬ 
mented  the  last  term  of  the  Hamiltonian  Eq.  (1)  by 
a  scaling  factor  y  to  read  exp(  — i/r^2/^)/^i2 
the  other  terms  remain  unchanged.  By  changing  y 
one  can  modify  the  screening  of  the  electron  inter¬ 
action  relative  to  the  other  interactions.  Results  of 
these  calculations  are  presented  in  Table  1.  The 
most  interesting  feature  of  this  scenario  is  the 
possibility  of  excited  states  of  H“.  Although  here 
only  states  of  S  symmetry  have  been  included,  it 
is  likely  that  bound  states  of  higher  angular  mo¬ 
mentum  do  exist.  Then  one  would  expect  low- 
energy  electromagnetic  transitions,  which,  if  iden¬ 
tifiable,  might  serve  diagnostic  purposes.  Astro- 
physical  spectra  usually  have  lots  of  broadened 


low-energy  transition  lines  so  that  the  identifica¬ 
tion  of  such  transitions  may  be  difficult.  Labora¬ 
tory  plasma  have  a  much  better  chance  for  the 
detection. 


Summary 

While  in  previous  work  the  screening  effects 
were  applied  only  to  the  electron-ion  interaction, 
the  present  study  extends  Debye  screening  also  to 
the  electron-electron  interaction.  A  variational  cal¬ 
culation  of  the  ground-state  energy  of  the  negative 
hydrogen  ion  has  been  performed  for  various 
plasma  conditions  characterized  by  different  val- 


TABLE  I _ _ _ ^ _ 

Binding  energies  of  the  negative  hydrogen  ion  are  listed  for  various  values  of  the  screening  parameter  D  and 
various  vaiues  of  the  scaling  factor  y  by  which  the  screening  of  the  electron  -  electron  interaction  can  be 
modified  without  affecting  the  screening  of  the  electron  nuclear  potential.  For  comparison,  the  values  for 
neutral  hydrogen  (1s)  are  given  in  column  2.  Multiple  entries  in  columns  2-5  indicate  the  existence  of  excited 
states  of  symmetry 


Negative  Hydrogen  Ion 


H  Atom 

y=  1.0 

y=1.1 

II 

cn 

o 

cvi 

II 

D  -  200.0 

0.49502 

0.52276 

0.52325 

0.52520 

0.52760 

0.49517 

0.49642 

0.49801 

0.49578 

0.49684 

D  =  100.0 

0.49007 

0.51780 

0.51876 

0.52254 

0.52717 

0.49033 

0.49237 

0.49501 

0.49109 

0.49275 

D  =  50.0 

0.48030 

0.50795 

0.50978 

0.51693 

0.52554 

0.48060 

0.48361 

0.48761 

0.48131 

0.48319 

D  =  30.0 

0.46748 

0.49499 

0.49785 

0.50894 

0.52203 

0.46772 

0.47127 

0.47618 

0.46804 

0.46968 

D  =  20.0 

0.45182 

0.47904 

0.48303 

0.49829 

0.51590 

0.45192 

0.45545 

0.46066 

0.45188 

0.45284 

D  =  15.0 

0.43653 

0.46338 

0.46834 

0.48706 

0.50827 

0.43960 

0.44461 

0.43673 

D=  10.0 

0.40706 

0.43295 

0.43942 

0.46340 

0.48975 

0.40867 

0.41252 

o 

00 

II 

Q 

0.38588 

0.41092 

0.41821 

0.44499 

0.47387 

0.38656 

0.38929 

o 

CD 

II 

Q 

0.35226 

0.37568 

0.38390 

0.41372 

0.44507 

0.35312 

o 

11 

Q 

0.29092 

0.31074 

0.31960 

0.35135 

0.38351 
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ues  of  the  Debye  parameter  D.  This  results  in  a 
considerable  increase  in  stability  of  the  ion  as 
compared  to  the  case  of  an  unscreened  interaction 
(i.e.,  with  only  the  electron-ion  potential  screened). 

As  an  example  of  related  research,  we  refer  to  a 
study  by  Scheibner,  Weisheit,  and  Lane  [6]  of 
plasma  screening  effects  on  proton-impact  excita¬ 
tion  of  positive  ions.  In  this  work  two  screened 
ions  interact  via  a  Debye  force. 

ACKNOWLEDGMENT 

This  work  was  supported  by  the  Division  of 
Chemical  Sciences,  Office  of  Basic  Energy  Sciences, 


Office  of  Energy  Research,  U.S.  Department  of 
Energy. 


References 

1.  P.  Winkler,  Brief  Report,  Phys.  Rev.  E,  to  appear, 

2.  Z.  Wang  and  P.  Winkler,  Phys,  Rev.  A  52,  216  (1995), 

3.  F.  J.  Rogers,  H.  C.  Graboske,  Jr.,  and  D.  J.  Harwood;  Phys. 
Rev.  Al,  1577  (1970). 

4.  P.  Winkler  and  R.  N.  Porter,  J.  Chem.  Phys.  61,  2038  (1974). 

5.  W.  J.  Swiatecki,  Proc.  Roy.  Soc.  A205,  238  (1951). 

6.  K.  Scheibner,  J,  C.  Weisheit,  and  N.  F.  Lane,  Phys.  Rev.  A  35, 
1252  (1987). 


1650 


QUANTUM  CHEMISTRY  SYMPOSIUM  NO.  30 


Condensed-Phase  Effects  on  the 
Conformational  Equilibrium  of 
Ethylene  Glycol 


B.  J.  COSTA  CABRAL 

Departamento  de  Quimica  e  Bioquimica,  Faculdade  de  Ciencias  da  Universidade  de  Lisboa,  1700 
Lisboa,  Portugal,  and  ^Centro  de  Fisica  da  Materia  Condensada  da  Universidade  de  Lisboa,  Av. 
Professor  Gama  Pinto,  2,  1699  Lisboa,  Portugal;  e-mail:  ben@alfl.ciifc.ulpt 

Received  February  25,  1996;  accepted  May  7,  1996 


ABSTRACT _ _ _ 

Density  functional  calculations  for  ethylene  glycol  (CH2OHCH2OH)  in  the  gas  and  in  a 
dielectric  medium  are  reported.  The  condensed-phase  calculations  are  based  on  the 
self-consistent  reaction  field  approach  and  the  environment  has  the  dielectric  constant  of 
liquid  methanol.  NPT  Monte  Carlo  simulations  of  ethylene  glycol  (ETC)  in  liquid 
methanol  are  also  reported.  The  simulations  were  carried  out  for  three  conformers  of  ETC 
(tGg',  gGg',  and  tTt).  Comparison  between  SCRF  results  for  the  conformational 
equilibrium  in  the  gas  and  in  the  dielectric  suggests  that  the  tGg'  conformer  is  slightly 
stabilized  relative  to  the  gGg'  conformer  in  the  solvent.  However,  the  energy  difference 
between  them  is  less  the  1.0  kj/mol,  which  indicates  that  frequent  interconversions 
between  the  tGg'  and  gGg'  conformers  are  expected  in  the  condensed  phase.  The  a\l-trans 
conformer  (tTt)  is  higher  than  the  most  stable  conformer  in  the  gas  by  14  kj/mol.  Monte 
Carlo  simulations  predict  that  the  tGg'  and  gGg'  conformers  have  very  similar  energies 
in  the  solvent.  However,  the  simulations  also  show,  in  agreement  with  experimental 
data,  that  the  tTt  conformer  is  stabilized  in  liquid  methanol,  relative  to  the  gas  phase.  The 
microscopic  mechanism  leading  to  the  stabilization  of  the  tTt  conformer  in  the  liquid  is 
related  to  the  differential  hydrogen-bonding  formation  between  the  ETG  conformers  and 
the  methanol  molecules.  ©  1996  John  Wiley  &  Sons,  Inc. 
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Introduction 

Interest  in  the  study  of  polyols  has  been  re¬ 
cently  emphasized  by  the  research  of  potent 
HIV-1  protease  inhibitors  [1,  2].  One  of  the  sim¬ 
plest  polyols  (a  diol)  is  ethylene  glycol 
(CH20HCH20H  =  ETG).  This  molecule  is  a  model 
system  to  discuss  the  formation  of  intramolecular 
hydrogen  bonding.  Ab  initio  studies  of  ETG  are 
also  a  good  test  for  the  quantum  chemical  predic¬ 
tion  of  the  relative  stability  due  to  the  presence  of 
several  conformational  local  minima. 

Another  interesting  aspect  of  ETG  is  that  the 
interactions  of  this  molecule  with  an  environment 
can  induce  important  changes  on  the  conforma¬ 
tional  equilibrium.  This  effect  is  particularly  im¬ 
portant  in  hydrogen-bonding  solvents.  In  these 
cases,  competition  between  internal  and  external 
hydrogen-bonding  formation  changes  the  confor¬ 
mational  equilibrium  of  the  solvated  molecule  rel¬ 
ative  to  the  gas-phase  monomer.  In  addition,  the 
solvation  of  polyols  in  water  or  hydrogen-bonding 
solvents  reduces  the  formation  of  hydrogen- 
bonded  networks  and  modifies  the  dynamics  lead¬ 
ing  to  the  phase  transitions  of  the  pure  solvents. 

Several  experimental  studies  on  the  conforma¬ 
tional  equilibrium  of  ETG  have  been  reported 
[3-14].  Theoretical  studies  on  the  conformational 
equilibrium  of  ETG  were  the  research  theme  of 
several  works  [15-30].  However,  some  relevant 
aspects  in  the  theoretical  modeling  of  this  molecule, 
as,  e.g.,  electronic  correlation  effects  on  the  confor¬ 
mational  equilibrium,  were  analyzed  only  recently 
[21]. 

Ab  initio  studies  of  ETG  [15-28]  predict,  in 
agreement  with  experimental  results,  that  two  con- 
formers  with  an  internal  hydrogen  bond  (gGg'  and 
tGgO  are  the  predominant  structures  in  the  gas. 
The  energy  difference  between  them  is  very  de¬ 
pendent  on  the  theoretical  level  of  the  calculations 
[28].  This  is  related  to  the  fact,  verified  by  the  most 
recent  theoretical  studies  of  ETG  [21-28],  that  the 
energy  difference  between  tGg'  and  gGg'  conform- 
ers  is  less  than  1.5  kj/mol. 

Theoretical  studies  on  the  conformational  equi¬ 
librium  of  ETG  in  condensed  phases  are  few  and 
very  recent  [26-31].  Some  of  these  studies  were 
based  on  Monte  Carlo  [26-27]  and  molecular  dy¬ 
namics  [30,  31]  methods  and  analyzed  the  solva¬ 
tion  of  ETG  in  water  [26-31]  and  carbon  tetrachlo¬ 


ride  [30].  Very  recent  molecular  dynamics  simula¬ 
tions  of  ETG  in  water  and  in  xenon  have  been  also 
reported  [31].  Other  theoretical  studies  of  ETG  in 
the  condensed  phase  were  based  on  self-consistent 
reaction  field  (SCRF)  calculations,  using  the  con¬ 
tinuum  model  approach  [28,  29].  The  simplifica¬ 
tions  introduced  by  this  approach  and  the  limita¬ 
tions  of  the  method  are  well  known.  However,  by 
including  in  the  Hamiltonian  of  the  solvated 
molecule  the  interaction  with  the  environment, 
SCRF  methods  [32-37]  provide  the  capability  to 
model  quantum  chemical  solvent  effects  in  a  self- 
consistent  way. 

A  theoretical  study  on  the  modifications  of  the 
ETG  conformational  equilibrium  when  it  is  sol¬ 
vated  in  liquid  methanol  is  presented  reported. 
Two  different  approaches  were  used:  density  func¬ 
tional  SCRF  calculations  and  Monte  Carlo  simula¬ 
tions.  The  interest  to  perform  statistical  mechanics 
computer  simulations  of  ETG  in  liquid  methanol  is 
related  to  hydrogen-bonding  formation  and  also  to 
the  fact  that  a  small  reorganization  of  the  solvent 
is  expected  considering  the  similarity  between  ETG 
and  two  methanol  molecules.  The  results  are  com¬ 
pared  to  experimental  data  and  with  other  theoret¬ 
ical  studies. 


Computational  Details 

DENSITY  FUNCTIONAL  CALCULATIONS 

The  density  functional  calculations  were  per¬ 
formed  using  for  the  exchange  the  Becke  func¬ 
tional  [38a]  and  for  correlation  the  Lee,  Yang,  and 
Parr  functional  [39]  (BLYP).  Additional  calcula¬ 
tions  using  a  hybrid  functional  including  a  mix¬ 
ture  of  Hartree-Fock  exchange  with  DFT  ex¬ 
change-correlation  (Becke's  parameter  function) 
[38b]  which  is  being  represented  by  B3LYP  are 
also  reported.  To  assess  the  importance  of  elec¬ 
tronic  correlation  effects,  some  calculations  with 
the  Becke  exchange  correlation  only  (HFB)  were 
carried  out.  The  calculations  were  performed  with 
the  G92/DFT  program  [40]  and  the  6-31G**  and 
6-311G**  basis  sets  were  used. 

Three  ETG  conformers,  defined  by  the  rela¬ 
tive  orientations  of  the  O — H  group  around  the 
C — C — O  plane  (Fig.  1)  and  the  angle  between 
C — C — O  planes,  are  considered.  The  conformers 
are  tGg',  gGg',  and  tTt.  There  are  10  different 
conformers  of  ETG.  These  three  conformers  were 
selected  because  it  is  known  that  tGg'  and  gGg' 
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tGg’ 


FIGURE  1.  tGg',  gGg',  and  tTt  conformers  of  ETG. 


are  the  most  stable  in  the  gas  phase  and  also 
because  experimental  information  predicts  a  sig¬ 
nificant  stabilization  of  the  tTt  conformer  in  ETG 
pure  liquid  [9]. 

Quantum  chemical  calculations  in  the  con¬ 
densed  phase  have  been  based  on  the  self-con¬ 
sistent  reaction  field  (SCRF)  approach  [32-37].  In 
this  approach,  the  solute  molecule  is  placed  in  a 
cavity,  which  is  presently  assumed  to  be  spherical 
of  radius  Uq  (Onsager  cavity  radius).  The  environ¬ 


ment  is  represented  by  a  dielectric  medium  of 
constant  e.  The  Onsager  cavity  radius  for  the  dif¬ 
ferent  conformers  was  in  the  3.48-3.51  A  range. 
The  value  of  the  dielectric  constant  was  that  of 
liquid  methanol  (e  =  33.6).  Recent  developments 
in  the  SCRF  are  reported  in  [36,  37]. 

The  G92/DFT  implementation  [34,  35]  of  the 
SCRF  approach  includes  only  the  Bom  and  On¬ 
sager  contributions  to  the  reaction  field.  They  are 
related,  respectively,  to  the  total  charge  and  dipole 
moment  of  the  solute.  It  is  known  that  for  some 
systems  the  contributions  to  the  reaction  field  due 
to  higher-order  multipolar  moments  can  be  impor¬ 
tant  [37].  In  addition,  the  a\l-trans  conformer  (tTt) 
has  no  permanent  dipolar  moment  and  the  present 
implementation  of  the  SCRF  it  is  not  adequate  to 
model  the  solvation  of  this  conformer. 

MONTE  CARLO  SIMULATIONS 

The  Monte  Carlo  simulations  for  ETG  solvated 
in  liquid  methanol  were  carried  out  with  one  so¬ 
lute  molecule  and  127  solvent  molecules  in  a  cube 
with  periodic  boundary  conditions.  The  ETG 
molecule  was  represented  by  six  interaction  sites, 
which  are  the  ethyl  group  and  the  hydrogen  and 
oxygen  atoms.  The  methanol  molecule  was  repre¬ 
sented  by  three  interaction  sites  which  are  the 
methyl  group  and  the  hydrogen  and  oxygen  atoms. 
The  OPTS  intermolecular  potential  function  was 
used  to  model  the  site-site  interactions  and  the 
parameters  were  those  proposed  by  Jorgensen  [41]. 

A  preferential  sampling  algorithm  [42],  ade¬ 
quate  to  simulate  dilute  solutions,  was  used.  The 
Markov  chain  was  generated  by  the  usual 
Metropolis  algorithm  in  the  NPT  ensemble  [43] 
that  is  particularly  convenient  to  model  mixtures 
and  dilute  solutions. 

The  Monte  Carlo  simulations  of  ETG  in  liquid 
methanol  were  carried  out  for  the  three  ETG  con¬ 
formers  (tGg',  gGg',  and  tTt).  The  geometry  of 
each  conformer  was  the  theoretical  gas-phase  ge¬ 
ometry  and  has  been  kept  constant  during  the 
simulations.  The  main  advantage  of  this  procedure 
is  that  the  relaxation  of  the  solvent  around  a  par¬ 
ticular  conformer  is  assured  and  that  the  statistics 
related  to  the  solute-solvent  properties  is  accurate. 
A  similar  approach  was  recently  used  in  the  simu¬ 
lations  of  ETG  solvated  in  water  and  in  Xe  carried 
out  by  Hayashi  et  al.  [31].  Although  ETG  is  a  very 
flexible  molecule,  it  should  be  observed  that  the 
main  purpose  of  the  present  simulations  is  to  pro- 
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vide  detailed  information  on  the  differential  solva¬ 
tion  of  the  three  conformers  (gGg',  tGg',  and  tTt). 

The  simulations  were  carried  out  at  25‘"C  and  1 
atm  of  pressure.  For  each  conformer,  4-6  X  10^ 
Monte  Carlo  steps  (one  step  corresponding  to  the 
trial  move  of  one  solvent  particle),  were  performed 
for  equilibration.  The  coordinates  of  the  solute 
molecule  were  perturbed  each  50-200  steps.  The 
volume  of  the  system  was  perturbed  each  2500 
steps.  The  stochastic  parameters  related  to  the 
translations  and  rotations  of  the  solute,  transla¬ 
tions,  and  rotations  of  the  solvent  molecules  and 
volume  perturbations  were  adjusted  in  order  that 
^50%  of  the  generated  configurations  are  ac¬ 
cepted.  Additional  8-12  X  10^  steps  were  gener¬ 
ated  to  estimate  the  reported  final  averages. 


Results  and  Discussion 

DENSITY  FUNCTIONAL  CALCULATIONS 

Density  functional  calculations  for  the  ETC  con- 
formers  in  the  gas  phase  and  in  a  dielectric  repre¬ 
senting  liquid  methanol  are  reported  in  Table  I. 
The  present  analysis  is  oriented  to  the  discussion 
of  energy  differences  and  dipole  moments.  A  fur¬ 
ther  analysis  of  the  geometries  and  vibrational 
frequencies  of  ETG  and  ETG— (H20)„  complexes  is 
in  progress  and  will  be  reported  elsewhere. 

The  energy  differences  relative  to  the  tGg'  con- 
former  range  from  -1.97  to  0.18  kj/mol.  There  is 
almost  no  difference  between  the  HFB/6-31G** 
and  BLYP/6-31G**  calculations.  A  significant  re¬ 
duction  (0.87  kJ/mol)  of  the  energy  difference  be¬ 


tween  the  tGg'  and  gGg'  conformers  can  be  ob¬ 
served  when  we  move  from  the  6-31G**  to  the 
6-31 IG**  basis  set.  B3LYP  results  are  significantly 
different  from  the  BLYP  results,  reducing  consid¬ 
erably  the  energy  difference  between  the  conform¬ 
ers.  Excepting  the  B3LYP/6-311G**  calculation 
(0.18  kJ/mol),  all  other  calculations  predict  that 
the  gGg'  conformer  is  slightly  favored  in  the  gas. 
The  B3LYP/6-311G**  result  is  in  good  agreement 
with  the  MP2/6-311G**  value  (0.25  kJ/mol)  pre¬ 
dicted  by  Teppen  et  al.  [25]  and  is  slightly  lower 
than  an  MP2/6-31G*  (0.7  kJ/mol)  calculation  [21]. 
ZPVE  correction  to  this  energy  is  0.12  kJ/mol  at 
the  HF/6-31G*  level  [26,  27]. 

The  experimental  studies  clearly  indicate  that 
these  two  conformers  are  the  most  stable  in  the  gas 
phase.  The  relative  stability  of  the  tGg'  and  gGg' 
conformers,  however,  is  much  more  difficult  to 
assess.  Takeuchi  and  Tasumi  [11]  suggested  the 
possible  coexistence  of  the  two  conformers  in  low- 
temperature  argon  matrices.  Walder  et  al.  [10]  in¬ 
terpreted  the  microwave  spectrum  of  ETG  in  terms 
of  the  gGg'  conformer  only.  Kristiansen  et  al.  [12] 
predicted  that  gGg'  is  1.4  (4)  kJ/mol  less  stable 
than  is  tGg'  at  300  K.  Comparison  among  the 
present  calculations,  other  theoretical  results,  and 
experimental  works  suggests  that  both  tGg'  and 
gGg'  conformers  coexist  in  the  gas  phase.  The 
all-trans  conformer  is  14  kJ/mol  higher  than  the 
gGg'  conformer  (BLYP/6-31G**),  which  is  in  good 
agreement  with  the  value  (13.6  kj /mol)  predicted 
by  Oie  et  al.  [23]. 

The  present  DFT/SCRF  calculations  predict  that 
the  solvation  free-energy  difference  between  tGg' 


TABLE  I _ _ _ 

Total  energies®  (in  au),  O  C  C — O  dihedral  angle  <p  (In  deg.),  dipole  moment  p.  (in  Debye),  and  energy 
difference  relative  to  the  tGg'  conformer  AE  (in  kJ  /  mol). 


Gas 

gGg' 

tGg' 

Energy 

<P 

Energy 

<P 

AE 

HFB/6-31G** 

-229.02260 

56.9 

2.2 

-229.02187 

63.3 

2.3 

-1.93 

BLYP/ 6-31 G** 

-230.17616 

53.9 

2.3 

-230.17541 

60.2 

2.4 

-1.97 

BLYP/ 6-31 1G** 

-230.24689 

55.1 

2.3 

-230.24648 

61.1 

2.4 

-1.1 

B3LYP/6-31G** 

-230.25757 

54.5 

2.4 

-230.25735 

59.8 

2.5 

-0.57 

B3LYP/6-311G** 

-230.32318 

55.6 

2.4 

-230.32325 

60.6 

2.5 

0.18 

Dielectric^ 

B3LYP/6-311G** 

a  -I-  -  .  .  .  /. 

-230.32499 

53.8 

2.9 

-230.32525 

57.9 

3.1 

0.68 

^Total  energies  (in  au)  for  the  aW-trans  conformer  are  -230.16952  (BLYP/ 6-31 G**)  and  -230.25221  (B3LYP/6-31G**). 
The  Onsager  cavity  radius  for  the  tGg'  and  gGg'  are,  respectively,  3.5  and  3.49  A. 

The  dielectric  constant  was  33.6  (liquid  methanol). 
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and  gGg'  conformers  is  —0.7  kj/mol.  Solvation 
free-energy  differences  between  gGg'  and  tGg' 
conformers  in  water  and  other  solvents  were  dis¬ 
cussed  by  several  theoretical  studies  [26-31].  In 
general,  they  predict  that  gGg'  is  better  solvated. 
However,  a  recent  molecular  dynamics  simulation 
by  Hayashi  et  al.  [31]  suggests  that  there  is  a 
preference  for  the  tGg'  in  aqueous  solution.  A 
previous  Hartree-Fock/SCRF  calculation  of  Alag- 
ona  and  Ghio  [29]  predicted  that  the  difference  in 
the  solvation  free  energy  between  tGg'  and  gGg' 
was  2.7  kj /mol.  Another  continuum  model  study 
by  Cramer  and  Truhlar  [28]  based  on  AMI  calcula¬ 
tions  predicted  that  this  difference  was  0.8  kj/mol. 
The  present  DFT/SCRF  calculations  also  predict  a 
reduction  of  the  O — C — C — O  dihedral  angle  {(p) 
for  the  tGg'  conformer.  The  angle  (p  changes  from 
60.6°  in  the  gas  of  57.9  in  the  dielectric.  A  similar 
reduction  in  the  dihedral  angle  <p  can  be  observed 
for  the  gGg'  conformer. 

The  present  calculations  at  the  B3LYP/ 6-31 IG** 
level  predict  that  the  dipole  moments  (see  Table  I) 
for  the  two  most  stable  conformers  in  the  gas 
phase  are  2.4  D  (gGg')  and  2.5  D  (tGg').  These 
values  are  in  excellent  agreement  with  experiment 
(2.3  ±  0.1  D)  [7a].  SCRF  calculations  predict  that 
the  values  of  the  dipole  moment  in  the  dielectric 
medium  representing  liquid  methanol  are  signifi¬ 
cantly  higher.  They  are  2.9  D  (gGg')  and  3.1  D 
(tGg').  This  result  is  in  very  good  agreement  with 
experimental  data  for  the  neat  liquid  (3.5-4.0  D) 
[5].  The  observed  increase  of  the  dipolar  moment 
of  ETG  in  the  liquid  and  the  difference  between 
the  values  for  the  two  conformers  is  in  the  present 
model  coherent  with  a  better  solvation  of  the  tGg' 
conformer. 

MONTE  CARLO  SIMULATIONS 

Thermal  properties  for  ETG  solvated  in 
methanol  are  reported  in  Table  II.  The 
solute-solvent  interaction  energies  (E^^)  for  the 
three  conformers  show  that  the  tGg'  and  gGg' 
have  similar  interaction  energies  with  methanol 
(-110.9  kJ/mol). 

The  partition  of  in  and  E^^.^ 

also  suggests  that  for  tGg'  a  higher  solute-solvent 
repulsive  energy  is  compensated  by  the  Eg ^.5  at¬ 
tractive  contribution.  The  solute -solvent  interac¬ 
tion  energy  for  the  tTt  conformer  is  -106.7  kJ/moL 
Comparison  between  Egg  and  V  from  the  simula¬ 
tions  of  ETG  in  liquid  methanol  and  those  from 
the  simulations  of  the  neat  liquid  (E*  and  V*) 


TABLE  II  _ _ 

Thermal  data  for  NPT  Monte  Carlo  simulations  of 
ETG  in  liquid  methanol. 


tGg' 

gGg' 

tTt 

-110.9  ±  4 

-110.9  ±  4 

-106.7  ±  4 

(-131.4)® 

(-136.0)® 

(-133.1)® 

^SX-12 

68.2 

64.8 

65.7 

^  sx-6 

-86.2 

-82.0 

-81.2 

^SX'^ 

-92.9 

-93.7 

-91.2 

^  ss 

-35.0  +  2 

-35.4  +  2 

-35.5  +  2 

V 

9161  ±  50 

9089  ±  45 

9193  ±  50 

C*  b 

-34.7 

9070  +  40 

The  pressure  is  1  atm  and  the  temperature  298  K.  Energies 
are  in  kJ/mol.  Volumes  are  in  A^.  is  the  total 
solute-solvent  energy.  Esx-12>  ^sx-6>  ^sx-i  respec¬ 
tively,  the  and  Coulombic  interaction  energies 

between  the  solute  and  the  solvent.  is  the  solvent  total 
energy.  E%  and  V*  are,  respectively,  the  energy  and  vol¬ 
ume  of  pure  methanol. 

^For  ETG  solvated  in  liquid  water  from  [26]. 

^From  the  simulations  of  liquid  methanol  [41]. 


suggests  that  solvation  of  ETG  in  liquid  methanol 
do  not  induces  a  significant  reorganization  of  the 
solvent.  This  is  expected,  since  ETG  can  be  seen  as 
being  formed  by  two  molecular  fragments  similar 
to  methanol.  Thus,  the  energy  difference  between 
the  tTt  conformer  and  the  tGg'  and  gGg'  conform¬ 
ers  in  liquid  methanol  can  be  estimated  as  ~4.2 
kj/mol.  If  we  compare  this  value  with  the  gas- 
phase  energy  difference  (14.0  kJ/mol  at  B3LYP/6- 
31G**),  it  is  suggested  that  solvation  of  ETG  in 
liquid  methanol  stabilizes  the  tTt  conformer  rela¬ 
tive  to  the  gas  phase.  This  result  is  in  agreement 
with  experimental  data  for  the  neat  liquid  by 
Pruettiangkura  et  al.  [9].  These  authors  predicted 
that  the  effect  of  solvation  on  the  conformational 
energy  difference  was  to  induce  a  relatively  large 
stabilization  of  the  all-frans  rotamer  in  pure  ETG 
by  6.3  kl /mol.  The  present  study  for  ETG  solvated 
in  methanol  indicates  that  the  tTt  conformer  is 
stabilized  by  ^9.8  kJ/mol  relative  to  the  gas 
phase. 

The  site-site  distribution  functions  for  the  cor¬ 
relations  between  the  ETG  oxygen  atom  and  the 
methanol  methyl  site  [O — CH3(M)],  the  ethyl  ETG 
site,  and  the  methanol  oxygen  atom  [CH2 — 0(M)] 
are  reported  in  Figures  2  and  3.  O — CH3(M)  (Fig, 
2)  shows  for  all  the  three  conformers  a  maximum 
about  3.5  A,  which  is  less  pronounced  for  the  gGg' 
conformer.  The  CH2— 0(M)  site-site  correlation 
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R(A) 

FIGURE  2.  O — CH3(M)  site-site  distribution  functions  for  the  three  conformers  of  ETG  in  liquid  methanol. 


functions  present  a  first  maximum  3.6  A.  For  the 
tGg'  conformer,  there  is  second  maximum  at  3.9  A, 
which  reflects,  in  this  case,  the  different  roles 
played  by  the  CH2  sites.  Also  indicated  is  the 
formation  of  a  second  coordination  shell,  after  a 
second  maximum^  which  is  for  all  the  three  con- 
formers  at  ~  5.0  A. 


The  correlations  between  oxygen  and  hydrogen 
atoms  are  reported  in  Figures  4  and  5.  The  correla¬ 
tions  between  the  ETG  oxygen  atom  and  the 
methanol  hydrogen  (Fig.  4)  and  the  ETG  hydrogen 
atom  and  the  methanol  oxygen  (Fig.  5)  are  very 
different  for  the  three  conformers.  As  illustrated  in 
Figure  4,  for  the  all-trans  conformer,  the  height  of 


FIGURE  3.  O — CH2(M)  site-site  distribution  functions  for  the  three  conformers  of  ETG  in  liquid  methanol. 
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FIGURE  4.  0— H(M)  site-site  distribution  functions  for  the  three  conformers  of  ETG  in  liquid  methanol. 


the  first  maximum  is  significantly  higher  than  those 
corresponding  to  the  tGg'  and  gGg'.  The  O — H(M) 
distribution  functions  show  that  the  tTt  conformer, 
when  comparison  is  made  with  the  two  other 
conformers  (tGg'  and  gGg'),  is  the  preferential 
proton  acceptor  conformer.  It  is  also  illustrated  in 


Figure  4  that  for  tGg'  correlations  between 
O — H(M)  are  favored  after  the  first  coordination 
shell. 

Figure  5  suggests  that  the  H — 0(M)  site-site 
correlation  functions  for  the  three  conformers  are 
not  very  different,  although  the  approaching  of 


FIGURE  5.  H— 0(M)  site-site  distribution  functions  for  the  three  conformers  of  ETG  in  liquid  methanol. 
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R(A) 

FIGURE  6.  Average  coordination  numbers  as  a  function  of  the  site-site  distances  for  the  tGg'  conformer  in  liquid 
methanol. 


oxygen  methanol  atoms  to  ETG  hydrogens  seems 
slightly  more  important  for  the  gGg'  conformer. 
However,  this  figure  also  suggests  that  after  the 
first  coordination  shell  the  approaching  of  0(M) 
atoms  is  more  favored  to  tXt  and  tGg'  conformers. 


This  is  related  to  the  second  maximum  of  the 
correlation  functions  at  ~  3.4  A. 

Figures  6-8  report  the  average  coordination 
numbers  as  a  function  of  the  distance  between  the 
sites,  respectively,  for  the  tGg',  gGg',  and  tit  con- 


FIGURE  7.  Average  coordination  numbers  as  a  function  of  the  site-site  distances  for  the  gGg'  conformer  in  liquid 
methanol. 
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FIGURE  8.  Average  coordination  numbers  as  a  function  of  the  site-site  distances  for  the  tTt  conformer  in  liquid 
methanol. 


formers.  For  the  tGg'  conformer.  Figure  6  shows 
that  given  one  oxygen  ETG  atom^  there  is  one 
methanol  hydrogen  atom  at  ^2.9  A  and  that  for 
one  hydrogen  ETG  atom  there  is  one  methanol 
oxygen  at  '^2.0  A.  In  Figure  7,  a  competition 
between  CH2— 0(M)  and  O— CHjCM)  correlations 
for  the  gGg'  conformer  is  illustrated  by  the  cross¬ 
ing  of  the  coordination  numbers  at  '^33  and 
-3.9  A. 

The  formation  of  plateaux  in  the  average  coordi¬ 
nation  curves,  as  a  function  of  the  distances  be¬ 
tween  the  sites,  illustrates  the  definition  of  the  first 
coordination  shell.  This  effect  is  particularly  im¬ 
portant  in  the  O — H  correlations  and  it  is  related 
to  the  value  of  the  minima  in  the  site~site  distribu¬ 
tion  functions.  Figure  8  shows  the  coordination 
numbers  for  the  tTt  conformer.  The  correlations 
between  O — H(M)  (dashed  curve)  for  this  con¬ 
former  are  clearly  more  important  than  for  the  two 
other  conformers  (Figs.  6  and  7).  This  suggests,  in 
agreement  with  the  data  for  thermal  properties, 
that  hydrogen  bonding  plays  a  very  important  role 
to  stabilize  the  ETG  al\-trans  conformer  in  liquid 
methanol. 


Conclusions 

The  modifications  relative  to  the  gas  phase  of 
several  molecular  properties  of  ETG  when  it  is 
solvated  in  liquid  methanol  were  analyzed.  DFT 


calculations  for  different  conformers  of  ETG  in  the 
gas  and  in  a  dielectric  medium  modeling  liquid 
methanol  were  reported.  The  calculations  were 
performed  with  different  representations  for  the 
exchange- correlation  functionals.  The  condensed- 
phase  calculations  based  on  the  DFT/SCRF  ap¬ 
proach  predict  that  for  ETG  solvated  in  a  dielectric 
there  is  a  small  stabilization  of  tGg'  relative  to 

Monte  Carlo  simulations  of  ETG  in  liquid 
methanol  showed  that  tGg'  and  gGg'  have  similar 
interactions  with  liquid  methanol  and  that  a  very 
small  reorganization  of  the  solvent  is  observed. 
However,  the  simulations  also  predict,  in  agree¬ 
ment  with  experiment  data,  an  important  stabiliza¬ 
tion,  relative  to  the  gas-phase  equilibrium,  of  the 
al\-trans  conformer.  The  microscopic  mechanisms 
leading  to  this  stabilization  were  analyzed  through 
the  calculation  of  site-site  correlation  functions 
and  were  related  to  the  specific  role  played  by  the 
aW-trans  ETG  conformer  as  a  proton  acceptor  in 
liquid  methanol. 
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ABSTRACT  _ _ 

For  formaldehyde,  the  C  —  O  stretch  potential  of  ^77,77*)  crosses  all  Rydberg 
potentials,  such  as  n,3p^,  n,  3d^^,  etc.,  thereby  transferring  the  intensity  of  the  tmassigned 
X77,  77*)  ^  X  system  to  these  Rydberg  states.  For  thioformaldehyde,  the  situation  is 
similar  but  a  shift  in  the  potentials  allows  for  direct  observation  of  (77,77*).  In  its 
^77, 77*)  state,  H2CO  is  planar,  having  a  low  barrier  of  about  0.2  eV  toward  the 
nonplanar  \cr,  77*)  state.  For  FI2CS,  the  planar  conformation  of  Htt,  77*)  is  a  saddle 
point,  with  ^(a,  77*)  being  the  global  minimum  on  the  2^A'  surface.  The  triplet  77,  77* 
states  of  H2CO  and  H2CS  are  nonplanar,  having  inversion  barriers  of  0.1  and  0.05  eV, 
respectively.  For  both  H2CO  and  H2CS,  the  77,  77*  configuration  also  crosses  the 
ground-state  configuration,  which  explains  predissociation  and  radiationless  transitions 
of  some  Rydberg  states.  ©  1996  John  Wiley  &  Sons,  Inc. 


Introduction 

In  the  absorption  spectrum  of  formaldehyde, 
the  electric  dipole  forbidden  transition 
^A2(n,  77*)  is  observed  and  well  studied, 

whereas  the  allowed  transition  ^77,77*)  ^  X^A^, 
despite  being  predicted  to  be  very  intense,  has  so 
far  escaped  all  attempts  at  detection.  The  problem 
of  the  '"missing"  ^77,  77*)  state  also  plagues  many 
other  aldehyde  and  ketones.  Consequently,  Robin, 
a  prominent  reviewer  in  the  field  of  higher  excited 
states,  wrote  that  "aldehydes  and  ketones  are  a 
most  perplexing  chromophoric  group"  [11]. 


For  a  long  time,  it  was  believed  that  ^(77,77*) 
had  a  vertical  excitation  energy  in  excess  of  the 
first  ionization  potential  (IP)  of  10.8887  eV  [2]  and, 
therefore,  is  not  observed.  For  example,  Langhoff 
et  al.  [3]  predicted  ^77,  77*)  to  be  near  11.2  eV. 

The  absorption  spectrum  in  the  vacuum  ultravi¬ 
olet  (VUV)  region  of  formaldehyde  was  first  stud¬ 
ied  by  Price  [4].  Currently,  all  medium  and  strong 
bands  in  the  VUV  spectrum  of  H2CO  are  assigned 
to  Rydberg  transitions.  The  latest  assignments  by 
Brint  et  al.  [5]  identified  n  ^  s,  p,  rf, /-Rydberg 
transitions  up  to  5s,  12  p,  12  d,  and  9/,  converging 
to  the  first  IP.  However,  in  many  cases,  the  intensi¬ 
ties  or  quantum  defects  do  not  conform  to  expecta¬ 
tions  for  Rydberg  transitions.  For  example,  the 
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intensity  of  n,4p  is  higher  than  that  of  n,3p,  and 
the  quantum  defect  of  n,3d  is  0.4,  whereas  it  is 
expected  to  be  <0.1.  Further,  for  n  >  4,  the  np, 
nd,  and  nf  transitions  have  similar  intensities,  con¬ 
trary  to  their  normal  behavior  [1,  5].  Although  the 
source  of  the  Rydberg  anomalies  is  usually  at¬ 
tributed  to  Rydberg-valence  interactions,  the  na¬ 
ture  of  the  perturbing  valence  state  has  never  been 
positively  identified. 

In  contrast  to  the  missing  singlet  tt  ^  tt*  sys¬ 
tem,  the  triplet  tt  ^  tt*  system  of  H2CO  has  been 
observed  by  electron  impact  studies  ([6]  and  refer¬ 
ences  therein).  It  should  also  be  noted  that  the 
absorption  spectrum  of  thioformaldehyde,  whose 
electronic  spectrum  is  expected  to  be  similar  to 
that  of  formaldehyde,  shows  a  progression  of 
bands,  lying  between  n  ^  4s  and  n  ^  4p,  that 
were  assigned  to  ^(tt,  77*). 

Obviously,  it  is  of  significant  interest  to  investi¬ 
gate  theoretically  the  reasons  for  the  strange  be¬ 
havior  of  the  77,  77*  state.  In  a  series  of  multirefer¬ 
ence  (MR)  configuration  interaction  (Cl)  studies 
performed  over  the  last  few  years,  potential  en¬ 
ergy  curves  for  the  C  —  O  stretch  and  the  out-of- 
plane  motion  were  obtained  for  H2CO  [7-11]  and 
H2CS  [12-15].  Using  more  extended  basis  sets  and 
large  Cl  expansions,  the  vertical  excitation  energy 
of  the  ^77,  77*)  state  of  H2CO  is  calculated  to  be 
9.6  eV  [8].  It  is  important  to  include  the  doubly 
excited  configuration  77*^  in  the  set  of  refer¬ 
ence  configurations.  From  the  calculated  potential 
energies,  vibronic  energy  levels  and  oscillator 
strengths  were  evaluated  [8,  13].  The  results,  as 
they  concern  the  77, 77*  state  of  H2CO  and  H2CS, 
are  most  interesting  and,  in  our  opinion,  explain 
why  ^(77,  77*)  had  previously  not  been  assigned  in 
the  absorption  spectrum  of  H2CO. 


Methods 

Ab  initio  MR-CI  calculations  were  performed 
using  the  MRD-CI  programs  [16].  A  (10s6p/5s4p) 
basis  set  for  C  and  O,  (12s9p/5s4p)  for  S,  with 
additional  polarization  functions  and  s,  p,  d 
Rydberg  functions  on  C  and  O  (C  and  S  for  H2CS) 
is  used.  The  hydrogen  basis  is  (5s/3s)  with  a  p 
polarization  fimction.  Details  are  given  in  [8,  10, 
13, 14].  In  their  ground  states,  H2CO  and  H2CS  are 
planar,  having  C2~,,  symmetry.  For  the  out-of-plane 
motion,  symmetry  was  observed.  The  number 
of  reference  configurations  varied  between  40  and 


60.  They  were  selected  such  that  all  symmetry- 
adapted  functions  (SAF)  with  >  0.001  in  the 
final  wave  function  were  part  of  the  reference 
space,  for  the  whole  range  of  geometries  covered 
by  the  calculations. 


The  Role  of  ^  71,  tt*  )  in  the  Absorption 
Spectrum  of  II2CO 

In  Figure  1,  the  potential  curves  for  the 
C  —  O  stretch  of  H2CO  are  shown.  They  are  simi¬ 
lar  to  the  ones  obtained  in  [8],  but  a  lower  configu- 
'ration  selection  threshold  (4  /xHartrees)  was  used. 
The  other  geometry  parameters  were  held  at  ex¬ 
perimental  ground  state  (GS)  values  (R^h  ^  2.0796 
flg  and  (/)(HCH)  =  116.3°  [17]).  It  is  seen  that  dia- 
batically  the  ^(77,77*)  state  crosses  all  Rydberg 
potentials,  also  the  higher  ones  not  shown  here. 
Even  the  apparent  long-distance  minimum  of 
(77,  77*),  at  2.914  Uq  is  due  to  an  avoided  crossing 
of  77, 77*  with  the  GS  configuration  At  3  the 

wave  function  contains  about  45%  and 

45%  77,77*,  whereas  2^A^  has  40%  and  45% 
77,  77*  (see  Fig.  2).  (The  percentage  contribution 
was  calculated  from  cf,  where  is  the  coefficient 
of  the  respective  SAF  in  the  Cl  wave  function.  In 
the  Cl  expansion,  77*)  MOs  were  used.  The 
given  numbers  change  slightly,  but  not  greatly, 
with  the  choice  of  MOs.)  At  the  largest  CO  dis¬ 
tance  shown  in  Figure  2,  4  a^,  the  eventual  dissoci¬ 
ation  to  CH2  +  O  is  not  yet  important.  Open-shell 
configurations  to  describe  the  dissociation  become 
dominant  at  larger  distances  than  shown.  The  re¬ 
pulsive  crossing  of  \ 77,  77*)  with  all  ^A^  Rydberg 
states  and  the  ground  state  leads  to  predissociation 
and  radiationless  transitions.  According  to  Figure 
1,  the  doubly  excited  77*^)  state  appears  at 
about  10.2  eV,  having  a  fairly  shallow  potential, 
and  crossing  77,  77*  at  small  R  and  higher  Rydberg 
states  at  larger  R. 

Figure  3  shows  an  enlarged  version  of  the 
C  —  O  stretch  curves  for  2M^  to  4^^,  including 
vibrational  energy  levels  for  each  of  the  minima.  It 
should  be  noted  that  all  minima,  except  for  n,3p^ 
and  n,  3rfy-,  are  caused  by  avoided  crossings  with 
77,  77*.  These  potential  wells  are  bounded  by  77,  77* 
on  the  left,  and  by  a  Rydberg  configuration  on  the 
right.  A  notation  such  as  77,  77*/n,3py,  used  here 
to  describe  such  potentials,  should  be  obvious. 

Assuming  that  all  states  are  planar,  and  that 
the  C  —  O  stretch  vibrations  can  be  approximated 
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FIGURE  1.  C  —  O  stretch  potentials  for  ’A,  states  of 
HjCO. 


by  the  one-dimensional  potentials  given  in  Figure 
3,  the  vibronic  oscillator  strengths  were  evaluated 
for  each  vibrational  level,  by  first  calculating  the 
electric  dipole  transition  moment  as  a  function  of 
Rco  and  then  combining  it  with  the  Franck-Con- 
don  (FC)  factor. 

In  Table  I,  experimental  values  for  energies  and 
oscillator  strengths  are  compared  with  calculated 
vertical  values  and  with  vibronic  values  evaluated 
for  each  vibrational  level,  as  described  above.  For 
the  0-0  bands  of  the  n  3s  and  n  3p  transi¬ 
tions,  the  vertical  numbers  agree  reasonably  well 
with  the  experimental  values.  However,  for  the 
0-0  bands  of  n  ^  3dy2  (8.88  eV),  n,4s  (9.26  eV), 
and  higher  states,  the  calculated  vertical  oscillator 
strengths  are  too  small.  Also,  the  0-1  and  0-2 
bands  of  Rydberg  states  have  very  low 
Franck-Condon  factors,  and  the  observed  bands  at 
8.32,  9.04,  and  9.18  eV  could  not  be  explained  on 
the  basis  of  vertical  oscillator  strengths. 

In  the  last  three  columns  of  Table  I,  the  calcu¬ 
lated  vibronic  values  are  given.  It  is  seen  that 
vibronic  energies  can  be  matched  with  the  energies 
of  the  observed  bands  to  a  maximum  deviation  of 
0.24  eV,  but  mostly  lying  within  0.05  eV.  Larger 
deviations  may  be  indicative  of  the  need  for  a 
non-Bom-Oppenheimer  treatment. 

Except  for  the  three  lowest  bands,  the  literature 
assignments,  as  given  in  Table  I,  had  to  be  changed 


R(CO)  distance  (a„) 

FIGURE  2.  Contribution  of  various  configurations  to  the  wave  functions  lA,  to  S'A,  of  HgCO  using  ^(n,  tt*)  molecular 
orbitals. 
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FIGURE  3.  C  —  O  stretch  potentials  for  to 
states  of  H2CO,  showing  vibrational  energy  as  used  in 
the  calculations.  Higher  Rydberg  potentials  were 
estimated,  and  are  shown  as  dashed  lines. 


to  accommodate  the  new  states  that  involve  mixed 
TT,  7r*/Rydberg  potentials.  For  example,  the  peak 
observed  at  9.04  eV  was  originally  assigned  to  the 
3j  band  of  where  is  the  symmetric  CH2 

bending  vibration.  By  itself,  n,3d^,  has  low  inten¬ 
sity.  This  band  is  now  associated  with  2^  of 
77,  7r*/n,3py,  where  is  the  C  —  O  stretch  vi¬ 
bration.  It  clearly  derives  its  intensity  =  0.024, 
/obs  0.022)  from  the  tt,  tt*  section  of  the  poten¬ 
tial. 

Most  of  the  calculated  oscillator  strengths  are  in 
surprisingly  good  agreement  with  the  observed 
values.  Exceptions  are  the  Oq  band  of  n,3s,  for 
which  vibronic  interaction  with  cr,  7r*/7r,  tt*  may 
play  a  role,  as  outlined  in  [7].  The  small  calculated 
/-value  for  the  8.32  eV  band  may  be  due  to  the  fact 
that  both  the  v  =  3  level  of  tt,  7r*/n^  and  the 
V  =  1  level  of  n,  3  lie  close  to  the  barrier  in  the 
2^  A I  potential,  which  was  not  taken  into  account 
in  the  calculations.  The  third  exception  is  the  peak 
at  9.26  eV,  for  which  is  too  large.  It  is  pre¬ 
sumed  that  the  rapidly  increasing  density  of  states 
in  this  high-energy  region  may  allow  for  other 
possible  interpretations  of  the  observed  bands. 

Overall,  the  results  of  Table  I  show  that  the 
77,  TT*  state  is  the  major  intensity  provider  for 
transitions  to  Rydberg  states  higher  than  n,3p. 
Since  intensities  of  Rydberg  transitions  normally 
decrease  with  increasing  quantum  number  (as 
[3]),  the  contribution  of  77, 77*  to  Rydberg  transi¬ 
tions  beyond  the  ones  covered  here,  such  as  n,  4rf 
and  n,5s,  etc.,  should  be  even  more  significant. 


TABLE  I _ _ _ 

Comparison  of  observed  values  for  energies  AE  (in  eV)  and  oscillator  strengths  f  with  calculated 
vertical  and  adiabatic  values  for  bands  of  H2CO  lying  above  7  eV.® 


Experimental 

Vertical 

Vibronic 

AE 

f 

Assignment^ 

^^vert 

fvert 

AE 

'0 

Assignment^ 

7.09 

0.033 

Oq  n,3s 

7.15 

0.005 

7.09 

0.005 

0°  n,3s 

7.97 

0.018 

Og  n,3p^ 

8.05 

0.021 

7.94 

0.021 

0°  n,3p^ 

8.12 

0.035 

OS  n,3py 

8.10 

0.039 

7.98 

0.035 

OS  n,  3py 

8.32 

0.005 

2i  n,3py 

— 

— 

8.18 

5  X  10"^ 

2l  TT,  TT*  1  +  2I  n,3p„ 

8.88 

0.014 

OS  n,  3dy^ 

9.25 

0.005 

8.64 

0.012 

Og  TT,iT*  ln,3py 

9.04 

0.022 

3ln,3dy, 

— 

— 

8.90 

0.024 

2l  IT,  TT*  /n,  3py 

9.18 

-  0.008 

3q  n,  3dy^ 

— 

— 

9.16 

0.013 

2g  tt/tt*  / n,  3Py  +  Oq  n,  3dy2 

9.26 

-  0.001 

Oq  n,  4s 

9.19 

6  X  IO-® 

9.27 

0.049 

Og  TT,  TT*ln,3dy, 

9.63 

0.033 

0§  n,  4py 

9.41 

9  X  10-5 

9.63 

0.027 

2g  TT,  TT*  In,  3dy^ 

ai-v_x_ 

0°  n,  4p^ 

9.47 

9x10'* 

^Data  taken  from  [8,  11]  and  experimental  references  therein,  except  for  vertical  oscillator  strengths. 
Literature  assignment  where  P2  's  CO  stretch  and  1^3  is  CH2  symmetric  bend. 

""New  assignments  based  in  part  on  tt,  tt*  /  Rydberg  potentials. 
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77,  77 


explaining  the  relatively  high  intensity  observed 
for  higher  Rydberg  states. 

For  other  vibrational  modes,  the  changes  in  ge¬ 
ometry  between  the  GS  and  excited  Rydberg  states 
are  usually  small  (as  tested  in  full  geometry  opti¬ 
mizations  [10]),  and,  therefore,  the  0-0  transition  is 
expected  to  contain  most  of  their  intensity.  Com¬ 
plications  due  to  17,  TT*,  as  encountered  for  the 
C  —  O  stretch,  are  not  likely. 

Other  symmetry  species,  such  as 
(i2, 3d-j^y) . . . )  and  62(71, 3s,  n,  3p^/  ti,  3d ^2 _ yi/ 
n,3d^i;...)  are  not  affected  by  direct  coupling 
with  77,  77* .  Therefore,  C  —  O  stretch  bands  of 
higher  Rydberg  states  not  belonging  to  the 
symmetry  species  carmot  receive  intensity  from 
77,  77*  directly  and  should  therefore  have  much 
lower  oscillator  strengths  than  those  of  the  Aj 
transitions. 

This  is  borne  out  by  the  vertical  oscillator 
strengths  given  in  Table  I  for  n, 4s  and  «,4p 
(~  10“^).  The  calculated  (vertical)  oscillator 
strengths  for  n,3d^2_yi,  n,3d^i,  and  n,3dj.y  are 
0.001,  0.005,  and  3  X  10“^,  respectively.  Using 
values  of  9.12,  9.06,  and  9.30  eV  [8],  respectively, 
these  3d-Rydberg  transitions  may  be  hidden  in  the 
more  intense  bands  observed  at  9.04,  9.18,  and  9.26 
eV. 


The  Geometry  of  the  H  it,  tt*  )  State  of 
H2CO 

In  Figure  4,  the  out-of-plane  potentials  are 
shown  for  the  2M'  and  3'A'  states  of  H2CO  at 
R(~q  =  2.914  Oq,  corresponding  to  the  77,  Tr*/n^ 
minimum  of  2Mj  (Fig.  1).  R^h  and  (^(HCH)  were 
kept  at  the  GS  values.  It  is  seen  that  ir,  TT*/n^,  at 
that  distance  about  an  equal  mix  of  ir,  ir*  and  n^, 
is  planar,  although  there  is  only  a  small  barrier  of 
about  0.2  eV  toward  the  nonplanar  u,  ir*.  In  all 
out-of-plane  conformations,  tt,  77*  mixes  with 
<T,  77*,  and  for  6  =  42.1°  and  R(-o  =  2.89  Ug  (the 
minimum  of  cr,77*),  the  2^A'  wave  function  is 
44%  a,  77*,  32%  77, 77*,  and  11% 

Contour  plots  of  2M'  are  shown  in  Figure  5,  and 
3-dimensional  plots,  in  Figure  6.  The  local  77,  77* 
minimum  is  confined  to  a  small  range  of  coordi¬ 
nates,  whereas  n,3py  is  more  extended;  in  com¬ 
parison,  the  global  a,  tt*  minimum  covers  a  wide 
range  of  geometries.  Optimized  geometries  and 
adiabatic  excitation  energies  for  ^ir,  77*),  (cr,  77*), 
and  '(«,  3py),  all  lying  on  the  2^ A  surface,  are 


FIGURE  4.  Out-of-plane  potentials  for  2M'  and  3M'  of 
HgCO,  at  Rco  =  2.914  Bq. 


given  in  Table  II.  The  cr,  77*  minimum  lies  about 
0.35  eV  below  the  tt,  tt*  minimum. 

The  low  barrier  of  tt,  tt*  toward  cr,  tt*  implies 
that  the  higher  vibrational  levels  of  the  ir,  77*/n^ 
minimum  in  Figure  3  are  not  bound  by  tt,  tt* /ri^ 
but  only  by  cr,  ir*.  Therefore,  for  such  higher  lev¬ 
els,  the  C  —  O  mode  may  mix  with  the  out-of¬ 
plane  mode. 


The  '( TT,  TT* )  Stale  of  H2CS 

In  Figure  7,  the  C  —  S  stretch  potentials  of  the 
'Ai  states  of  H2CS  are  given.  The  Rch  distance 
and  angles  were  kept  at  the  experimental 

values  of  2.0645  Uq  and  116.87°  [18].  As  in  the  case 
of  H2CO,  77,  77*  crosses  all  Rydberg  states.  How¬ 
ever,  there  is  an  important  difference.  The  crossing 
of  the  lowest  A^  Rydberg  state  «,4py  occurs  near 
its  minimum  rather  than  at  larger  R,  as  was  the 
case  in  H2CO.  Therefore,  the  potential  well  for 
n,4py  has  effectively  disappeared.  The  left-side 
turning  points  for  the  vibrational  levels  of  the 
77,  77*  potential  move  closer  to  the  GS  equilibrium 
distance,  making  the  Franck-Condon  factors  for 
the  electronic  transition  to  77,  tt*  more  favorable. 

As  for  H2CO,  the  2Mj  wave  function  has  an 
increasing  contribution  from  as  R^s  increases. 
At  3.85  ag,  2M1  is  37%  tt,  tt*  and  42%  n^,  whereas 
iMi  is  52%  77, 17*  and  36%  n^.  The  participation 
of  the  major  configurations  in  the  wave  function  is 
very  similar  to  the  case  of  H2CO  which  was  shown 
in  Figure  2.  Figure  7  also  indicates  that  the  doubly 
excited  state  ir*^  has  dropped  below  n,3y^. 
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TABLE  II _ _ _ 

Calculated  and  observed  bands  for  HjCO  and  optimized  geometries  for  selected  states  of  HjCO.® 


AE 

calcd.  (exptl.) 

^^vert 

fV' 

'0 

calcd.  (exptl.) 

^vert 

Assignment® 

7.09  (7.09) 

7.15  {7.30} 

0.005  (0.033) 

0.005  {0.006} 

Oq  n,  3s 

7.94  (7.97) 

8.05  {8.09} 

0.021  (0.018) 

0.021  {0.023} 

Og  n,3p. 

7.98  (8.12) 

8.10  {8.12} 

0.035  (0.035) 

0.039  {0.041 } 

Og  n,3py 

8.15  (8.37) 

8.32  {8.32} 

-(-) 

— 

Og  n,3p. 

8.18  (8.32) 

—  {7.30} 

5  X  10  ®  (5  X  10  ®) 

— 

2g  n,  3py 

25  TT,Tr*ln^  +  2l 

n,3py 

8.64  (8.88) 

0.012  (0.014) 

Og  n,3dy^  -y 

Og  TT,TT*  ln,3py 

8.90  (9.04) 

— 

0.024  (0.022) 

— 

3  0  r? ,  3  ciy2 

2q  77,  77*/n,3py 

9.16  (9.18) 

0.013  (~  0.008) 

3g  n,  3dy^ 

2o  77,  77*  / n,  3py  +  Oq  n,  3dy^ 

9.18 

9.25  {9.24} 

5  X  10  5  {3  X  10-5} 

—  n,3dy. 

9.10 

9.80  {9.13} 

1  X  10-5  {6  X  10-5} 

—  n,  3dy2  _  y2 

9.06 

9.37  {9.31 } 

5  X  10-5  {1  X  10-5} 

—  n,3d^2 

9.17 

9.40  {9.31 } 

— 

—  n,3dxz 

9.30 

9.34  {9.23} 

3  X  10-5  {3  X  lO"'*} 

—  n,3dyy 

9.19 

9  X  10-5 

—  n,4s 

9.27  (9.26) 

0.049  (~  10-5) 

Oq  n,  4s 

og  Tr,Tr*  In,  3dy, 

9.41 

9  X  10"5 

—  n,Apy 

9.47 

5  X  10“'' 

—  n,4p. 

9.63  (9.63) 

0.027  (0.033) 

Og  n,4p  ^ 

2o  77, 77*/n,3dy2 

Optimized  geometries  and  for  selected  states  of  H2CO^ 

Re(CO) 

0(OOP)  (^.(HCH) 

Re(CH) 

Te 

State 

Configuration 

(ao) 

(degrees)  (degrees) 

(ao) 

(eV) 

2^1 

77, 77* 

2.914 

0.0  120.8 

2.113 

7.95 

2^' 

cr,  77* 

2.890 

42,1  115.1 

2.040 

7.59 

2M, 

n,3Py 

2.266 

0.0  123.3 

— 

7.89 

I^A 

77  ,  77* 

2.789 

38.1  119.9 

2.032 

4.43 

2^  A 

cr,  77* 

2.750 

35.5  133.5 

2.060 

7.15 

n,3py 

2.256 

0.0  — 

— 

-  7.89 

%2  is  CO  stretch  and  is  CH2  symmetric  bend. 

■^Optimized  geometries  obtained  from  [10]  for  the  singlets  and  [9]  for  the  triplets. 


interfering  with  the  77,77*  and  potentials 

at  smaller  R  and  with  n,4cp^  at  larger  R. 

In  Figure  8,  the  out-of-plane  potential  is  shown 
for  the  state  of  H2CS,  at  both  R^s  =  3.436  Uq 
and  Res  =  3.850  Uq,  The  latter  distance  corre¬ 
sponds  to  the  €2^  minimum  of  77,  77*.  At  3.436  Uq, 
77, 77*  has  a  minimum  with  a  low  barrier  toward 
(7,77*.  However,  at  the  77,77*  equilibrium  dis¬ 
tance  of  3.850  Uq,  2^ A'  has  a  slight  maximum  at 
6  =  0°.  While  77,  77*  may  still  be  called  planar  (the 


diabatic  77,  77*  potential  is  indicated  by  the  dashed 
curve),  the  planar  minimum  is  a  saddle  point  in 
the  3-dimensional  potential  surface  including  the 
out-of-plane  angle,  rather  than  a  local  minimum. 

In  Table  III,  the  optimized  geometries  of  the 
^77,  77*),  H(7,  77*),  and  ^(n,4py)  states,  all  lying  on 
the  2^A'  surface,  are  given,  together  with  val¬ 
ues.  It  should  be  noted  that  the  minimum  of  cr,  77* 
lies  only  0,07  eV  below  that  of  77,77*.  Therefore, 
77,77*  (and  (7,77*)  have  a  shallow  potential  both 
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FIGURE  7.  C  —  S  stretch  potentials  for  states  of 
H^CS. 


for  the  C  —  S  stretch  and  the  out-of-plane  angle, 
constituting  a  shallow  potential  over  two  dimen¬ 
sions  rather  than  just  one.  The  implications  for 
spectroscopy  should  be  rather  interesting. 

As  mentioned  in  the  Introduction,  the  77,  tt* 
spectrum  of  H2CS  is  known,  and  the  observed 
bands  have  been  interpreted  as  belonging  to  the 
C  —  S  stretch  mode  v^.  For  an  out-of-plane  angle 
of  36.2°,  corresponding  to  the  optimized  value  of 
2M',  the  C  —  S  potential  is  virtually  parallel  to 
the  C  —  S  potential  for  planar  H2CS  (see  dashed 
curve  in  Fig.  9).  Therefore,  the  C  —  S  stretch  vi¬ 
brational  energies  and  vibronic  oscillator  strengths 
calculated  in  [13]  for  the  supposedly  planar  77,77* 
state,  and  reproduced  in  Table  IV,  retain  some 
validity,  although  all  energies  should  be  corrected 
by  the  out-of-plane  stabilization  energy. 

If  the  calculated  energies  are  matched  with 
nearest  experimental  values,  as  done  in  Table  IV, 
then  the  problem  arises  that  calculated  oscillator 
strengths  for  the  low-lying  bands  are  too  small  to 
explain  their  observation.  Therefore,  a  shift  of  the 
calculated  energies  toward  the  reported  band  ori¬ 
gin  at  5.54  eV  is  necessary.  Also,  a  case  has  been 


out-of-plane  angle  ( 0  ) 


FIGURE  8.  Out-of-plane  potentials  for  2^A  state  of 
HgCS,  at  Res  =  3.436  and  at  3.850  Sq. 

made  in  [13]  that  hot  bands,  starting  from  v"  =  1 
of  the  GS,  might  play  a  role.  Since  our  calculated 
vibrational  frequency  of  473  cm”^  lies  near  the 
experimental  value  of  476  cm“^,  it  should  be  pos¬ 
sible,  by  proper  choice  of  the  origin  and  inclusion 
of  hot  bands,  to  match  the  experimentally  ob¬ 
served  spectrum  quite  closely. 

TABLE  III _ 


Optimized  geometries  and  for  selected 
states  of  H2CS.^’^ 


Re(CO) 

6»(OOP) 

<^(HCH) 

Te 

State 

Configuration 

(ao) 

(degrees) 

(degrees) 

(eV) 

77  ,  77* 

3.850 

0.0 

118.5 

5.85 

2^A' 

a,  77* 

3.672 

36.2 

112.5 

5.78 

2^1 

n,4Py 

3.155 

0.0 

-  122 

6.63 

^^A' 

77, 77* 

3.483 

28 

119.8 

2.70 

2^A' 

(7,  77* 

3.553 

56 

131.5 

5.52 

23/Ai 

nAPy 

3.024 

0.0 

120.9 

6.49 

^Optimized  geometries  obtained  from  [14]  for  the  singlets 
and  [15]  for  the  triplets. 

^R(ch)  bond  length  was  held  constant  at  the  GS  equilibrium 
distance  of  2.076  Bq. 
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R(C-S)[ao] 

FIGURE  9.  C  ~  S  stretch  potentials  of  HgCS  showing 
that  the  potential  of  cr,  77*/77,  tt*  at  6  -  36.2°  (dashed 
curve)  is  nearly  parallel  to  the  tt,  77*  /  potential  at 

e  =  0.0°. 

In  the  same  article  [13],  calculated  vibrational 
energies  and  oscillator  strengths  are  given  both  for 
the  77,  77*/n,4py  adiabatic  potential,  similar  to  the 
77, 77*/Rydberg  states  discussed  earlier  for  H2CO, 
and  for  the  diabatic  n^Py  potential.  In  both  cases, 
the  calculated  energy  for  the  0"0  band  (6.65  eV 
adiabatic,  6.56  eV  diabatic)  agrees  well  with  the 
observed  origin  of  the  n  ^  system  at  6.60  eV 
[20].  However,  in  the  adiabatic  case,  two  addi- 

TABLE  IV _ _ _ 

Vibronic  transition  energies  AEJ  (in  eV)  and 

osciilator  lengths  for  the  (71,71*)  X 
system  in  HgCS. 


AES' 

•0 

v' 

calcd.^ 

exptl.^ 

calcd.® 

0 

5.821 

— 

4  X  10'^ 

1 

5.879 

5.876 

4  X  10  ® 

2 

5.949 

5.927 

2  X  10“® 

3 

6.006 

5.981 

7  X  10  ® 

4 

6.065 

6.036 

2  X  lO^'* 

5 

6.122 

6.111 

5  X  10“'' 

6 

6.177 

6.138 

9  X  10“'' 

7 

6.231 

6.193 

0.001 

8 

6.282 

6.243 

0.002 

9 

6.331 

6.297 

0.003 

^Values  from  [13],  assuming  planar  geometry. 

^Estimated  energies  of  band  maxima  from  the  spectrum  of 
Moule  et  al.  [19]. 


tional  strong  bands,  at  6.82  OJ)  and  6.99  eV  (Sq), 
are  predicted,  whereas  in  the  diabatic  case,  higher 
bands  have  very  low  oscillator  strengths.  Unfortu¬ 
nately,  the  spectrum  above  6.6  eV  is  poorly  re¬ 
solved,  and,  therefore,  it  cannot  yet  be  determined 
which  potential,  diabatic  or  adiabatic,  approaches 
reality  more  closely.  Our  experience  with  H2CO 
would  give  a  preference  to  the  adiabatic  results. 

For  H2CS,  spectra  resulting  from  higher 
77,  77* /Rydberg  potentials  have  not  been  investi¬ 
gated.  A  complication  arises  here  due  to  the  dou¬ 
bly  excited  77*^  state,  whose  C  —  S  potential 
lies,  as  was  mentioned  earlier,  between  the  two 
lowest  Rydberg  states  and  crosses  them  as  well 
as  77, 77*. 


The  Triplet  tt,  tt*  States  of  H2CO 
and  H2CS 

In  Figure  10,  the  potential  curves  for  the 
C  —  O  stretch  of  H2CO  are  shown.  It  is  seen  that 
the  ^(77,  77*)  potential  lies  well  below  the  Rydberg 


FIGURE  10,  C  —  O  stretch  potentials  of  H2CS  for 
states  of  HgCO,  showing  the  location  of  77,  tt*  relative  to 
n,3Py. 


INTERNATIONAL  JOURNAL  OF  QUANTUM  CHEMISTRY 


1669 


GREIN  AND  HACHEY 


states,  and  the  triplet  tt  ^  tt*  system  should 
therefore  be  free  of  the  Rydberg  mixing  which 
plagues  its  singlet  counterpart.  An  electron-impact 
study  by  Taylor  et  al.  [6]  provided  vibronic  ener¬ 
gies  and  intensities  with  which  our  calculated  en¬ 
ergies  and  Franck-Condon  factors  (based  on  the 
potential  from  Fig.  10)  agree  closely. 

Contrary  to  the  singlet  tt,  tt*  state,  ^(tt,  tt*)  is 
nonplanar,  as  shown  in  the  out-of-plane  potentials 
in  Figure  11.  The  calculated  inversion  barrier  is  0.1 
eV,  The  reason  for  the  singlet  being  planar,  and  the 
triplet  nonplanar,  arises  from  the  mixing  of  77,77* 
with  the  closed-shell  configuration  in  the  sin¬ 
glet  case,  which  is  not  possible  for  the  triplet. 
Obviously,  prefers  planarity,  whereas  77,  tt*,  in 
conformity  with  the  Mulliken -Walsh  rules,  prefers 
a  nonplanar  conformation.  The  optimized  geome¬ 
tries  and  T/s  of  ^(77,77*),  ^(c7-,77*),  and  ^(n,3p^) 
are  included  in  Table  II.  In  contrast  to  the  singlet 
case,  cr,77*  lies  well  above  77,77*,  and,  therefore, 
mixing  between  these  configurations  is  negligible. 

The  situation  for  the  triplet  77,  77*  of  H2CS  is 
similar  to  that  of  H2CO.  In  the  planar  case, 
1M|(77,  77*)  lies  much  lower  than  the  Rydberg 
states,  and  ^(77,77*)  is  nonplanar  with  ^  =  28°, 
having  an  inversion  barrier  of  0.05  eV  [14].  Its 
optimized  geometry  and  T^,  as  well  as  values  for 
^(cr,  77*)  and  ^(n,4p^),  are  given  in  Table  III. 


FIGURE  11.  Out-of-plane  potentials  for  ^A'  states  of 
H2CO,  at  Rco  =  2.70  Sq. 


Summary  and  Conclusion 

In  the  absorption  spectrum  of  formaldehyde, 
the  ^77,  77*)  ^  X  system  is  not  seen  directly,  since 
the  C  —  O  stretch  potential  of  77,  77*  is  repulsive 
and  crosses  all  Rydberg  potentials  (as  well  as 
the  ground-state  potential).  As  a  consequence,  a 
series  of  avoided  crossing  minima  are  found  which 
are  bound  on  the  left  (smaller  R^^q)  by  a  section  of 
the  77,77*  potential,  and  on  the  right,  by  a 
Rydberg  potential.  Due  to  the  fact  that  77, 77*  and 
n,  Rydberg  states  differ  by  a  double  excitation, 
their  interaction  is  weak,  as  seen  in  the  small 
energy  difference  between  the  states  at  the  point  of 
the  avoided  crossing,  and  usually  the  configura¬ 
tion  change  is  very  abrupt. 

Due  to  such  mixed  77,  77*/Rydberg  potentials, 
the  high  intensity  of  the  77,  77*  transition  is  trans¬ 
ferred  to  Rydberg  states  within  the  same  symme¬ 
try  species,  making  them  appear  much  more  in¬ 
tense  than  they  ought  to  be,  and  also  shifting  their 
positions  in  unexpected  ways,  resulting  in  unusual 
quantum  defects.  Using  the  C  —  O  stretch  mode, 
all  strong  and  medium  peaks  of  the  absorption 
spectrum  of  H  2  CO  above  8  eV  could,  at  least  in 
principle,  be  explained  in  this  way.  Many  details 
still  need  to  be  sorted  out,  such  as  non-Born- 
Oppenheimer  interactions,  which  should  be  partic¬ 
ularly  important  near  the  avoided  crossings.  The 
further  crossing  of  77,  77*  into  the  GS  may  explain 
the  observed  predissociation  of  Rydberg  bands. 

For  thioformaldehyde,  as  for  H2CO,  the  Htt,  77* ) 
potential  crosses  all  Rydberg  potentials  as  well  as 
the  ground-state  potential.  Since  the  lowest 
Rydberg  state  is  crossed  near  its  minimum  the 
77,  77*  system  has  higher  Franck-Condon  factors 
than  in  the  case  of  H2CO  and  becomes  directly 
observable. 

Complications  are  introduced  by  the  out-of¬ 
plane  motion.  While  ^77,  77*)  of  H2CO  has  a  local 
minimum  for  the  planar  conformation,  there  is 
only  a  low  barrier  of  about  0.2  eV  toward  the 
nonplanar  global  ^(u,  77*)  minimum,  lying  0.35  eV 
below  ^77,  77*).  For  H2CS,  the  ^(77,  77*)  minimum 
that  is  seen  in  the  Rj^s  potential  curves  for  planar 
geometry  becomes  a  saddle  point  when  the  out- 
of-plane  motion  is  also  taken  into  account.  How¬ 
ever,  the  ^(cr,  77*)  minimum  lies  only  about  0.1  eV 
below  the  77, 77*  minimum.  Therefore,  the  ob¬ 
served  "tt,  77*  bands"  of  H2CS  should  better  be 
called  (7,  77*  bands,  according  to  the  predominant 
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configuration  of  the  global  minimum.  Due  to  the 
small  energy  difference  between  tt,  tt*  and  a,  tt*, 
and  the  fact  that  the  C  —  S  potential  at  the  out- 
of-plane  angle  d  =  36.2'',  corresponding  to  the  opti¬ 
mized  value  for  o’,  tt*,  is  nearly  parallel  to  that  at 
6  ^  0°,  the  distinction  between  ^(or,  tt*)  and 
Htt,  TT*)  for  the  C  —  S  stretch  is  perhaps  not  so 
important.  The  very  shallow  potential  of  2^ A  over 
two  internal  coordinates  is  expected  to  introduce 
some  interesting  features  into  the  spectrum. 

For  the  triplet  tt,  tt*  states  of  both  H2CO  and 
H2CS,  the  Rco  (Res)  potential  curves  lie  well 
below  the  Rydberg  potentials.  Therefore,  the 
^(tt,  TT*)  spectra  should  be  observable  and  have, 
indeed,  been  seen  for  H2CO.  For  both  molecules, 
^(tt,  tt*)  is  nonplanar,  with  energy  lowerings  from 
the  planar  to  the  nonplanar  conformation  of  0.17 
eV  for  H2CO,  and  0.07  eV  for  H2CS.  Again,  one 
deals  here  with  fairly  shallow  potentials. 

For  alkylated  carbonyl  compounds,  such  as  ac¬ 
etaldehyde  and  acetone,  as  well  as  for  many  larger 
systems,  the  ^(tt,  tt*)  transition  has  also  not  been 
assigned,  although,  as  in  H2CO,  it  may  have  been 
observed  indirectly.  Recent  calculations  on  acetone 
[21]  give  Rco  potential  curves  very  similar  to 
those  of  H2CO.  On  the  other  hand,  substitutions  of 
hydrogens  by  halogens  raise  the  energy  of 
Rydberg  states  relative  to  tt,  tt*,  due  to  the  stabi¬ 
lization  of  the  Uq  orbital  interacting  with  halogen 
lone  pairs.  According  to  calculations  performed  in 
this  laboratory  [22],  the  ^(tt,  tt*  )  potentials  of  F2CO, 
C/2CO,  and  CI2CS  lie  well  below  the  Rydberg 
potentials. 

In  summary,  the  fact  that  the  ^(tt,  tt*)  state  has 
not  been  observed  directly  in  the  spectrum  of 
formaldehyde  and  alkylated  carbonyl  compounds 
can  be  explained  by  the  accidental  and  somewhat 
unfortunate  positioning  of  the  ^(tt,  tt*)  R^o -poten¬ 
tial  relative  to  the  Rydberg  potentials.  As  a 
consequence,  ^(tt,  tt*)  transfers  its  intensity  to  the 
Rydberg  states,  providing  them  with  imusually 
high  oscillator  strengths  and  also  changing  the 
vibrational  energy  levels. 
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ABSTRACT _ _ _ _ _ _ 

The  SCMEH-MO  method  with  average  relativistic  and  spin-orbit  effects  has  recently 
been  applied  to  study  the  electronic  structure  and  bonding  in  samarium  pentamethyl- 
cyclopentadienyls.  In  this  report  the  same  approach  has  been  utilized  in  studying  the 
electronic  structures  of  Nd  and  Sm  hexacarbonyls.  In  contrast  to  the  stable  transition 
metal  d-block  carbonyls,  these  lanthanide  carbonyls  are  foimd  to  be  quite  unstable.  These 
findings  are  based  on  calculated  electronic  structures  and  bond  energies.  ©  1996  John 
Wiley  &  Sons,  Inc. 


Introduction 

Over  two  decades  ago  the  preparation  of  sev¬ 
eral  lanthanide  carbonyls  were  reported  in 
Ar  matrices  at  10  K  [1,  2].  The  infrared  (IR)  spectra 
implied  the  presence  of  several  Nd(CO)^  species, 
with  Nd(CO)g  as  the  primary  product.  A  couple  of 
years  prior  to  this  the  preparation  of  UfCOlg  was 
reported  under  similar  conditions  [3]. 

While  the  ground-state  electron  configuration  of 
Nd  is  4f*  6s^,  that  of  U  is  5/^  7s^.  Both  have 

six  valence  electrons  to  afford  an  effective  atomic 
number,  EAN  =  18,  counting  two  electrons  from 
each  of  the  six  CO  ligands.  Traditionally  the  EAN 
rule  for  d-block  transition  metals  is  well  estab¬ 
lished  for  monomolecular  carbonyls,  so  that  those 

Intarnational  Journal  of  Quantum  Chamistry:  Quantum  Chamistry 
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not  obeying  the  18  EAN  rule  are  unstable.  But  the 
electrons  in  4/  orbitals  of  lanthanides  are  much 
more  contracted  than  those  of  transition  metal  d 
orbitals,  hence  the  former  are  restricted  from  effec¬ 
tive  orbital  overlap.  This  is  the  reason  that  bonding 
in  lanthanide  compounds  is  invariably  ionic.  Thus, 
the  formation  of  Nd(CO)g  and  any  of  the  other 
lanthanide  carbonyls  is  quite  surprising  in  light  of 
the  fact  that  ionic  bonding  is  precluded  for  such 
molecules. 

Although  the  production  of  Sm(CO)g  was  not 
reported,  it  is  interesting  to  consider  this  molecule, 
since  Sm  having  a  6s^  configuration  would  be 
an  atypical  hexacarbonyl  with  an  EAN  =  20. 

Very  recently,  full  relativistic  DV-Xa  calcula¬ 
tions  were  reported  on  several  MfCOlg  carbonyls 
including  U(CO)g  [4].  It  was  found  that  the  highest 
molecular  orbitals  (MOs)  consisted  of  a  manifold 
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of  relativistic  levels  involving  an  admixture  of  U 
5/  and  6d  orbitals.  But  it  was  not  reported  whether 
these  were  open  or  closed  shells,  or  a  combination 
of  both. 

This  work  reports  calculations  on  the  systems 
LnCCO)^  and  LnCOC)^,  Ln  =  Nd,  Sm,  utilizing  the 
SC~MEH-MO  method  with  average  relativistic  and 
spin-orbit  effects  included  [5].  Since  it  has  been 
indicated  that  the  M-OC  bonding  mode  may  be  a 
preferred  intermediate  in  the  mechanisms  of  some 
carbonyl  insertion  reactions  [6,  7],  thus  the  LnCOC)^ 
ligation  mode  was  included  in  this  study. 

Pertinent  MO  data  obtained  from  these  calcula¬ 
tions  are  presented  in  Table  I-IV.  Calculated  bond 
energies  are  presented  in  Table  V. 


Method  of  Computation 

Details  of  the  SC-MEH-MO  method  have  al¬ 
ready  been  described  extensively  elsewhere  and 
have  been  successfully  applied  for  some  30  plus 
years  [5,  8]. 


The  AO  basis  functions  and  associated  orbital 
quantities  essential  for  calculating  STO  overlaps, 
diagonal  elements  of  the  SCC  Hamiltonian,  elec¬ 
tron  repulsion,  and  spin-orbit  parameters,  were 
obtained  from  the  HF-SCF  quasirelativistic  pro¬ 
gram  provided  by  Klobukowski  [9].  Since  the  SC- 
MEH-MO  method  employs  a  charge-dependent 
quadratic  relation  for  the  SCC  Hamiltonian  and 
the  allowance  for  Ln  negative  charge  would  ap¬ 
pear  to  be  important  [8],  some  reasonable  means 
for  obtaining  the  proper  charge-dependent  orbital 
energies  of  all  valence  and  virtual  Ln“  levels  had 
to  be  derived.  The  difficulty  is  that  our  HF-SCF 
AO  program  does  not  converge  for  Ln~  cases.  A 
density  functional  theory  (DFT)  routine  developed 
by  Perdew  does  calculate  electron  affinities  for 
some  Ln  in  their  5d^  (5d^  for  Gd)  configurations, 
but  not  for  Sm  and  other  lanthanides  higher  than 
Gd  [10].  No  stable  4/”^^  or  6p^  configurations  of 
Ln"  were  computable  in  any  case.  Thus  a  graphi¬ 
cal  extrapolation  procedure  was  applied  to  all  cal¬ 
culated  and  spectroscopic  data  available  for  5s,  5p, 
4/,  5d,  6  s,  and  6p  orbital  energies  in  the  charge 


TABLE  I  _ _ 

Pertinent  MO  parameters  for  NcKCOlg.® 


MO  symmetry 
Type'’ 

Elec. 

Occ.^ 

E 

(eV)'* 

Percent  AO  character  (>  0.5) 

5s 

5p 

4f 

5c/  6s 

6p 

CO 

0 

-5.519 

68.1 

31.9 

Uu 

0 

-9.313 

100 

0 

-9.648 

4.8 

3.6 

91.6 

^2u 

0 

-9.871 

5.8 

94.2 

Uu 

0 

-10.421 

93.2 

1.7 

5.1 

^2u 

0 

-10.559 

84.2 

5.8 

^2u 

0 

- 1 1 .078 

100 

6 

-12.016 

27.8 

72.2 

6 

-12.289 

2.0 

1.9 

20,1 

76.0 

2 

-13.486 

0.7 

15.6 

83.7 

4 

-14.026 

15.6 

84.4 

Uu 

6 

-14.944 

1.1 

98.9 

^2g 

6 

-15.290 

4.2 

95.8 

6 

-24.705 

8.8 

6.4 

84.8 

2 

-28.125 

2.3 

16.4 

81.3 

4 

-29.164 

1.1 

98.9 

6 

-42.873 

16.3 

2.5 

81.2 

2 

-45.153 

1.0 

4.7 

94.3 

^9 

4 

-45.900 

1.3 

98.7 

6 

-53.098 

65.7 

34.3 

2 

-81.454 

89.7 

10.3 

^Populations;  Nd 

“  5s^  5p^ 

4f^°, 

6s“  8,  6p’-8. 

Atom  chargers:  Nd  =  - 

0.421;  C  =  0.303;  O  =  - 

0.233. 

^  O/,  point  group. 

^Two  more  filled  CO  orbitals  of  symmetry  have  been  omitted. 

Includes  average  relativistic  and  spin-orbit  effects  here  and  in  Tables  II  -IV. 
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TABLE  II _ 

Pertinent  MO  parameters  for  NcKOOg-^ _ 

MO  symmetry  Elec.  £  - 

Type  Occ.*'  (eV)  5s 


Percent  AO  character  (>  0.5) 

5p  4f  5d  6s  6p 


^2u 

Uu 

Uu 

^2u 

^2u 

Uu 

^^g 

Uu 

hg 

Uu 

^^9 

^9 

Uu 

^^9 

^9 

Uu 

^19 


1 

3 
2 
6 
2 

4 
6 
6 
6 
2 
4 
6 
2 
4 
6 
2 


-5.801 

-8.108 

-8.116 

-8.127 

-8.240 

-8.284 

-8.429 

-8.992 

-12.040 

1.6 

-12.373 

0.5 

-13.031 

-14.708 

-15.185 

-26.672 

6.1 

-27.520 

0.9 

-28.500 

-39.855 

38.3 

-45.138 

1.7 

-46.189 

-51.634 

52.2 

-82.750 

95.9 

86.5 

4.7 

10.8 

93.2 

94.2 

27.8 

100 

12.5 

9.7 

4.4 

2.2 

3.3 

4.3 

2.1 


^Populations:  Nd  =  55^  °,  5p"  ",  4f' 5d'  ^  6s°  ",  6p0  ".  Atom  charges:  Nd  =  1.84;  O  =  -0.640;  C  -  0.334. 
^  There  are  31  fully  occupied  orbitals  plus  4  unpaired  electrons  in  2  triply  degenerate  orbitals. 


1.7 


11.0 

0.6 


2.4 


CO 


13.5 

95.3 

89.2 
100 

5.1 
5.8 

72.2 

87.4 
87.0 

90.3 

99.4 

95.6 

93.9 

96.9 

96.7 
59.3 
94.0 

97.9 

47.8 

4.1 


sequence  Ln“,  Ln^  Ln^  for  Ln  -  Nd,  Sm.  Mona- 
tonic  trends  between  quadratic  curves  were  as- 
sumed  to  be  transferable  among  like  orbitals. 

Initial  calculations  were  carried  out  on  LnfCO)^ 
for  Ln  =  Nd,  Sm  in  regular  octahedral  geometries, 
for  successive  variations  of  0.03  increments  in  the 
Ln-CO  bond  distance  range  2.30-2.60  A.  Each  of 
these  was  a  completely  separate  calculations  with 
set  atomic  charges.  The  optimized  bond  distances 
was  taken  to  be  that  for  which  a  self-consistency  in 
Lowdin  populations  and  minimum  total  energy 
were  attained.  Convergence  was  impossible  to 
achieve  in  any  other  way,  and  even  with  this 
procedure  it  was  still  a  very  difficult  and  time-con¬ 
suming  problem. 

The  LnfOOg  calculations  were  carried  out  a 
fixed  Ln-O  bond  distances  taken  to  be  some  2-3% 
shorter  than  the  computationally  optimized  Ln-C 
distances.  While  there  are  no  experimental  data 
available  to  confirm  these  per  se,  there  are  some 
data  showing  M— OC  bond  distances  falling  in  a 
range  of  97—98%  of  M— CO  bonds,  for  M  —  U,  Th 
[6,  7].  In  all  pses  the  C-O  bond  distances  were 
fixed  at  1.41  A. 


Energies  of  Ln— CO  and  Ln— OC  bonds  were 
computed  via  a  relationship  successfully  applied 
to  HCo(CO)4  and  a  variety  of  other  molecular 
systems  [8]. 


Results  and  Conclusions 

All  pertinent  MO  parameters  for  these  two  lan¬ 
thanide  hexacarbonyls  in  both  carbonyl  and  ox- 
onyl  bonding  conformations  are  presented  in  Ta¬ 
bles  I-IV.  The  computed  optimized  bond  distances 
and  bond  energies  are  contained  in  Table  V. 

As  noted  in  Table  I,  the  NdfCOlg  highest  occu¬ 
pied  molecular  orbital  (HOMO)  is  a  filled  (2^  level 
lying  1  eV  lower  than  the  fl2«  lowest  unoccupied 
molecular  orbital  (LUMO).  The  HOMO  is  about 
28%  Nd  5d  and  72%  CO  character.  The  LUMO  is 
composed  totally  of  the  ^fxyz  orbital  with  the 
other  4/  levels  only  some  0.4-0.6  eV  above  it. 

The  results  for  NdfOOg  in  Table  II  shows  a 
quite  different  situation.  The  six  Nd  valence  elec¬ 
trons  are  distributed  over  a  filled  ^21/  orbital,  plus 
three  electrons  in  the  (2^,  and  one  in  the  level. 


1675 


INTERNATIONAL  JOURNAL  OF  QUANTUM  CHEMISTRY 


BOUDREAUX 


TABLE  III 


Pertinent  MO  parameters  for  Sm(CO)6.® 


MO  symmetry 
Type 

Elec. 

Occ.^ 

E 

(eV) 

Percent  AO  character  (>  0.5) 

5s 

5p 

4f 

5d  6s 

6p 

CO 

Uu 

0 

-9.186 

100 

Uu 

0 

-9.666 

29.1 

3.8 

67.1 

^2u 

0 

-9.879 

8.4 

91.6 

Uu 

0 

-0.998 

97.3 

0.9 

1.8 

^2u 

0 

-10.005 

91.6 

8.4 

^2u 

2 

-10.994 

100 

^29 

6 

-11.962 

25.8 

74.2 

6 

-12.294 

1.9 

0.6 

16.0 

75.4 

2 

-13.386 

0.7 

16.0 

83.3 

4 

-13.974 

15.8 

84.2 

Uu 

6 

-14.951 

1.3 

98.7 

^2g 

6 

-15.307 

3.6 

96.4 

hu 

6 

-24.558 

7.7 

7.0 

85.3 

®ig 

2 

-27.939 

2.1 

16.9 

81.0 

®9 

4 

-29.080 

9.3 

90.7 

6 

-43.151 

11.9 

2.8 

85.3 

2 

-45.088 

0.8 

4.7 

94.5 

®9 

4 

-45.888 

1.1 

98.9 

bo 

6 

-55.115 

70.8 

29.2 

2 

-92.128 

89.4 

10.6 

^  Populations:  Sm 

=  5s’ 

,  5cf2-3^  03 

0.8  6p2,0 

Atom  charges:  Sm  = 

-0.204;  C  =  0.256;  0  =  - 

0.222. 

There  are  a  total  of  34  fully  occupied  orbitals. 

The  interval  spacings  of  these  levels  are  some  0.6 
eV  between  and  and  0.1  eV  between 
and  f2i,-  There  are  also  some  vacant  orbitals  lying 
only  0.06-0.12  eV  above  the  occupied  ones.  Thus 
the  open-shell  quintet  ground  state  is  amenable  to 
extensive  delocalization  of  electron  density  over 
both  metal  and  ligands. 

SmCCOg  is  shown  in  Table  III  to  have  a  closed- 
shell  <72h  ground  state  totally  4/^.^,  in  character 
and  about  1  eV  below  the  fj,,  LUMO  also  of 
predominately  4/  character.  However,  there  are 
two  other  vacant  levels  within  the  very  close  prox¬ 
imity  of  0.1 -0.3  eV  with  predominately  CO  charac¬ 
ter. 

Finally,  SmfOC),,  has  a  spin  nonet  ground  state 
with  eight  impaired  electrons  distributed  over  a 
manifold  of  levels  spanning  a  total  of  only  0.5  eV 
from  lowest  to  highest  occupied  (see  Table  IV). 
These  levels  consist  of  an  admixture  of  Sm  4/ 
(with  some  5d)  and  CO  character.  The  LUMO  also 
lies  in  the  same  energy  range  but  is  of  totally  CO 
character. 

Notably  in  the  case  of  Nd(CO)6  there  is  a  mod¬ 
erate  net  negative  charge  on  Nd,  which  is  about 
half  that  magnitude  on  Sm  in  SmfCO)^.  This  is  in 


keeping  with  the  evidence  indicating  that  the  elec¬ 
tron  affinity  of  Nd,  though  a  small  positive  value, 
should  certainly  be  higher  than  that  of  Sm  [10-13]. 
However,  in  both  Nd(OC)  and  Sm(OC)6  the  net 
metal  charges  are  both  positive,  with  that  of  Nd 
3.6  times  higher  than  the  Sm  charge.  This  is  in 
agreement  with  the  evidence  that  M-OC  bonds 
are  more  typically  ionic  than  M-CO  bonds  [6,  7]. 

As  shown  in  Table  V,  the  calculated  bond  ener¬ 
gies  are  found  to  be  quite  small  for  both  the 
Ln(CO)^  and  Ln(OC)g,  but  those  for  which  Ln  = 
Nd  are  a  little  higher  than  those  of  Sm.  However, 
none  of  these  are  high  enough  to  suggest  any 
meaningful  bond  stability. 

Apparently  the  low  bond  energies  and  close 
proximity  of  HOMO  and  LUMO  levels  provides  a 
reasonable  explanation  for  the  instability  of  these 
molecules,  and  hence  the  reason  for  their  fleeting 
existence  in  low-temperature  noble-gas  matrices. 
This  study  also  shows  that  Ln(OC)(;  oxonyl  bond¬ 
ing  mode  is  less  stable  than  the  Ln(CO)6  carbonyl 
bonding. 

Further  work  is  currently  underway  to  complete 
an  ab  initio  DFT  calculation  on  Nd(CO)g  and  to 
continue  SC-MEH-MO  calculations  on  the  remain- 
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TABLE  IV 


I  1  »  _ _ _ _ _ _ 

Pertinent  MO  parameters  for  Sm(OC)6.® 


MO  symmetry 
Type 


Elec. 

Occ.^ 


E 

(eV) 


Percent  AO  character  (>  0.5) 


5s 


5p 


5d 


6s 


6p 


CO 


Uu 

Uu 

t.. 


2u 


^2u 

Uu 

^2g 

Uu 

a 


ig 


Uu 

a 


ig 


0 

0 

0 

0 

1 

3 

3 
1 
6 
2 

4 
6 
4 
6 
2 
4 
6 
2 


-5.745 

-7.908 

-8.368 

-8.379 

-8.394 

-8.482 

-8.647 

-8.925 

-12.079 

-12.295 

-13.128 

-15.201 

-28.844 

-41.107 

-45.405 

-46.573 

-54.401 

-89.869 


99.5 


1.1 

98.9 


100 


1.5 


96.0 


1.5 


27.5 


63.8 


84.0 


11.1 


10.6 

4.7 

3.4 


2.1 


13.4 


4.4 


12.5 


2.9 


^  op",  6s°  6p^  Atom  charges:  Sm  =  0.510;  O  =  -0.529;  C  -  0.444. 

total  of  30  fully  occupied  orbitals  plus  4  singularly  occupied  orbitals  (2  triply  degenerate),  providing 


16.0 

0.5 

99.7 

100 

98.9 
1.1 

88.9 

86.0 

86.6 

89.4 

95.3 

96.6 

69.6 

94.1 

97.9 

36.2 
4.0 


^Populations:  Sm  =  5s^°,  5p^® 
^  There  are  a 
electrons. 


8  unpaired 


TABLE  V 


Comparative  bond  energies  and  bond  lengths. 


Molecule 

R(M-C) 

(A) 

R(M-O) 

(A) 

Average 
bond  energy 
(kJ  mor^) 

NcKCOle 

2.40^ 

eU 

NdlOOe 

2.34® 

56® 

Sm(CO)6 

2.37^ 

49® 

Sm(OC)6 

2.31® 

48® 

Cr(CO)6 

1.91 

108^ 

Mo(CO)6 

2.06 

152^ 

W(C0)6 

2.06 

179^ 

U(CO)6 

2.21 

®This  work;  optimized  bond  lengths. 

^  From  J.  A.  Connor,  Topics  Current  Chem.  71,  71  (1977). 


ing  LnCCO)^  molecules  for  Ln  =  Ce,  Pr,  Eu,  Gd, 
Tb,  Dy,  Ho,  Er,  Tm,  Yb,  and  Lu. 
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ABSTRACT _ _ _ _ _ _ 

Despite  their  importance  in  the  characterization  of  molecular  magnetism,  electronic 
^-tensors  have  received  rather  little  theoretical  attention.  In  this  work,  however,  the 
ground-state  g-tensor  of  MgF  is  computed  at  both  the  ROHF  and  multireference  Cl 
levels.  The  calculations  are  expanded  complete  to  second  order  in  appropriate  Breit-Pauli 
terms  and  the  contributions  of  all  important  magnetically  coupled  excited  states  are 
accounted  for.  Both  the  Cl-  and  ROHF-level  A  g,, -values  (-59  and  -55  ppm,  respectively) 
are  in  agreement  with  experiment  (  —  319  +  500  ppm)  within  the  range  of  experimental 
uncertainty.  For  Agj^  ,  however,  the  ROHF  treatment  yields  a  value  (  —  659  ppm)  in 
substantial  disagreement  with  experiment  (  - 1319  +  500  ppm).  Fortimately,  this 
discrepancy  is  alleviated  by  Cl  level  treatment  (Ag  ^  =  - 1447  ppm).  ©  1996  fohn  Wiley 
&  Sons,  Inc. 


Introduction 

When  molecular  radicals  are  subjected  to  an 
external  magnetic  field  B,  their  levels 
become  energetically  nondegenerate.  This  ener¬ 
getic  splitting,  called  the  electronic  Zeeman  effect, 
is  described  by  the  relation 

\E  =  hv  =  —fji-B,  (1) 

where  (i.  is  the  magnetic  dipole  moment  of  the 
radical. 


Due  to  molecular  anisotropy  |x  is  often  direc¬ 
tionally  dependent.  This  dependence  can  be  para¬ 
metrized  as  follows: 

H  =  -figS-g,  (2) 

where  is  a  second  rank  tensor  called  the  elec¬ 
tronic  ^-tensor;  fig,  the  Bohr  magneton;  and  S,  the 
molecule's  spin  angular  momentum.  Substituting 
Eq.  (1)  in  (2),  we  get  the  following  generalized 
expression  for  the  Zeeman  effect: 

AE  =  tigS-g-B.  (3) 
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Their  relation  to  magnetic  dipole  moments  makes 
g-tensors  important  as  molecular  parameters.  Be¬ 
cause  of  their  computational  complexity,  however, 
very  few  ab  initio  studies  have  been  done  on  these 
quantities. 

In  recent  work  [1-3],  however,  we  performed 
several  ROHF-level  calculations  of  electronic  g- 
tensors.  These  studies  include  the  first-ever  ab 
initio  treatments  complete  to  second  order  in  Ray- 
leigh-Schrodinger  perturbation  theory  [2,  3]  and 
also  provide  basis-set  and  gauge-dependence  anal¬ 
ysis  [1,  2]. 

These  ROHF-level  calculations  were  able,  for 
the  first  time,  to  reproduce  general  experimental 
trends  with  reasonable  consistency.  For  many  of 
the  systems  examined,  however,  sizable  discrepan¬ 
cies  were  observed  50%)  between  experiment 
and  theory. 

Except  for  small  radicals  with  relatively  few 
electrons  [3],  the  most  dominant  contribution  to 
the  ^-tensor  usually  comes  from  a  second-order 
(sum-over-states)  expansion  wherein  the  ground 
state  magnetically  couples  with  the  excited-state 
manifold.  It  is  expected  that  electron  correlation 
should  play  a  major  role  in  the  treatment  of  such 
terms. 

In  this  work,  therefore,  we  compared  and  con¬ 
trasted  ROHF-  and  multireference  Cl-level  g- 
tensor  calculations  for  the 
ground  state  of  MgF. 


Theory 

The  g-tensor  expansion,  complete  to  second  or¬ 
der  in  appropriate  Breit-Pauli  operators,  can  be 
written  as  follows: 

+ (4) 

where  g^  =  2.002319304386  [4]  is  the  free-electron 
g-factor,  ^7,  e  { X,  y,  z)  are  Cartesian  coordinates, 
and 

=  ^gRMc-sz8‘“’  +  Ag^lsz^e) 

+  Ag^cc-szi^e)  +  (5) 

is  the  g-shift.  Expressions  for  all  the  above  contri¬ 
butions  are  given  elsewhere  in  regular  [5]  and 
spin-field  reduced  forms  [1]. 

Most  of  the  contributions  to  Eqs.  (4)  and  (5)  are 
obtained  as  expectation  values  of  different 


Breit-Pauli  operators.  The  constant  e.g.,  is  ob¬ 
tained  as  the  expectation  value  of  the  spin-Zee- 
man  operator.  ^g^Mc-sz  obtained  similarly  as  a 
relativistic  mass  correction  to  the  spin-Zeeman 
operator,  and  ^gGc-sz(^e)  and  AgGc-sz(2^)  are 
one-  and  two-electron  gauge  corrections. 

By  contrast,  Ag2o  comes  from  a  second-order 
sum-over-states  expansion: 

7-1  , 


which  describes  the  total  magnetic  coupling  be¬ 
tween  the  ground  and  excited  states  [5],  HjoCle) 
and  HgQi2e)  are  the  one-  and  two-electron 
spin-orbit  operators,  is  the  orbital-Zeeman 
operator  and  E„  -  Eq  represents  the  vertical  exci¬ 
tation  energy  of  the  excited  state  More  details 
of  this  are  provided  elsewhere  [1,  5]. 

Computing  Ag^^  at  the  Hartree-Fock  level  gen¬ 
erally  requires  that  the  spin-orbit  and  orbital-Zee¬ 
man  matrix  elements  and  the  vertical  excitation 
energies  be  estimated  in  terms  of  the  ground-state 
molecular  orbitals.  The  resulting  formula,  given  in 
previous  work  [1,  2],  is  computationally  simple 
but  yields  rather  approximate  results. 

Using  correlated  methods  such  as  Cl,  it  is  possi¬ 
ble  to  resolve  the  matrix  elements  and  vertical 
excitation  energies  with  much  improved  accuracy 
[6].  Such  a  treatment  should  thus  yield  more  reli¬ 
able  results  for  Agjo- 

Cl  treatment  of  the  sum-over-states  expansion 
is,  however,  much  more  computationally  demand¬ 
ing.  Fortunately,  only  states  of  certain  symmetry 
species  may  contribute.  In  the  case  of  MgF 
ground  state),  only  IT  states  may  contribute  to 
^g  1.  (=  Ag'^X  Ag^J'),  while  only  X^  states  are 
relevant  for  Ag,,  (=  Ag"U.  As  well,  since  Hqz  is 
not  spin-dependent,  the  AS  =  0  selection  rule 
holds  firmly  for  g-tensor  calculations.  Finally,  it 
should  be  mentioned  that  although  the  X’^'^  X" 
(Ag||)  coupling  is  symmetry  allowed,  it  is  expected 
to  be  very  weak  [7].  While  ROHF-level  treatment 
gives  a  strictly  zero  second-order  contribution  to 
Ag||  [8],  Cl  calculations  allow  for  (extremely  small) 
nonzero  shifts  due  to  contributions  from  three 
open-shell  tt  tt*  states. 

Cl  methods  have  the  disadvantage  that  the 
sum-over-states  expansion  must  be  truncated  at 
some  point  and  that  the  contribution  of  discarded 
terms  is  difficult  to  estimate.  In  this  respect,  meth¬ 
ods  which  implicitly  include  all  excited-state  con- 
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tributions,  such  as  propagator  techniques,  would 
be  preferable.  As  yet,  however,  no  methodology 
suitable  for  ab  initio  polarization  propagator  g- 
tensor  calculations  has  been  developed. 


Computation 

Both  the  ROHF  and  Cl  wave  functions  for  MgF 
were  generated  using  the  MRD-CI  package  [9], 
ROHF-level  ^-tensor  calculations  were  performed 
with  the  GSTEPS  suite  of  programs  [1,  2,  10]  (in-' 
corporating  external  integral  routines  [11,  12]), 
while  Cl  level  treatment  was  effected  through  a 
combination  of  the  MRD-CI  (with  accompanying 
spin/orbit  programs)  and  GSTEPS  packages. 

In  this  work,  A^gc-sz^c)  is  only  computed  at 
the  Hartree-Fock  level,  due  to  inefficiency  in  our 
present  code.  Therefore,  our  best  g-tensor  results 
correspond  to  the  sum  of  ^gRMC-sz 
+  Ag2o  computed  at  the  Cl  level  plus  the  ROHF 
value  for  Ag^c-sz^^^X  As  Ag^c-sz^^^)  makes  only 
a  small  contribution  to  the  g-tensors  of  MgF  (see 
Results),  the  corresponding  correlation  error  is  ex¬ 
pected  to  be  minimal. 

Our  calculations  use  the  experimentally  deter¬ 
mined  Mg — F  bond  distance  (RMgF  "  bohr) 
[13].  As  per  convention,  the  molecule  was  speci¬ 
fied  to  lie  in  the  z-axis.  The  gauge  origin  was 
chosen  to  reside  at  the  electronic  charge  centroid 
(see  [2,  14])  as  computed  at  the  Hartree-Fock 
level.  An  estimate  of  the  gauge  dependence  was 
obtained  by  computing  the  g-tensor  with  a  gauge 
centered  on  the  Mg  nucleus. 

The  basis  sets  chosen  are  triple-zeta  polarized 
Cartesian  sets  of  Sadlej  [15,  16]  [(13sl0p4(i)  -> 
[7s5p4:d]  for  Mg;  (lOsSpAd)  [5s3p4d]  ifor  F;  d 
primitives  left  uncontracted]. 

While  the  basis-set  criteria  for  g-tensor  calcula¬ 
tions  have  not  been  fully  established,  the  Sadlej 
basis  set  is  designed  to  treat  both  electric  dipoles 
and  polarizabilities  at  correlated  levels  [15]  and 
may  thus  provide  the  flexibility  necessary  for  com¬ 
puting  our  complete-to-second-order  expansion 
[Eqs.  (5)  and  (6)],  It  is  therefore  seen  as  a  good 
starting  point  for  these  calculations. 

With  four  polarization  primitives  (of  which  two 
are  diffuse)  and  a  reasonable  valence  expansion, 
one  would  expect  a  decent  description  of  both  the 
ground  and  (valence)  excited  states.  As  well,  the 
range  of  primitives  from  very  compact  to  fairly 


diffuse  is  appropriate  for  computing  spin-depen¬ 
dent  properties  [17]. 

The  most  diffuse  s  and  p  functions  on  Mg  are 
essentially  equivalent  to  Rydberg  functions.  To 
ensure  the  resolution  of  lower-lying  Rydberg  states, 
however,  additional  3  s  and  3p  Rydberg  functions 
(a35  =  0.036,  =  0.029)  of  Dunning  and  Hay 

[18]  are  included  in  the  fluorine  basis  set.  Our  total 
description,  therefore,  amounts  to  88  functions. 

Details  of  the  Cl  wave  functions  for  the  relevant 
magnetically  coupled  excited  states  are  presented 
in  Table  I.  The  lowest  three  core  orbitals  (Is^  on  F 
and  ls^2s^  on  Mg)  are  frozen  and  the  highest 
three  virtual  orbitals  are  discarded.  While  the  gen¬ 
erated  Cl  expansions  range  in  size  up  to  3.0  mil¬ 
lion  symmetry-adapted  functions  (SAFs),  the  selec¬ 
tion  thresholds  (0-2  juHartree)  are  set  to  retain 
only  the  most  important  25,000  to  32,000  configu¬ 
rations.  The  values  (sum  of  squares  of  the 
expansions  coefficients  of  the  reference  configura¬ 
tions  in  the  Cl  expansion)  mostly  range  from  0.94 
to  0.96,  indicating  that  the  states  are  consistently 
well  described  by  the  reference  configurations  [20]. 
The  state  energies  reported  correspond  to  approxi¬ 
mate  full  Cl  energies  as  obtained  by  the  general¬ 
ized  version  [21]  of  Davidson's  extrapolation  tech¬ 
nique  [22]. 

In  computing  A ^20  level,  all  valence 

states  (and  some  low-lying  Rydbergs)  are  treated 
in  energetically  increasing  order  until  three  consec¬ 
utive  states  with  minimal  importance  (\^g2o^%)\ 
<  1%  of  Ag"^^)  are  encountered.  At  this  point,  all 
further  contributions  are  assumed  to  be  insignifi¬ 
cant  and  the  expansion  is  truncated.  Only  those 
states  making  appreciable  (>  1%)  contributions 
are  retained  in  Ag2o’  Th^  discarded  states,  while 
not  useful  in  computing  Ag2o/  provide  a  rough 
approximation  to  the  error  range  of  Ag2o  fol¬ 
lows: 

A(Ag,%^)-2x  E  (7) 

discarded 

test 

states 

One  should  note  two  things  about  this  formal¬ 
ism:  (1)  it  tends  to  ignore  most  of  the  higher 
Rydberg  states,  and  (2)  the  expansion  may  proceed 
beyond  the  molecule's  ionization  limit  into  the 
range  of  high-energy  valence  states.  Both  these 
conditions  are  reasonable  since  (1)  the  higher  Ryd¬ 
berg  states  are  generally  much  too  diffuse  to  cou¬ 
ple  effectively  with  the  ground  state  and  (2)  the 
high-energy  valence  states,  although  physically 
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TABLE  I 


Cl  wave  function  data  for  relevant  states  of  MgF. 

State 

Leading 

Configuration 

E  ^ 

^thr 

SAFs“ 

Vertical  excitation  energies  (eV) 

This  work  Theoretical^  Exp.® 

6o'^27r^7cr^ 

0.947 

0.2 

27925 

— 

—  — 

277  ^  377 

0.913 

2.0 

25630 

8.7231 

22X- 

277  ^  377 

0.919 

2.0 

25630 

10.1969 

277  ^  477 

0.968 

1.5 

27268 

20.0115 

i^n 

CO 

T 

0.944 

0.6 

31160 

3.4470 

3.3  3.45 

2^n 

7(7  ^  477 

0.943 

0.6 

31287 

6.1359 

6.2  6.73 

3^n 

7(7  ^  577 

0.942 

0.3 

29125 

7.0223 

4^n 

7(7  ^  077 

0.948 

0.6 

27592 

8.5875 

5^n 

CO 

T 

b 

CD 

0.948 

1.2 

25637 

8.7797 

6^n 

277  ^  7(7 

0.904 

1.0 

27684 

10.9284 

7^n 

6(7  ^  377 

0.999 

1.0 

26649 

19.7708 

8^n 

7(7  ^  777 

0.990 

0.6 

29601 

20.2130 

6(7  577 

0.998 

1.0 

26142 

20.6257 

^  Squares  of  Cl  expansion  coefficients  summed  over  reference  configurations. 

Energy  threshold  (in  jiiHartree)  for  retention  of  configurations  in  the  Cl  expansion. 

No.  of  symmetry-adapted  functions  (i.e.,  configurations)  included  in  the  selected  Cl  space. 
^  Obtained  from  [19]. 

®  Obtained  from  [13]. 


unstable,  may  be  computationally  significant  and 
have  been  found  in  some  cases  [10]  to  yield  appre¬ 
ciable  contributions  to  the  ^-tensor. 


Results  and  Discussion 

Table  II  provides  details  of  states  whose  mag¬ 
netic  coupling  contributes  to  A^2o-  From  this,  one 
notices  two  things:  (1)  As  expected  [7,  8],  there  do 
not  appear  to  be  any  significant  second-order  con¬ 
tributions  to  A^||,  and  (2)  (A^2o)x  is  apparently 
dominated  by  the  first  ^  IT  state. 

The  l^n  coupling  is  especially  favor¬ 

able  for  two  reasons:  The  state  is  low  in  energy 
and  its  singly  occupied  molecular  orbital  (SOMO), 
377,  is  located  primarily  on  the  Mg  atom  and  thus 
can  interact  strongly  with  the  ground-state  SOMO 
(also  on  Mg).  No  other  states  possess  both  such 
qualities;  thus,  (Ag2o)±  is  effectively  a  one-state 
expansion. 

Table  III  gives  the  breakdown  of  different  con¬ 
tributions  to  the  ROHF  and  Cl  level  ^-tensors.  In 
both  cases,  second-order  terms  dominate  Ag^, 
while  first-order  contributions  determine  A^j|. 


TABLE  II _ 

Magnetic  coupling  data  for  excited  states 
contributing  to  Ag2o  of  MgF;  spin-orbit  and 
orbital-Zeeman  matrix  elements,  vertical  excitation 
energies,  and  Aggo  contribution  given.® 


(Hso^ 
X10-''  (eV) 

XIO-'’  (eV) 

AE 

(eV) 

(ppm) 

Ag,, 

I^S" 

-13.4183 

-2X10“^ 

8.7231 

0 

22X- 

-20.5968 

-1  X  10"® 

10.1969 

0 

32X- 

1.1201 

4  X  10“® 

20.0115 

0 

i^n 

-16.2122 

0.9053 

3.4470 

-1471 

2^U 

-7.9868 

0.1918 

6.1359 

-86 

3^n 

-3.1030 

-0.1603 

7.0223 

14 

4^n 

11.9253 

-0.0423 

8.5875 

-20 

1 .2059 

0.2568 

8.7797 

12 

e^n 

16.4946 

0.2390 

10.9284 

125 

-16.3599 

0.0019 

19.7708 

1 

1 .0662 

0.0617 

20.2130 

1 

g^n 

0.0786 

0.0689 

20.6257 

0 

®  Coupling  is  with  respect  to  the  ground  state. 
^Assuming  a  field  strength  of  1  Tesla. 
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TABLE  III _ _ 

ROHF  and  Cl  level  g-shifts  for  MgF  showing 
contributions  from  different  terms 
(all  values  In  ppm). 


ROHF 

Cl 

^9rmc-sz^ 

-63 

-68 

APx 

144 

150 

^9|| 

90 

90 

^9gc-sz 

Ag^ 

-103 

-103 

-81 

-81 

^92o" 

APi 

-637 

-1426 

±4 

Ag,! 

0 

0 

±0 

Total 

Agx 

-659 

-1447 

Ag,, 

-55 

-59 

Exp.'^ 

Ag^ 

-1319 

±500 

Ag,! 

-  319 

±500 

®  Reiativistic  mass  correction  is  isotropic. 

^Values  for  Aggc-szt^e)  are  computed  at  the  ROHF  level. 
""  States  contributing  only  minimally  to  Ag2o  are  omitted 
from  the  Cl  sum-over-states  expansion.  These  states  are 
used,  however,  in  estimating  the  uncertainty  for  Agfgo* 
"[23]. 


First-order  teriirs  are  well  described  at  the  ROHF 
level;  thus,  there  is  little  difference  between  the 
ROHF  and  Cl  g||-shifts.  Correlation  plays  a  very 
important  role  for  ,  however,  in  particular 
because  of  the  failure  of  the  ROHF  treatment  to 
adequately  describe  the  1^11  interaction. 

The  matrix  elements  for  the  1^0 

spin-orbit  and  orbital-Zeeman  interactions  are 
both  substantially  underestimated  (  —  6.70  X  10”^ 
and  0.69  X  10“'*  eV,  respectively)  at  the  ROHF 
level  compared  to  the  corresponding  Cl  values 
(-16.21  X  10"^  and  0.91  X  10“^  eV).  Addition¬ 
ally,  the  excitation  energy  determined  by  the  ROHF 
treatment  (5.22  eV)  is  more  than  1.7  eV  too  large 
(Cl:  3.45  eV).  From  all  three  factors,  therefore,  one 
sees  why  Ag  is  drastically  underestimated  (  —  659 
ppm)  at  the  ROHF  level.  The  Cl  value  (-1447 
ppm),  although  possibly  larger  than  the  experi¬ 
mental  value  ( — 1319  ±  500  ppm,  Ne/ Ar  matrix) 
is  in  much  better  agreement. 

With  regard  to  the  gy  shift,  very  little  difference 
is  observed  when  comparing  Cl  (-59  ppm)  and 
ROHF-level  (  —  54  ppm)  values.  Both  are  well 
within  the  range  of  experimental  error  (-319  + 
500  ppm),  although  it  is  possible  that  they  under¬ 


estimate  the  magnitude  of  the  negative  shift.  This 
would  be  consistent  with  other  matrix  isolation 
data  on  diatomic  species.  In  the  extreme  case  of 
CO^,  e.g.,  the  experimental  Ag||  value  is  —1419  + 
500  ppm  [24],  while  the  best  theoretical  estimate  is 
=  -178  ppm  [10]).  The  relatively  large  experi¬ 
mental  |Ag|||  values  may  indicate  matrix  interac¬ 
tions  which  reduce  the  molecule's  effective  sym¬ 
metry,  thereby  allowing  spin-orbit  coupling  to 
occur  along  the  bond  axis  [7].  It  may  therefore  be 
somewhat  inappropriate  to  compare  our  A^n  val¬ 
ues  with  matrix  isolation  data. 

Finally,  while  most  aspects  of  the  Cl  g-tensor 
treatment  have  yielded  very  encouraging  results 
for  MgF,  we  found  the  Cl-level  results  to  have  a 
surprisingly  large  gauge  dependence.  The  Agj^ 
value  computed  with  the  origin  on  the  Mg  atom 
(i.e.,  shifted  by  1.34  bohr  from  the  ECC)  was  sub¬ 
stantially  smaller  (  —  1187  ppm),  leading  to  a  CI- 
level  gauge  dependence  of  260  ppm/bohr  (18.0% 
the  size  of  Ag  j^).  This  is  the  in  stark  contrast  to  the 
Cl-level  results  that  we  have  obtained  for  other 
molecules  [10].  For  example,  the  Cl-level  gauge 
dependence  of  CO’^  has  a  value  only  0.3%  the  size 
of  Ag^  [10]. 

The  ROHF-level  gauge  dependence  for  MgF  is 
found  to  be  only  8  ppm/bohr  (1.2%  of  Ag  j^).  The 
discrepancy  between  this  value  and  the  Cl-level 
gauge  dependence  is  difficult  to  explain.  While  the 
larger  Cl  value  may  be  related  to  the  trimcation  in 
the  sum-over-states  expansion,  this  is  unlikely  due 
to  the  rapid  decline  in  importance  of  higher  contri¬ 
butions  (see  Table  II).  Further  study  of  this  phe¬ 
nomenon  may  be  in  order. 

In  light  of  these  gauge-dependence  findings,  we 
do  stress  that  all  our  calculations  are  conducted 
with  the  center  at  the  molecule's  electronic  charge 
centroid.  This  provides  a  nonarbitrary  choice  of 
gauge,  which,  in  theory  [14],  should  provide  re¬ 
sults  in  consistent  proximity  to  the  correct  values. 


Summary  and  Conclusions 

Although  our  ROHF  g-tensor  calculations  [1-3] 
are  adequate  to  reproduce  qualitative  trends  in 
experimental  data,  we  foimd  in  this  article  that  the 
extension  of  the  method  to  correlated  levels  sub¬ 
stantially  improves  agreement  with  experiment.  In 
particular,  the  Cl  level  Ag^  value  for  MgF  differs 
by  only  10%  from  the  experimental  value  (well 
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within  the  error  margin),  while  the  discrepancy 
was  50%  at  the  ROHF  level.  While  first-order 
terms  appear  to  be  well  described  at  the  ROHF 
level,  a  reliable  treatment  of  excitation  energies 
and  spin-orbit  coupling  requires  correlation. 

While  this  treatment  is  the  most  advanced  yet 
applied  to  the  g-tensor  problem,  we  admit  to  some 
technical  difficulties:  First,  we  have  not  yet  de¬ 
vised  an  efficient  method  for  obtaining  values  for 
the  term  at  the  Cl  level.  Neglect  of 

correlation  in  this  term,  however,  is  expected  to 
introduce  only  minor  errors,  likely  on  the  order  of 
20  ppm  or  less  for  most  systems. 

Second,  our  approach  to  computing  Ag2o  the 
Cl  level  introduces  a  degree  of  uncertainty  because 
of  the  truncation  of  the  sum-over-states  expansion. 
This  is  of  little  worry  in  the  case  of  MgF,  where 
A^^2o  is  dominated  by  just  one  state  (1^11),  while 
other  states  contribute  only  3%  to  A^^  .  One  can 
envision  systems,  however,  in  which  substantial 
difficulties  would  be  encountered.  A  more  effec¬ 
tive  way  of  treating  such  a  perturbation  expansion 
would  be  through  the  use  of  a  spin-orbit-per- 
turbed  Cl  formalism,  such  as  that  proposed  by 
Hess  [25].  Unfortunately,  no  such  program  is  yet 
available. 

Nonetheless,  MgF  (and  other  such  simple  radi¬ 
cals)  provide  excellent  examples  of  systems  for 
which  our  Cl-level  ^-tensor  treatment  is  particu¬ 
larly  successful.  It  is  not  certain  whether  such 
accurate  calculations  will  be  possible  for  all  molec¬ 
ular  radicals;  however,  this  work  represents  an 
important  step  in  the  development  of  effective  ab 
initio  electronic  g-tensor  computation. 
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ABSTRACT _ _ _ _ _ 

In  intense  laser  fields,  atoms  and  molecules  can  absorb  many  more  photons  than  required 
for  ionizahon.  This  phenomenon  is  called  above-threshold  ionization  (ATI)  and  it  shows 
up  in  kinetic  energy  spectra  of  ionized  electrons.  The  presence  of  very  high  energy 
photoelectrons  necessitates  the  use  of  extremely  large  grids  in  numerical  simulations 
based  on  the  full  time-dependent  Schrodinger  equation.  We  show  that  by  using  a  wave 
function  splitting  scheme  one  can  circumvent  the  problem  of  large  grids  and  thus  obtain 
accurate  multiphoton  photoelectron  spectra.  This  scheme  will  be  very  useful  in  studies 
of  ATI  spectra  generated  by  molecules  in  intense  laser  fields.  We  illustrate  this  method 
for  a  one-dimensional  model  of  the  H  atom  in  intense  laser  fields.  ©  1996  John  Wiley  & 
Sons,  Inc. 


Introduction 

The  interaction  of  intense  laser  pulses  (inten¬ 
sity  I  >  10^^  W/cm^)  with  atoms  and  mole¬ 
cules  leads  to  many  interesting  multiphoton,  non- 
perturbative  phenomena  such  as  high-order  har¬ 
monic  generation  and  above-threshold  ionization 
(ATI).  This  has  been  an  area  of  active  research  in 
the  past  decade  [1-4].  One  of  the  theoretical  ap¬ 
proaches  to  deal  with  this  nonperturbative  prob¬ 
lem  relies  on  exact  numerical  solutions  of  the 
time-dependent  Schrbdinger  equation  (TDSE)  on 


recently  available  powerful  computers.  So  far,  this 
approach  has  been  successfully  applied  to  one- 
electron  atoms  and  molecules  (or  two  1-D  elec¬ 
trons).  Typically,  in  such  numerical  simulations, 
the  initial  wave  function  of  the  electron  is  well 
localized  in  space  and  later  becomes  spread  over  a 
huge  distance.  For  a  short,  intense,  40  fs  laser 
pulse,  the  front  edge  of  the  wave  packet  of  the 
ionizing  electron  can  be  as  far  as  4000  bohr  from 
the  nucleus  (for  J  =  2  X  10^^  and  A  =  630  nm)  and 
continues  to  expand  for  longer  times.  Thus,  to 
avoid  reflections  from  the  boundaries,  pro¬ 
hibitively  large  boxes  (grids  with  32,000  points  or 
more),  in  which  the  electron  wave  packet  is  fully 
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contained,  should  be  used.  The  most  common  so¬ 
lution  to  this  difficulty  is  to  use  much  smaller 
boxes  with  absorbing  boundary  conditions.  This 
approach  allows  one  to  calculate  the  ionization 
rates  and  harmonic  generation  spectra  but  it  does 
not  allow  one  to  calculate  the  photoelectron  spec¬ 
tra,  since  the  external  part  of  the  wave  function  is 
thrown  away.  Also,  in  our  recent  numerical  stud¬ 
ies  of  competition  between  ionization  and  dissocia¬ 
tion  [5],  the  use  of  an  absorber  for  the  electron  flux 
led  to  a  considerable  loss  of  information  about  the 
nuclear  movement.  In  this  article,  we  develop  a 
technique  which  allows  one  to  reconstruct  the  ab¬ 
sorbed  electron  wave  packet  in  momentum  space 
and  propagate  it  in  time,  analytically  to  any  time 
after  the  pulse  turn-off.  Thus,  with  the  help  of  this 
technique,  one  can  calculate  efficiently  the  photo¬ 
electron  spectra  after  the  turn-off  of  the  pulse.  A 
similar  technique  has  been  used  earlier  [6],  for 
calculating  kinetic  energy  spectra  of  photodissocia¬ 
tion  products,  with  the  laser-molecule  coupling 
neglected  in  the  asymptotic  region.  Recently  [7], 
this  technique  was  generalized  for  non  decaying 
(for  larger  distances)  dipole  couplings  (for  dissoci¬ 
ating  molecules)  and  for  the  calculation  of  the  ATI 
spectra  from  the  TDSE  in  the  Kramers -Henne- 
berger  (K— H)  representation  [8]  for  high-frequency 
laser  fields.  In  these  cases,  the  laser-induced  quiver 
(ponderomotive)  motion  is  small.  In  this  article, 
we  use  the  electric  field  representation  (gauge)  for 
near-infrared- visible  frequencies,  when  quiver  mo¬ 
tions  are  large.  In  [8],  the  projections  on  exact 
eigenstates  of  hydrogen  were  performed.  TIere,  we 
do  not  use  the  (K-TI)  frame  and  show  that  correct 
spectra  can  be  obtained  by  projecting  on  plane 
waves  in  the  asymptotic  region  instead  of  project¬ 
ing  on  hydrogen  asymptotic  outgoing  Coulomb 
waves.  This  technique  will  allow  us  to  develop  a 
rigorous  theory  of  multiphoton  electron  spec¬ 
troscopy  based  on  ATI  in  molecules  with  the  aim 
of  developing  a  new  tool  for  studying 
electron-molecule  dynamics. 


Description  of  the  Wave-Function 
Splitting  Algorithm 

The  dynamics  of  a  hydrogen  atom  interacting 
with  an  intense,  linearly  polarized  laser  field  is 
frequently  described  by  the  1-D  Schrodinger  equa¬ 


tion  [2]: 


ih 


dipiz,  t) 
dt 


r  +  ZE{t)  \l^{z,t), 

VI 


(1) 


where  E(0  =  and  is  the 

maximum  value  of  the  electric  field.  This  equation 
is  usually  discretized  and  solved  numerically  for 
\z\  <  z^.  For  our  test  calculations,  we  chose  the 
laser  intensity  I  =  22  X  10^^  W/cm^  A  =  630  nm, 
and  ==  20  cycles  (1  cycle  =  2.17  fs)  in  order  to 
compare  with  the  results  from  [9],  The  photoelec¬ 
tron  spectra  from  this  reference  show  a  plateau 
extending  up  to  SU^  =  70  eV  with  a  slowly  decay¬ 
ing  tail  up  to  130  eV  (70  photons  absorbed),  where 
Up  =  [eE^/(2mci))]^  is  the  electron  ponderomotive 
energy.  Thus,  the  fastest  electron  has  a  speed 
=  2.3  au  and  =  43  fs  can  be  as  far  as  4050 
bohr.  Typically,  the  integration  step  in  space  is 
8z  =  0.25  bohr,  which  means  that  the  grids  with 
over  32,000  grid  points  should  be  used  for  a  43  fs 
pulse.  For  n  times  longer  pulses,  n  time  greater 
grids  should  be  used.  This  constitutes  a  serious 
difficulty,  even  for  1-D  problems.  Therefore,  a  fast 
algorithm  for  large  z's  is  very  desirable.  In  partic¬ 
ular,  since  the  Coulomb  forces  are  smooth  and 
negligible  for  large  z's,  larger  interaction  steps  in 
space  5z,  as  well  as  much  larger  steps  in  time,  can 
be  used. 

Let  us  divide  the  total  grid  into  three  overlap¬ 
ping  regions  in  space,  in  which  different  evolution 
algorithms  will  be  used:  the  internal  and  two  ex¬ 
ternal  (asymptotic,  for  positive  and  negative  z's) 
parts  defined  by 


internal:  |z|  <  Zy„  and  external:  z^,^  <  |z|  <  z^, 

(2) 


where  Zy„  —  z^,^  =  Zq  >  0  is  the  size  of  the  over¬ 
lap  of  both  regions.  In  this  overlapping  (or 
"matching")  region,  the  absorbing  potential  is  in¬ 
troduced,  which  we  have  chosen  in  the  following 
form: 


=  iVo 


for  z,,  <  |z|  <  z,-„. 

(3) 


1686 


QUANTUM  CHEMISTRY  SYMPOSIUM  NO.  30 


ABOVE-THRESHOLD  IONIZATION  ELECTRON  SPECTRA 


The  electron  wave  function  is  split  into 

internal  and  external  parts,  by  applying 
this  absorbing  potential  in  the  overlapping  region, 
at  times  t  ^  k8t,  where  St  is  much  larger  than  the 
integration  time  dt  (dt  =  0.03  au,  8t  =  150  dt,  in 
our  simulations)  with  the  help  of  the  following 
formulas: 


(4) 

t)  =/(z,  t), 

(5) 

=  (1  -/(z,  80)iA(z,  0, 

(6) 

where 

/(z,60 

=  expC-SfV^jijCz))  for  z^,^  <  |z| 

(7) 

/(z,  60  =  1  for  |z|<z,^. 

(8) 

and 

fiz,8t)=0  for  |z|  >  z,„. 

(9) 

Our  calculation  scheme  relies  on  the  assumption 
that  in  the  asymptotic  zone  (defined  by  |z|  > 
the  Coulomb  potential  can  be  neglected.  This  al¬ 
lows  us  to  perform  the  time  evolution  exactly  and 
over  an  arbitrary  time  interval  by  calculating  first 
the  wave  function  in  the  velocity  gauge 
with  the  help  of  the  formula 

ip^iz,  t)  =  exp(-zA(C0)z)i/^(z,  t),  (10) 

where 

=  -  pECOdt  (11) 

is  the  electric  field  area  over  the  interval  f2- 
Next,  we  calculate  the  Fourier  transform  <p^(  p,  0 
of  ip.^(z,  0,  which  evolves  in  time  according  to  the 
equation 

Vr  ^2)  =  Vr  ^2)/  (12) 

where 

=  exp[  -  -  +  2A(f ,  0)  p  +  A^(f ,  0))  dtj  . 


In  our  algorithm,  we  perform  the  series  of  follow¬ 
ing  operations  at  each  time  =  k8t: 

(i)  The  exact  time  evolution  of  is  evaluated 
in  the  internal  zone  (|z|<z^„)  using  the 
split-operator  spectral  method  [10]. 

(ii)  The  outgoing  wave  is  eliminated  from  the 
internal  zone  with  the  help  of  Eq.  (5). 

(iii)  The  splitting  operation  described  by  Eq.  (6)  is 
performed  to  provide  the  outgoing  wave 
defined  in  the  "matching  zone,"  z^^  <  |z|  < 

(iv)  The  velocity  gauge  wave  function  ip^  and  its 
Fourier  transform  (p^  are  calculated: 

=  (277)"'/'  '\xp(~ipz)ip^(zjj,)  dz, 

Zex 

(14) 

(v)  The  momentum  wave  function  cp^i  p,  is 
calculated  at  some  final  time  tf>tp  (at  which 
the  photoelectron  spectra  are  to  be  calculated), 
using  the  formula 

(P^i  p,  tj)  =  t^)(pS  Pr  ffc).  (15) 

(vi)  The  momentum  wave  function  (pj^  p,t^)  is 
added  to  (p^(  p,  accumulated  at  previous 
steps. 

(vii)  The  loop  ends  here  and  calculation  starts  at 
step  (i)  with  the  integer  k  replaced  by  fc  -t-  1. 


Results  and  Discussion 

The  most  difficult  part  of  the  scheme  presented 
here  is  the  choice  of  the  free  parameters: 
z,„,  and  Zq.  Several  obvious  necessary  (but  not 
sufficient)  physical  conditions  should  be  satisfied: 
First,  the  external  zone  (|z|  >  z^^)  in  our  scheme  is 
the  asymptotic  region  in  which  we  neglect  the 
Coulomb  forces;  therefore,  we  must  require  that  at 
z  =  z^^  the  Coulomb  force  is  much  less  then  the 
electric  force  i.e.: 

—  (16) 

Second,  our  scheme  is  based  on  the  assumption 
that  the  wave  entering  the  "matching"  zone  (of 
length  Zq)  does  not  return  back  to  the  nucleus,  as 
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TABLE  I 


Values  of  parameters  (In  atomic  units)  used  in  the  wave-function  splitting  scheme. 


0^0 

dt 

dt 

dz 

^in 

^ex 

^abs 

0.32 

16.1 

0.03 

4.5 

0.25 

4196 

3300 

2 

rij^  is  number  of  grid  points  in  the  interna!  zone  (|z|  <  z,^)  and  is  the  number  of  grid  points  in  the  external  zone,  in  momentum 
space  with  the  momentum  resolution  Ap  =  277/7168  au. 


well  as  that  the  wave  exiting  this  zone  does  not 
return  to  it.  This  means  that  we  should  require 
that  the  classical  ponderomotive  radius  is  much 
smaller  than  the  size  of  the  internal  box  and 
also  smaller  than  the  size  of  the  matching  ''zone": 

eE 

ttg  ^  and  Ug  Zq,  where  an  = - r .  (17) 

mo) 


In  addition,  we  must  require  that  the  matching 
should  be  redone  after  the  time  interval  8 1  shorter 
than  the  hme  necessary  to  move  for  the  fastest 
electron  across  the  matching  "zone."  Assuming 
that  the  fastest  electron  has  the  energy 
we  thus  get 


8t 


4y^- 


(18) 


Finally,  the  absorption  should  occur  on  the  dis¬ 
tance  Zq  longer  than  de  Broglie  wavelength  of  the 
slowest  electron.  Assuming  that  the  slowest  elec¬ 
tron  has  the  energy  h  co,  we  thus  get 


For  the  laser  parameters  chosen  in  this  article, 
7  =  2.2  X  10^^  W/cm^  and  A  =  630  nm,  the  values 
of  two  relevant  parameters  are  ag  =  16.1  bohr. 
Up  =  0.32  au  =  8,7  eV,  and  w  =  0.7  au.  We  have 
listed  in  Table  I  the  values  of  all  parameters  which 
we  used  in  our  calculations  and  which  satisfy  all 
inequalities  Eqs,  (16)-(19).  The  spectra  of  electrons 
moving  in  the  direction  of  positive  z  where  calcu¬ 
lated  according  the  formula 

o  m 

probability(E)  =  \(p(p(E))\  ,  (20) 

where  p(E)  =  •\/2mE .  We  display  in  Figure  1  the 
low-energy  part  of  the  calculated  spectra,  and  in 


FIGURE  1.  Low-energy  part  of  the  ATI  photoelectron 
spectrum.  The  laser  intensity  I  =  2.2  x  W/cm^  and 
the  wavelength  A  =  630  nm.  Solid  line:  two-box 
calculation;  dashed  line:  one-box  calculation. 


Figure  2,  the  entire  spectrum,  on  the  logarithmic 
scale.  We  also  calculated,  for  comparison  (dashed 
line),  the  photoelectron  spectra  using  one  large  box 
defined  by  |z|  <  7000  bohr.  We  note  a  slight  dis¬ 
agreement  between  the  heights  (not  positions)  of 
several  first  photoelectron  peaks.  This  disagree¬ 
ment  becomes  less  when  the  spectra  are  calculated 
at  later  times,  if  >  tp  (with  the  laser  tumed-off  for 
t^>  tp),  since,  then,  all  slow  electrons  have  enough 
time  to  leave  the  matching  zone.  The  peaks  for 
energies  higher  than  5  a),  obtained  from  one-box 
and  two-box  calculations,  overlap  exactly  and  they 
agree  well  with  previous  calculations  [8,  9],  in 
which  projections  were  done  on  exact  Coulomb 
waves,  while  we  performed  projections  on  plane 
waves  in  the  external  zone,  |z|  >  z^^. 

We  are  currently  extending  this  method  to  in¬ 
clude  the  nuclear  motion  as  well,  following  the 
numerical  procedure  described  in  our  previous 
work  [5,  10].  We  expect  the  ensuing  multiphoton 
electron  spectra  to  reveal  the  full  electron-nuclear 
dynamics  of  molecules  in  strong  laser  fields. 


1688 


QUANTUM  CHEMISTRY  SYMPOSIUM  NO.  30 


ABOVE-THRESHOLD  IONIZATION  ELECTRON  SPECTRA 


10  20  30  40  50  60 


ELECTRON  ENERGY  (PHOTON  ENERGY) 

FIGURE  2.  Entire  ATI  photoelectron  spectrum.  The 
laser  intensity  /  =  2.2  x  1 0^^  W  /  cm^  and  the  wavelength 
A  =  630  nm. 
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ABSTRACT _ 

By  performing  ab  initio  calculations  on  fragments  of  leu^-zervamicin,  it  is  demonstrated 
that  accurate  wave  functions  can  be  obtained  for  this  large  structure  by  combining  the 
results  from  the  fragments.  Input  information  consists  of  atomic  coordinates  as  obtained, 
for  example,  from  a  crystal  structure  determination.  The  fragments  are  composed  of  a 
kernel  of  atoms  surrounded  by  a  chosen  neighborhood  of  atoms.  The  entire  molecule  is 
divided  into  individual  kernels  and  their  neighborhoods  are  added  for  the  individual 
fragment  calculations  on  the  basis  of  the  distances  of  other  atoms  from  the  atoms  in  a 
kernel.  The  hydrated  leu^-zervamicin  is  composed  of  295  atoms  which  could  be  handled 
in  full  by  Gaussian  94.  The  results  of  the  fragment  calculations  were  combined  to  provide 
an  electron  density  distribution  for  the  molecule.  This  distribution  was  compared  with 
one  that  represents  the  distribution  obtained  from  a  calculation  on  the  entire  molecule  at 
once.  The  clear  implication  of  this  study  is  that  it  is  feasible  to  perform  ab  initio 
calculations  on  structures  by  the  use  of  fragments.  The  time  involved  increases  essentially 
linearly  with  complexity.  ©  1996  John  Wiley  &  Sons,  Inc. 
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Introduction 

In  a  previous  article  [1],  an  intimate  connection 
was  made  between  structure  factor  magni¬ 
tudes  obtained  from  X-ray  diffraction  data  and 
those  obtained  from  quantum  mechanical  models 
of  electron  density  distributions.  The  objective  has 
been  to  obtain  wave  functions  consistent  with 
crystallographic  diffraction  data  for  use  in  the  cal¬ 
culation  of  various  properties  of  a  broad  variety  of 
substances.  In  our  approach  to  the  calculation  of 
the  quantum  mechanical  model,  the  electron  den¬ 
sity  distribution  in  a  crystal  is  defined  in  terms  of 
a  single  determinant  of  orbitals.  In  the  single- 
determinant  approach,  a  main  feature  is  the  occur¬ 
rence  of  a  projector  matrix  in  the  quantum  me¬ 
chanical  definition  of  the  electron  density  distribu¬ 
tion.  The  quantum  mechanical  model  is  matched 
to  the  experimental  X-ray  data  by  making  adjust¬ 
ments  to  the  elements  of  the  projector  matrix  by 
use  of  the  least-squares  technique.  The  goal  then  is 
to  obtain  a  projector  matrix  that  provides  a  good 
fit  to  the  X-ray  data  and  is  suitable  for  antisym¬ 
metric  N-electron  wave  functions.  Such  a  matrix  is 
called  N-representable  [2-4].  Evidently,  it  is  desir¬ 
able  to  have  a  good-quality  projector  matrix  for 
use  in  initiating  the  calculations. 

A  Hartree-Fock  or  density  functional  theory 
calculation  produces  a  suitable  projector  matrix  for 
initiating  the  desired  calculations.  In  their  present 
configuration,  currently  available  computer  pro¬ 
grams  can  handle  a  polypeptide  composed  of  sev¬ 
eral  hundred  atoms.  When  this  type  of  capacity  is 
not  exceeded,  there  may  be  no  reason  to  consider 
fragment  calculations  that  can  produce  projector 
matrices  in  a  stepwise  fashion  [1].  On  the  other 
hand,  fragment  calculations  would  certainly  be 
indicated  for  molecules  larger  than  the  capacity  of 
computers  and  computer  programs.  Fragment  cal¬ 
culations  performed  on  parallel  computers  may 
afford  an  advantage  in  speed  that  may  be  worth 
considering  even  when  the  capacity  to  make  the 
calculation  for  the  entire  molecule  at  once  exists. 

Quantum  mechanical  calculations  have  been 
performed  on  molecular  moieties  for  several  pur¬ 
poses  in  a  variety  of  ways.  Some  of  these  types  of 
calculations  have  already  been  noted  in  our  previ¬ 
ous  study  [1].  They  concerned  the  work  of  Yang 
and  Lee  [5,  6]  who  solved  the  Kohn-Sham  [7] 
equations  for  partial  structures  and  obtained  a 


representation  of  a  full  molecule  by  using  spatial 
partition  functions  to  weight  a  sum  of  partial 
structures,  the  calculations  of  Christofferson  and 
co-workers  [8-10]  who  solved  the  Schrodinger 
equation  approximately  for  partial  structures  with 
the  use  of  Hartree-Fock  calculations  and  used  the 
solutions  as  a  source  of  molecular  orbitals  which 
could  serve  as  a  basis  set  of  functions  for  solving 
the  Schrodinger  equation  for  the  complete 
molecule,  the  approach  of  Bader  and  co-workers 
[11-14]  who  developed  a  method  for  splitting  the 
known  electron  density  of  a  molecule  into  densi¬ 
ties  for  atomic  groupings  that  have  sharply  de¬ 
fined  borders,  and  the  "Tego"  method  of  Walker 
and  Mezey  [15]  who  make  a  database  of  the  elec¬ 
tron  densities  of  the  various  atomic  groupings 
required  for  assembly  to  form  the  electron  densi¬ 
ties  of  larger  molecules  of  interest. 

Additional  types  of  calculations  are  related  to 
our  fragment  method,  in  that  they  all  scale  linearly 
with  the  number  of  atoms  for  essentially  the  same 
reasons.  They  are  also  similar  to  ours  in  that  they 
use  expansions  in  direct  (coordinate)  space  with 
basis  functions  localized  to  one  region  of  space. 
They  are  distinguished  from  one  another  accord¬ 
ing  to  whether  the  principal  objects  emphasized 
are  density  matrices  or  wave  functions.  Li,  Nunes, 
and  Vanderbilt  [16,  17]  obtain  a  variational  density 
matrix,  constrained  to  be  idempotent  with  off- 
diagonal  elements  equal  to  zero,  if  they  ''connect'' 
basis  functions  separated  by  a  distance  larger  than 
a  parametric  "cutoff"  distance.  The  method  is  ap¬ 
plied  in  the  context  of  tight-binding  models  of 
solids.  In  a  closely  related  work  Daw  [18]  obtains  a 
variational  sparse  density  matrix  and  applies  this 
method  to  three  examples,  viz.,  a  one-dimensional 
infinite  chain,  a  semi-infinite  chain,  and  a  vacancy 
in  a  simple  cubic  lattice.  Hernandez  and  Gillan 
[19]  employ  a  density  matrix  method  that  relies 
closely  upon  that  of  references  [16,  17]  to  calculate 
results  for  crystalline  silicon  systems. 

Mauri,  Galli,  and  Car  [20,  21]  obtain  molecular 
orbitals  by  a  Kohn-Sham  related  scheme,  which 
uses  basis  orbitals  defined  in  local  regions  of  direct 
space.  They  apply  their  method  to  semiconducting 
systems  and  in  particular  show  that  it  is  useful  for 
obtaining  molecular  dynamics  simulations  of  sili¬ 
con  atoms  in  a  diamond  lattice  configuration  [20] 
and  of  liquid  carbon  at  5000  K  [21].  Kim,  Mauri, 
and  Galli  [22]  address  explicitly  a  multiple  mini¬ 
mum  problem  of  energy  functionals  based  on  lo¬ 
calized  basis  orbitals  and  treat  numerous  examples 
concerning  surfaces,  bulk  systems,  and  clusters. 
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Ordejon  and  co-workers  [23],  using  a  method 
similar  to  that  of  Mauri,  Galli,  and  Car  [20,  21], 
obtain  results  for  a  tight-binding  model  of  silicon. 
This  method  is  developed  further  in  a  more  recent 
study  [24],  giving  a  detailed  comparison  of  their 
work  and  that  of  Daw  [18]  and  Li,  Nunes,  and 
Vanderbilt  [16,  17].  Goedecker  and  Teter  [25]  de¬ 
scribe  a  tight-binding  total-energy  and  molecular 
dynamics  algorithm  that  scales  linearly  with  sys¬ 
tem  size  and  is  applicable  with  parallel  computers. 

Yang  and  Lee  [26]  generalize  the  "'divide  and 
conquer"  density  method  [6]  to  one  based  on  the 
density  matrix.  Their  density  matrix  for  a  full 
molecule  may  be  obtained  from  subsystem  contri¬ 
butions  in  a  manner  consistent  with  Mulliken  pop¬ 
ulation  analysis  and  incorporates  zero  elements  in 
a  way  that  is  analogous  to  the  definitions  of  our 
previous  study  [1]  and  that  of  Walker  and  Mezey 
[15].  They  apply  their  method  in  a  density  func¬ 
tional  context  to  4-glycine,  8-glycine,  and  12- 
glycine  polypeptides. 

A  density  functional/Wannier  function  formal¬ 
ism  is  given  by  Kohn  [27]  whose  objective  is  simi¬ 
lar  to  that  of  Lee  and  Yang  [6]  but  "differs  by  the 
central  role  of  the  systematic  construction  of  the 
generalized  Wannier  function." 

In  this  study,  we  present  purely  quantum  me¬ 
chanical  calculations.  The  only  use  of  crystallogra¬ 
phy  is  to  provide  atomic  coordinates.  The  purpose 
of  the  calculations  is  to  illustrate  how  fragment 
calculations,  using  standard  programs,  can  be  used 
to  obtain  parts  of  projector  matrices.  These  parts 
can  be  added  together  to  form  the  projector  matrix 
for  a  large  molecule,  and  the  calculations  increase 
linearly  with  increase  in  the  number  of  atoms  in 
the  structure.  When  the  calculation  is  programmed 
for  high-performance  parallel  computers,  the  in¬ 
crease  in  computing  time  should  be  very  small. 
The  details  of  the  computation  of  the  individual 
parts  of  the  projector  matrix  from  the  fragments 
and  how  the  parts  are  combined  are  discussed  in 
detail.  Once  the  projector  matrix  is  completed, 
wave  functions  follow  by  standard  methods  of 
quantum  mechanics. 

All  of  the  above  methods  emphasize  that  their 
computational  difficulty  increases  linearly  with  the 
number  of  atoms  in  the  system,  in  contrast  to 
standard  methods  which  scale  with  a  higher  power 
of  that  number. 

The  various  aspects  of  the  fragment  calcula¬ 
tions  used  in  this  study  for  the  molecule  of 
leu^-zervamicin  differ  somewhat  from  the  de¬ 


scription  in  our  previous  work  [1]  and  will  be 
discussed  in  some  detail. 


Initial  Considerations 

The  fragment  calculations  to  be  presented  in 
this  study  are  based  on  structural  data,  i.e.,  atomic 
positions,  for  the  molecule  leu^-zervamicin, 
Ac-Leu-Ile-Gln-Iva-Ile-Thr-Aib-Leu-Aib-Hyp-Gln- 
Aib-Hyp- Aib-Pro-Phol  ( Aib :  a-aminoisobutyric 
acid;  Iva:  isovaline.  Hyp:  4-hydroxyproline;  Phol: 
phenylalininol)  obtained  from  a  crystal  structure 
determination  [28].  The  formation  of  molecular 
orbitals  follows  the  procedure  presented  in  Ref. 
[1],  with  the  necessary  adjustments  for  the  differ¬ 
ent  peptide  residues  and  solvents  present. 

A  typical  molecular  orbital  is 

835 

(t>i  =  L  Cijipj,  (1) 

;=i 

where  the  ipj  are  individual  orbitals  and  the 
are  the  coefficients.  The  upper  limit  of  the  sum 
follows  from  IS  orbitals  for  each  of  the  160  hydro¬ 
gen  atoms  and  IS,  2S,  2P^,  2?^,  and  2P^  orbitals 
for  each  of  the  135  other  atoms,  giving  835  total 
orbitals,  ipj.  The  hydrated  peptide  has  1048  elec¬ 
trons  and  therefore  524  doubly  occupied  molecular 
orbitals  and  311  unoccupied  ones. 

The  electron  density  distribution  p  is 

524 

P  =  2  E  (2) 

i=l 

The  two  on  the  right  side  of  Eq,  (2)  reflects  the 
fact  that  the  orbitals  are  doubly  occupied.  If  the  ijjj 
form  a  column  vector  in  Eq.  (1),  then  so  do  the 
and 

p  =  2  tr  .  (3) 

The  trace  includes  only  occupied  molecular  or¬ 
bitals.  We  may  write 

<|>  =  C^.  (4) 

and  from  Eq.  (3)  it  follows  that 

p  =  2trCx|jili^C^  =  2trC^Ci|i\l#^  (5) 

The  following  definitions  are  made 

S  =  J dr,  (6) 
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where  the  integration  is  performed  over  the  indi¬ 
vidual  elements  of  the  product  matrix 


R  =  C^C 


(7) 


and 


RS  =  p,. 


(8) 


produces  524  molecular  orbitals,  each  having  835 
terms. 

According  to  Eq.  (3), 


j p  dr  ^  2tx  j c{)<}>^  dr  =  1048,  (16) 


where  is  a  projector.  The  subscript  a  indicates 
that  unless  special  steps  are  taken  in  forming  the 
matrix  i|i  the  projector  matrix  will  not  be  symmet¬ 
ric.  It  can  be  shown  that,  as  a  consequence  of  the 
fact  that  the  ^  are  composed  of  elements  that  are 
orthonormal, 

P/  =  P.  (9) 

and 

trP,  =  N,  (10) 

where  N  is  the  number  of  doubly  occupied  or¬ 
bitals  in  the  molecule  of  interest.  Equation  (9)  is 
the  projector  property. 

It  is  worthwhile  to  have  a  projector  P^  that  is 
symmetric  since  it  reduces,  almost  by  a  factor  of  2, 
the  number  of  elements  in  the  projector  that  must 
be  considered.  From  Eq.  (8),  it  follows  that 

(11) 

This  matrix  product  is  a  symmetric  projector  P^ 
and  may  be  written 

P,  =  (12) 

It  follows  from  Eqs.  (6)-(8)  and  (11)  that  the  elec¬ 
tron  density  can  be  written 

p  =  2trRi|;i|/'  =  2  tr  (18) 

and 

p  =  2  tr  (14) 

In  view  of  Eqs.  (4),  (13),  and  (14),  it  is  worth¬ 

while  to  characterize  C  and  the  molecular  orbitals. 
The  matrix  C  has  524  rows  and  835  columns. 


^1,2 

Q,3 

Q,835 

c  = 

^2,1 

^2,2 

^2,3 

^2,835 

Q24, 1 

Q24, 2 

Q24,3 

^24,835 

C  has  835  rows  and  524  columns.  The  product 
C '  C  produces  a  square  matrix  of  dimension  835.  If 
C  multiplies  the  column  matrix  of  orbitals  ij/,  it 


where  the  integral  is  taken  over  all  three-dimen¬ 
sional  space.  According  to  Eqs.  (5)-(10), 

trP,  =  524.  (17) 


Fragment  Calculation:  Example 
and  Results 

The  purpose  of  the  fragment  calculations  pre¬ 
sented  here  is  to  obtain  an  ab  initio  projector 
matrix  P,  when  ab  initio  calculations  of  an  entire 
molecule  is  either  not  feasible  or  considered  to  be 
too  time-consuming.  In  our  calculations,  a  frag¬ 
ment  consists  of  an  inner  core,  or  "kernel,"  and  a 
number  of  neighboring  atoms  called  a  "neighbor¬ 
hood."  The  molecule  is  divided  into  a  suitable 
number  of  kernels,  which,  when  recombined,  form 
the  complete  molecule. 

Since  the  coordinates  of  the  structure  of  interest 
are  available,  it  is  readily  possible  to  calculate 
which  atoms  would  occur  within  a  particular  cho¬ 
sen  distance  from  each  of  the  atoms  in  a  kernel. 
Such  atoms  would  form  the  neighborhood.  In  or¬ 
der  to  effect  a  pairing  of  all  electrons  and  an  even 
number  of  electron  pairs,  it  may  be  necessary  to 
attach  some  hydrogen  atoms  to  atoms  belonging  to 
the  neighborhood  or  a  moiety  that  is  part  of  the 
molecule,  or  both.  This  can  extend  the  outer  limits 
of  the  fragment  somewhat.  It  was  used  in  the 
leu^  -zervamicin  example  to  be  described. 

A  rule  employed  in  our  procedure  is  that  all 
atoms  present  must  be  a  member  of  some  kernel 
once  and  only  once.  With  atomic  positions  held 
fixed,  the  electron  density  distribution  in  a  frag¬ 
ment  is  computed.  Such  a  calculation  delivers  con¬ 
tributions  to  an  R  matrix  and  an  S  matrix  from 
which  the  portion  that  concerns  the  kernel  is  saved 
in  the  form  of  P^  =  SY^Kj^SY^  where  the  sub¬ 
script  k  refers  to  a  kernel  matrix.  The  elements 
that  are  saved  in  a  kernel  projector  matrix  are 
described  as  follows.  Those  contributions  to  the  P 
matrix  involving  orbitals  from  a  neighborhood 
atom  and  an  atom  in  the  kernel  are  saved  at  the 
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fractional  value  of  one-half,  in  accordance  with  the 
above  rule.  If  all  neighborhood  atoms  occur  only 
once  as  part  of  a  kernel,  another  one-half  value 
would  be  added  to  those  contributions  to  the  P 
matrix  already  saved  at  one-half  values,  when  the 
values  associated  with  the  adjoining  kernels  are 
calculated.  Contributions  from  pairs  of  atoms,  both 
in  the  same  kernel,  are  saved  with  a  coefficient 
of  1. 

In  our  previous  procedure  [1],  we  saved  the 
R^XO)  instead  of  the  P^(0)  and  obtained  an  R(0) 
matrix  for  the  full  molecule  by  combining  all  the 
R^XO)  for  the  various  kernels.  The  P^(0)  matrix  was 
then  obtained  by  multiplying  the  R(0)  matrix  by 
S(0)  according  to  Eq.  (8).  The  P^  are  saved  here 
instead  and  in  symmetric  form.  The  P^^  are  very 
good  kernel  representations  and  lead  to  a  full  P^ 
matrix  that  is  a  very  good  projector,  an  improve¬ 
ment  on  P/0).  The  symbol  (0)  implies  a  certain 
pattern  of  zeros  that  was  special  to  the  cyclic 
hexapeptide  treated  in  our  previous  article  [1].  For 
this  hexapeptide,  it  was  possible  to  obtain  good 
results  by  defining  the  single  adjacent  peptide 


residue  on  both  sides  of  a  kernel  residue  as  a 
neighborhood.  As  a  consequence  of  the  denser 
packing  of  residues  in  leu^-zervamicin,  more 
residues  were  required  to  form  the  neighborhoods 
of  each  kernel. 

The  matrix  S,  defined  in  Eq.  (6),  is  a  matrix 
representing  the  overlap  integrals  of  pairs  of  or¬ 
bitals.  For  pairs  of  orbitals  belonging  to  atoms  that 
are  separated  by  large  distances,  the  values  of  the 
overlap  integrals  will  be  close  to  zero.  The  behav¬ 
ior  of  S  is  the  reason  why  the  fragment  calcula¬ 
tions  can  give  accurate  values  for  the  elements  of  P 
for  the  molecule  as  a  whole. 

The  fragment  calculations  for  the  hydrated  hexa- 
decapeptide,  leu^-zervamicin,  were  performed  by 
defining  19  kernels  as  the  16  peptide  residues,  two 
clusters  of  water  molecules  and  a  cluster  of  a 
water  and  an  ethanol  molecule.  In  this  application, 
the  neighborhoods  were  formed  with  atoms  within 
5  A  of  the  kernels  plus  some  few  additions  to 
assure  that  all  electrons  were  paired  and  the  num¬ 
ber  of  electron  pairs  was  even.  The  kernels  and 
their  neighborhoods  are  listed  in  Table  1.  The  num- 


TABLE  I  _ _ — - 

Composition  of  the  19  fragments,  i.e.  kernels  and  their  corresponding  neighborhoods,  used  In  the  calculation 

of  the  P  matrix  for  the  hydrated  hexadecapeptide,  leu^  -zervamicin.® 


Kernel  Neighbors 


Kernel 

Neighborhood 

No.  of 
Atoms 

No.  of 
Basis 

No.  of 
Atoms 

No.  of 
Basis 

1  (Ac-Leu) 

2, 3, 4, 5,  H6a 

27 

69 

99 

275 

2  (He) 

1,3, 4, 5, 6,  H7a,Wb3 

19 

51 

124 

348 

3  (Gin) 

1,2,4,5,6,7,  N8,  H8a  Wb2,Wa3,W4 

17 

53 

147 

403 

4  (Iva) 

1,2,3,5,6,7,8,  H9a,  Wb2 

16 

44 

148 

412 

5  (He) 

1,2,3,4,6,7,8,9,  H10f 

19 

51 

159 

447 

6  (Thr) 

01,2,3,4,5,7,8,9,10,  Wb2,  Wa3,Wb3 

14 

42 

158 

446 

7  (Aib) 

02,3,4,5,6,8,9,1 0,H1 1  e,H1 1  h,Wb2,Wa3 

13 

37 

153 

441 

9  (Leu) 

03,4,5,7,9,10,11,12 

19 

51 

143 

411 

9  (Aib) 

04,5,6,7,8,10,11,12,13 

13 

37 

164 

465 

1 0  (Hyp) 

05,6,7,8,9,11,12,13,14,  Wa1,Wa2 

15 

47 

142 

414 

11  (Gin) 

07,8,9,10,12,13,14,15,  Wal ,  Wa2 

17 

53 

164 

480 

12  (Aib) 

8,9,10,11,13,14,15,  H16a,  H16c,  EtOH,  Wal 

13 

37 

141 

401 

13  (Hyp) 

9,10,11, 12, 14,15,16,EtOH,  Wal ,  Wa2,  W8 

15 

47 

142 

410 

14  (Aib) 

10,11,12,13,15,16,  Wal,  Wa2 

13 

37 

117 

345 

15  (Pro) 

11,12,13,14,16 

14 

42 

143 

411 

16  (Phol) 

012,13,14,15,  EtOH,  W8 

23 

67 

112 

311 

EtOH,  W8 

12,13,16 

12 

28 

116 

324 

Wal  ,Wa2 

10,11,12,13,14 

6 

14 

85 

253 

Wb2,Wa3,Wb3,W4 

2,3,4, 6,7 

12 

28 

145 

411 

®The  individual  numbers  in  the  first  column,  associated  with  the  16  sequential  peptide  residues,  imply  the  same  corresponding 
residues  in  column  2.  Other  numbers  in  column  2  have  letters  with  them,  e.g.,  H  for  hydrogen,  O  for  oxygen,  N  for  nitrogen,  and  W 
for  water.  EtOH  symbolizes  ethanol.  The  structural  aspects  of  these  symbols  are  to  be  found  in  Ref.  [28]. 
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bers  refer  to  the  peptide  residues  in  the  sequence 
that  was  given  above  in  the  description  of  the 
chemical  content  of  the  leu^-zervamicin  molecule. 
The  symbols  in  Table  I  correspond  to  those  found 
in  the  crystal  structure  analysis  [28].  The  crystal 
structure  analysis  provided  the  atomic  coordinates 
used  in  the  calculations  reported  in  this  article  and 
also  afforded  the  information  from  which  the  se¬ 
lection  of  the  associated  solvent  molecules  was 
based.  The  last  four  columns  of  Table  I  show  the 
number  of  atoms  and  the  number  of  basis  func¬ 
tions  for  each  kernel  and  for  each  neighborhood. 

The  Hartree-Fock  calculations  for  the  entire  hy¬ 
drated  leu^-zervamicin  molecule  and  for  the  sepa¬ 
rate  fragment  calculations  were  made  by  use  of  the 
Gaussian  94  program  [29]  employing  the  ST03g 
basis.  The  structure  of  the  R  matrix,  obtained  from 
the  calculations,  which  is  preliminary  to  obtaining 
the  projector  matrix  by  use  of  Eq.  (12),  is  now 
presented  in  some  detail,  A  schematic  diagram  of 
the  entire  R  matrix,  obtained  from  the  fragment 
calculations,  is  shown  in  Figure  1,  The  horizontal 
rows  are  characterized  in  Table  I.  All  of  the  squares 
in  Figure  1  contain  numerous  elements  of  the  R 
matrix  which  has  a  dimension  of  835.  The  values 
of  the  elements  in  the  dark  blocks,  obtained  from 
the  fragment  calculations  are  entered  in  the  dark 
blocks  with  a  coefficient  of  1.  The  values  of  the 
elements  in  the  gray  areas  are  formed  from  two 
fragment  calculations,  one  in  which  one  orbital  is 
part  of  a  kernel  atom  and  the  other  is  part  of  a 
neighborhood  atom  and  a  second  calculation  in 
which  the  roles  of  the  same  two  orbitals  are  inter¬ 
changed.  Each  such  fragment  calculation  produces 
a  value  which  is  saved  with  a  coefficient  of  one 
half.  The  two  values  are  then  added  together  to 
give  values  for  the  appropriate  elements  in  the  R 
matrix  [1]. 

The  number  of  elements  for  the  R  matrix  repre¬ 
sented  in  the  various  boxes  in  Figure  1  varies 
considerably  and  may  be  judged  by  examination 
of  Table  I  with  respect  to  the  atomic  composition 
of  the  kernels  and  also  their  neighborhoods.  Nev¬ 
ertheless  each  row  and  each  column  of  both  R  and 
P  has  a  total  of  835  elements.  The  occurrence  of 
empty  boxes  in  Figure  1  is  a  consequence  of  the 
limitation  put  on  the  extent  of  the  neighborhoods. 
Estimates  of  the  validity  of  the  limitation  can  come 
from  various  comparisons,  one  of  the  most  obvi¬ 
ous  being  an  inspection  of  corresponding  elements 
in  the  full  molecule  calculation  and  the  fragment 
calculation.  There  are  a  total  of  697,225  elements  in 
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FIGURE  1 .  Schematic  diagram  of  the  R  matrix  obtained 
from  the  fragment  calculations  for  hydrated  leu''  - 
zervamicin.  The  numbers  in  the  figure  label  the  kernels 
as  in  Table  I.  The  elements  in  the  dark  squares  are 
composed  of  functions  of  the  coefficients  of  pairs  of 
orbitals  associated  with  the  atoms  of  the  particular  kernel. 
The  elements  in  the  gray  squares  and  hatched  squares 
are  composed  of  functions  of  the  coefficients  of  pairs  of 
orbitals,  one  of  which  is  associated  with  a  kernel  atom  of 
the  dark  square  in  the  same  row  and  the  other  is 
associated  with  an  atom  of  the  labeled  neighborhood. 
The  contents  of  the  dark  square  labeled  by  kernel  7  are 
shown  in  Figure  2  at  higher  resolution. 


the  matrix  and  412,902  of  them,  i.e,,  59,2%,  have  a 
zero  value. 

The  numbers  label  the  kernels  as  in  Table  I.  The 
elements  in  the  squares  are  functions  solely  of  the 
coefficients  of  pairs  of  orbitals  associated  with 
the  kernels  that  label  the  squares.  The  elements  of 
the  dark  block-diagonal  squares  are  functions  of 
the  coefficients  of  pairs  of  orbitals  associated  with 
kernel  atoms.  The  gray  squares  in  the  same  row  as 
a  dark  square  contain  as  elements  functions  of  the 
coefficients  for  pairs  of  orbitals,  one  of  which  is 
associated  with  a  kernel  atom  and  one  with  a 
neighborhood  atom.  The  hatched  squares  contain 
elements  that  are  functions  of  the  coefficients  of 
pairs  of  orbitals  from  only  some  of  the  neighbor¬ 
hood  atoms  represented  by  the  hatched  square  and 
only  some  of  the  kernel  atoms  represented  by  the 
dark  square  in  the  same  row. 

Figure  2  is  an  enlargement  of  the  dark  block  in 
Figure  1  which  represents  the  elements  of  the  R 
matrix  formed  from  functions  of  the  coefficients  of 
the  orbitals  associated  with  the  atoms  comprising 
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kernel  number  7.  Those  atoms  label  the  individual 
squares  that  contain  within  them  the  elements  of 
the  R  matrix  that  derive  from  the  orbitals  associ¬ 
ated  with  the  labeling  atoms. 

A  further  enlargement  of  the  box  labeled  N7, 
N7  in  Figure  2  is  made  in  Figure  3.  The  specific 
orbitals  associated  with  N7  label  the  squares  that 
contain  within  them  a  single  function  of  the  coeffi¬ 
cients  of  the  corresponding  orbitals.  The  numerical 
values  of  the  functions  in  this  calculation  are  given. 

As  mentioned  above,  as  soon  as  a  contribution 
to  the  R  matrix  was  obtained  from  a  fragment 
calculation,  it  was  converted  to  the  corresponding 
contribution  to  the  P  matrix  by  the  use  of  Eq.  (12) 
and  stored.  The  sum  of  all  such  contributions  forms 
the  P  matrix  for  the  complete  structure.  Also  noted 
previously  was  the  fact  that  a  consequence  of  the 
fragment  calculations  is  that  the  values  of  the 
elements  in  the  blank  squares  are  replaced  by 
zeros.  The  resulting  P  matrix  has  the  same  pattern 
of  zeros  as  the  R  matrix,  but  the  filled  squares 
have  more  complicated  functions  of  the  orbital 
coefficients  and  do  not  follow  the  labeling  of  the 
rows  and  columns  of  Figure  1.  Both  R  and  P  are 
symmetric. 


N? 


FIGURE  2.  Contents  at  higher  resolution  of  the  dark 
square  in  Figure  1  labeled  by  kernel  7.  The  elements  in 
the  squares  are  functions  of  the  coefficients  of  the  orbitals 
associated  with  the  atoms  that  label  the  squares.  The 
contents  of  the  square  labeled  by  N7  are  shown  in 
Figure  3  at  orbital  resolution. 
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FIGURE  3.  Contents  at  orbital  resolution  of  the  square 
in  Figure  2  labeled  by  N7.  The  elements  in  the  squares 
are  functions  of  the  coefficients  of  the  orbitals  that  label 
the  squares.  These  rather  complex  functions  of  the 
coefficients  have  the  values  given  in  the  squares. 


Comparison  of  Electron  Densities 

Isodensity  surfaces  have  been  calculated  by  use 
of  Eqs.  (1)  and  (2)  from  the  Hartree-Fock  orbitals 
for  the  hydrated  leu^-zervamicin  molecule.  Use 
was  made  of  the  P  matrix  obtained  at  once  for  the 
full  molecule  and  that  obtained  from  the  fragment 
calculations.  The  two  sources  gave  electron  densi¬ 
ties  that  appeared  to  be  quite  similar.  Therefore,  a 
portion  of  the  one  obtained  from  the  full  molecule 
calculation,  as  a  representative  of  both  types  of 
calculation,  is  illustrated  in  Figure  4,  at  an  isoden¬ 
sity  surface  of  0.005  e/A^.  Confining  the  computed 
volume  to  a  region  of  interest,  as  illustrated,  saves 
time  and  memory  when  desirable  or  necessary.  A 
ball-and-stick  model  of  the  structure  is  superim¬ 
posed.  In  order  to  obtain  a  more  quantitative  in¬ 
sight  into  the  similarity  of  both  types  of  density 
calculation,  a  series  of  difference  isodensity  sur¬ 
faces  were  calculated  in  which  differences  that  did 
not  exceed  increasingly  larger  values  were  omit¬ 
ted.  Evidently,  this  is  a  calculation  that  can  deter¬ 
mine  and  locate  the  largest  differences  between 
the  electron  densities. 

The  difference  isodensity  surfaces  shown  in  Fig¬ 
ures  5  and  6  were  obtained  from  Fp  -  Ff.  by  use  of 
Eq.  (14),  where  the  subscripts  imply  full  molecule 
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FIGURE  4.  Isodensity  surface  of  0.005  e/A^  within  a  selected  volume  for  hydrated  leu^  -zervamicin.  This  isodensity 
surface  was  obtained  from  a  Hartree-Fock  calculation  performed  on  the  entire  hydrated  molecule  and  the  further  use  of 
Eqs.  (1)  and  (2)  applied  to  the  resulting  wave  functions.  A  ball-and-stick  model  of  the  molecule  is  superimposed  on  the 
isodensity  surface. 


(F)  and  sum  over  kernel  (K)  matrices.  A  differ¬ 
ence  isodensity^  surface  is  shown  at  the  values  of 
10  ^  iq~3  Figure  5,  and  —1.2  X  10“^ 

e/A'  in  Figure  6.  Some  small  fuzzy  regions  are 
visible  at  which  there  are  differences  as  large  or 
larger  than  the  values  of  the  difference  isodensities 
shown.  The  fuzzy  regions  should  all  disappear  at 
slightly  larger  difference  isodensities.  Evidently, 
the  fuzzy  regions  in  Figures  5  and  6  are  quite 
small  and  highly  localized. 

The  difference  studies  shown  in  Figures  5  and  6 
indicate  that  the  electron  density  is  well  repre¬ 
sented  by  the  P  matrix  obtained  from  the  fragment 
calculations.  As  has  been  noted,  the  method  for 
developing  a  P  matrix  presented  here  differs  from 
the  one  presented  in  our  previous  article  [1].  The  P 
matrix  formed  here  is  more  accurate  than  that 
formed  from  the  R  matrix  in  the  previous  study.  It 
was  not  necessary  to  employ  the  Clinton  equations 
here  to  improve  the  projector  quality.  In  fact,  use 
of  the  Clinton  equations  left  the  good  quality  of 
the  P  matrix  unchanged.  Even  greater  accuracy 


may  accrue  if  more  atoms  were  included  in  the 
neighborhoods  of  the  kernels. 


Concluding  Remarks 

With  the  use  of  the  known  structure  of  the 
molecule,  leu^ -zervamicin,  from  a  crystal  struc¬ 
ture  analysis,  it  has  been  shown  once  again  that  it 
is  feasible  to  make  ab  initio  quantum  mechanical 
calculations  to  good  approximation  for  very  large 
molecules.  With  the  use  of  sequential  computers, 
the  computations  increase  only  linearly  with  time. 
Were  programs  to  become  available  for  parallel 
computers,  the  time  required  for  very  large  struc¬ 
tures  should  increase  even  more  slowly.  The  key 
to  such  computations  is  the  fragment  calculation 
wherein  a  molecule  is  divided  into  kernels  and  ab 
initio  calculations  are  performed  on  each  of  the 
kernels  and  their  corresponding  neighborhoods  of 
atoms.  The  theoretical  basis  for  the  fragment  calcu¬ 
lations  suggests  that  the  larger  the  neighborhood 
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FIGURE  5.  Difference  isodensity  surface  of  1.0  x  10  e/A^  for  the  hydrated  leu^ -zervamicin  molecule.  The 
difference  isodensity  surface  was  obtained  from  Pp  —  P)^.  Small  fuzzy  regions  represent  the  remaining  difference.  They 
involve  a  small  molecular  volume  and  are  evidently  small  in  magnitude. 


FIGURE  6.  Difference  isodensity  surface  of  -1.2  x  10  e/A^  for  the  hydrated  leu’  -zervamicin  molecule.  The 
difference  isodensity  surface  was  obtained  from  Pp  —  Pf^.  Small  fuzzy  regions  represent  the  remaining  difference.  They 
involve  a  small  molecular  volume  and  are  evidently  small  in  magnitude. 
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of  a  kernel,  the  greater  the  accuracy.  The  kernel 
matrices  consisting  of  the  results  of  interactions  of 
the  kernel  orbitals  with  each  other  with  a  weight 
of  one  and  the  results  of  interactions  of  kernel 
orbitals  and  neighborhood  orbitals  with  a  weight 
of  one-half,  are  additive  and  combine  to  form  the 
desired  matrix  for  the  complete  molecule.  The  pro¬ 
cedure  used  here  for  the  calculation  of  the  kernel 
matrices  resulted  in  a  total  matrix  with  good  pro¬ 
jector  properties,  namely,  tr(P^  —  P)^  =  S  X  10~^. 

The  hexadecapeptide  of  this  article  was  treated 
within  the  context  of  the  ab  initio  Hartree-Fock 
approximation.  However,  the  concept  of  extracting 
kernel  matrices  from  fragments  smaller  than  a  full 
molecule  would  be  applicable  within  the  context 
of  any  method  based  on  a  molecular  orbital  repre¬ 
sentation,  e.g.,  density  functional  theory. 

The  kernel-neighborhood  fragment  calculation 
affords  the  opportunity  to  make  ab  initio  calcula¬ 
tions  for  molecules  whose  complexity  exceeds  the 
capacity  of  available  computer  programs,  particu¬ 
larly  since  the  increase  in  computer  time  is  essen¬ 
tially  a  linear  function  of  the  complexity.  In  addi¬ 
tion,  the  fragment  calculation  may  save  time  when 
employed  as  an  alternative  to  a  calculation  made 
on  a  complex,  full  molecule. 

The  P  matrix  obtained  from  fragment  calcula¬ 
tions  may  be  used  to  derive  molecular  properties 
from  the  density  matrix  using  the  standard  rules 
of  quantum  mechanics. 
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ABSTRACT _ _ _ 

We  investigate  the  fission  process  Ag^3  ^  Ag^2  +  Ag^  in  order  to  compare  total  energies 
that  calculated  by  the  shell  correction  method  and  jellium  models.  A  cavity  potential  and 
a  Woods-Saxon-type  potential  are  used  as  the  shell  potential  for  the  shell  correction 
method,  with  which  the  single-particle  energy  levels  are  calculated.  Shell  corrections  are 
obtained  by  using  the  Balian-Bloch  formula  and  by  smoothing  the  discrete  energy  levels 
in  the  shell  potentials.  The  jellium  model  calculations  are  performed  in  the  framework  of 
the  local  spin  density  functional  approximation.  The  conventional  jellium  model  and 
stabilized  jellium  model  are  used.  Although  the  qualitative  agreement  between  the  shell 
correction  method  calculations  and  the  stabilized  jellium  calculations  is  very  good,  an 
improvement  of  the  liquid  drop  energy  will  be  necessary  for  the  satisfactory  quantitative 
agreement.  ©  1996  John  Wiley  &  Sons,  Inc. 


I.  Introduction 

The  concepts  and  methodologies  used  for 
atomic  nuclei  are  very  useful  in  studying 
metal  clusters.  The  method  of  shell  corrections  is 
one  of  them  [1-3].  In  this  method  the  cluster  is 
first  considered  as  a  classical  liquid  drop,  then 
quantum  mechanical  corrections  are  added.  The 
scheme  presented  by  the  method  of  shell  correc¬ 
tions,  i.e.,  a  classical  description  of  a  system  with 

*  To  whom  correspondence  should  be  addressed. 


quantum  mechanical  corrections,  may  be  a  promis¬ 
ing  method  to  deal  with  large  systems  where 
purely  microscopic  calculations  are  practically  im¬ 
possible  but  purely  macroscopic  theories  overlook 
important  details. 

The  most  important  quantum  mechanical  cor¬ 
rection  arises  from  the  inhomogeneous  distribu¬ 
tion  of  single-particle  energy  levels  in  an  effective 
potential  such  as  the  Hartree-Fock  field.  The  effec¬ 
tive  potential  is  not  known  unless  we  calculate  it, 
but  we  may  be  able  to  construct  a  model  potential 
which  mimics  the  effective  potential  very  well. 
This  model  potential  is  called  the  shell  potential. 


International  Journal  of  Quantum  Chemistry:  Quantum  Chemistry  Symposium  30,  1701-1707  (1996) 

©  1 996  John  Wiley  &  Sons,  Inc.  CCC  0360-8832  /  96  /  071 701  -07 


KOIZUMI  AND  FUKUMOTO 


The  single-particle  energy  levels  are  calculated 
with  the  shell  potential  and  then  the  shell  correc¬ 
tion  is  obtained  as  the  energy  fluctuation  caused 
by  the  inhomogeneous  distribution  of  the  discrete 
energy  levels. 

In  order  to  obtain  satisfactory  results  from  shell 
correction  calculations,  we  have  to  have  a  good 
shell  potential.  Actually  the  shell  potential  needs 
not  to  mimic  the  detail  of  the  effective  potential, 
but  the  energy  level  distribution  near  the  highest 
occupied  level  or  the  Fermi  level.  In  the  previous 
calculations  we  used  a  cavity  potential  of  a  cluster 
shape  as  the  shell  potential.  In  the  present  work 
we  also  use  a  Woo ds-Saxon- type  potential,  which 
is  commonly  used  in  nuclear  physics. 

Once  the  single-particle  energy  levels  are  ob¬ 
tained,  the  next  step  is  to  calculate  a  smoothed 
density  of  states.  For  the  cavity  potential,  there 
exists  the  formula  by  Balian  and  Block.  But  such  a 
formula  is  not  available  for  the  Woods-Saxon  po¬ 
tential.  Therefore,  we  will  calculate  the  smoothed 
density  of  states  from  the  discrete  density  of  states 
by  employing  the  standard  method  in  nuclear 
physics  [4]. 

We  also  performed  the  jellium  model  calcula¬ 
tions.  We  used  two  types  of  jellium  models,  the 
conventional  jellium  model,  and  the  stabilized  jel¬ 
lium  model.  The  stabilized  jellium  model  is  a 
modification  of  the  conventional  jellium  model  so 
that  the  jellium  is  in  mechanical  equilibrium.  This 
model  is  known  to  give  much  better  results  for 
surface  energy  of  metals  than  the  conventional  one 
[5,  6]. 

IL  Method 

A.  SHELL  CORRECTiOIX  METHOD 

In  the  shell  correction  method  the  metal  cluster 
is  first  considered  as  a  charged  liquid  drop.  The 
energy  of  the  liquid  drop,  E^q,  is  given  by 

^LD  ^  ^vol  ^surf  +  ^Coulomb/  (2.1) 

where  the  surface  energy,  is  a  product  of 

surface  energy  a  and  surface  area  S,  the  Coulomb 
energy,  is  the  Coulomb  energy  of  the 

charged  conductor,  and  the  volume  energy,  is 
the  energy  proportional  to  the  number  of  the  atoms 
in  the  cluster.  The  volume  energy  is  omitted  in  the 
following  discussion  because  we  only  consider  the 
volume  conserving  deformations  and  fission. 


The  shape  of  the  cluster  is  determined  by  a 
shape  function.  We  used  the  following  shape  func¬ 
tion,  which  is  a  modified  version  of  the  one  used 
in  nuclear  fission  [2]: 

p,(z)  =  s{[l  -  (z/sf]  [(C-3  -  B/5)  +  2aBz/s 

+  B(z/sf\j  .  (2.2) 

The  shape  of  the  cluster  is  given  by 
=  ^ -h  =  p.  The  modification  is  made  so  that 
the  volume  ration  of  two  fragments  is  constant 
along  the  fission  line 

BC^  ^  (2.3) 

where  the  parameter  a  is  determined  by  the  vol¬ 
ume  ratio  of  the  two  fragments  which  contain 
atoms  and  N2  atoms,  respectively: 

N,  (a  ~  1)^4  -h  a) 

—  =  - .  (2.4) 

^2  (a  -h  1)^(4  —  a) 

Now  we  add  the  shell  correction  to  the  liquid 
drop  energy  as  the  quantum  mechanical  correc¬ 
tion.  The  correction  we  take  into  account  is  the 
energy  fluctuation  from  the  average  that  arises 
from  the  inhomogeneous  distribution  of  the  va¬ 
lence  electron  levels. 

The  shell  correction  is  given  by 

oc 

^she\\=  /  £T(s)ds,  (2.5) 

where  and  n-  are  the  energy  and  occupation 
number  of  the  iih  energy  level  in  the  shell  poten¬ 
tial,  r(£:)  is  a  smoothed  density  of  states,  and  Sp  is 
the  Fermi  energy  associated  with  the  smoothed 
density  of  states. 

The  standard  formula  to  obtain  the  smoothed 
density  of  states  from  the  discrete  density  of  states 
of  the  shell  potential,  g(s)  =  E/5(^  —  is  [4] 

ns)  =  y-^  e,)/y],  (2.6) 

i 

where  the  smoothing  function  /  is  expressed  using 
the  liermit  polynomials  Hj,  as 

2M 

fix)  =  exp(-j^)  Y,  cif.H}^ix),  (2.7) 

A:=0,2 
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where 


flo  =  1,  for  k>2.  (2.8) 

We  use  2M  =  6  in  the  present  work. 

We  try  two  types  of  shell  potentials.  One  is  a 
cavity  potential  in  which  free  electrons  move  in 
the  cavity  of  the  cluster  shape,  and  the  other  is  a 
Woods-Saxson-type  potential,  which  is  commonly 
used  in  nuclear  physics. 

The  Woods-Saxson  potential  is  given  by 


Vip,z) 


Vq 

1  +  exp[d(  p,  z)/do]  ' 


(2.9) 


where 


d(  p,  z)  =  ±'\l -  Ps(z)^  /  (2.10) 

and  the  +  sign  is  taken  if  ( p,  2)  is  inside  the 
cluster  and  the  -  sign  is  taken  if  it  is  outside.  The 
parameters  ^od  dg  are  set  to  Vq  =  —  0.9(a.u.), 
do  =  7rr„s,  where  is  the  Wigner-Seitz  radius 
[  =  3.02(a.u.)  for  silver]. 

In  addition  to  the  smoothing  formula  [Eq.  (2.6)], 
the  Balian  and  Bloch  formula  [7]  can  be  used  for 
the  smoothed  density  of  states  in  the  cavity  poten¬ 
tial 


tn 


1 


Vk  - 


ttS 


+ 


(2.11) 


charged  background  jellium.  We  solve  the 
problem  of  valence  electrons  moving  in  the  back¬ 
ground  jellium  by  employing  the  density  func¬ 
tional  method  of  Kohn  and  Sham  [10].  The  local- 
spin  density  functional  by  Vosko,  Wilk,  and 
Nusair  [11]  is  used. 

The  eigenvalue  problem  of  the  Kohn— Sham 
equations  are  solved  using  the  discrete  valuable 
representation  (DVR)  [12-14].  In  this  representa¬ 
tion  the  basis  is  coordinate  points.  A  DVR  basis 
{\x^},  a  =  l, ...,N}  is  related  to  a  finite  basis 
representation  (FBR)  basis  {|  ),  i  ^  1, . . . ,  N}  via  a 

unitary  transformation: 

U„>=  LUilUa)  (2.13) 

r  =  1  a  =  1 

where  the  imitary  matrix  U  is  given  by 

u  =  ys-1/2,  (2.14) 

N 

'I'ai  =  ^ii=  L 

a=l 

(2.15) 

Because  the  shape  fimction  (2.2)  has  an  axial 
symmetry,  we  use  cylindrical  coordinates  assum¬ 
ing  the  axial  symmetry  of  the  Kohn-Sham  orbitals. 
Then  the  angular  momentum  around  the  z  axis,  v, 
becomes  a  good  quantum  number,  and  we  solve 
two-dimensional  eigenvalue  problems  for  each  v. 
As  the  basis  for  the  z  coordinate  we  use 


where 


k  = 


(2.12) 


sin(/,(z„ 


2.  min 


), 


(2.16) 


and  S  and  V  are  surface  area  and  volume  of  the 
cavity,  respectively,  and  Rj  and  R2  are  the  main 
curvatures  of  radii  of  the  surface.  We  used  Eqs. 
(2.6)  and  (2.11)  for  the  smoothed  density  of  states 
for  the  cavity  potential,  while  Eq.  (2.6)  is  used  for 
that  for  the  Woods-Saxson  potential. 

B.  JELLIUM  MODEL 

The  jellium  model  has  been  shown  to  give  satis¬ 
factory  results  for  metal  clusters  [8,  9].  In  this 
model  valence  electrons  move  in  the  positive  ion 
cores  which  is  approximated  by  a  imiformly 


(/,  —  tV/f  Z^3^  Zj„jn),  i 

Za  =  Zmin  +  “(^max  “  «  =  1,...,  N,, 

where  is  the  number  of  grid  points  in  the  z 
coordinate,  z^i^  and  z^^^  are  the  minimum  and 
the  maximum  of  the  grids,  respectively. 

Note  that  the  vectors  <z|(?;>  are  eigenvectors  of 
the  z  component  kinetic  energy 

-^<z|^?,>  =  ^?<z|<j,>.  (2.17) 
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As  the  basis  for  the  p  coordinate,  we  use  the 
following  two  sets: 


<Pa,ol”/0>  =  e  ^  '*—!„(«„  p),  (2.18) 

Pmax 

Pa ,  0  Pmax  ^a,  o/^N^,  0  '  ^  !/•••/  , 


and 


Pa,il«/1)  =  e  L\(a 

PmaxV”  +  1 

Po,l  “  Pmax  '  a  =  1,. 


a,l).  (2-19) 


where  is  the  number  of  grid  points  in  the  p 
coordinate,  p^^^^  is  the  maximum  of  the  grids,  L„ 
and  L\  are  the  Laguerre  polynomials  and  the 
associated  Laguerre  polynomials,  respectively,  and 
a^  Q  and  ^  are  their  zeros,  respectively.  The 
reason  we  use  two  basis  sets  is  that  the  behavior  of 
wave  functions  at  p  =  0  is  significantly  different 
depending  on  whether  the  angular  momentum 
around  z  axis  is  even  or  odd.  Therefore  changing 
grid  points  increases  the  accuracy  of  the  calculated 
energy  levels. 

Note  that  the  vectors  (  p\n,  v}  are  eigenvectors 
of  the  two-dimensional  harmonic  oscillator 

1  P"  IP  p  Y 

2  Pp2  2p  dp  ^  2p2  ^ 

X<  p\n,  v} 

=  -^O.n  +  V  +  \){p\n,v'}.  (2.20) 

Pmax 


The  bases  for  z  and  p  are  combined  to  the  total 
DVR  basis,  {|  p«,  o  >  ®  I  »  and  {|  p^  >  g)  |  z^  >}.  The 
first  set  is  used  if  the  angular  momentum  around 
the  z  axis  of  the  single-particle  level  is  even,  and 
the  second  set  is  used  if  it  is  odd.  This  distinction 
is  important  to  obtain  accurate  single-particle  en¬ 
ergy  levels. 

In  addition  to  the  conventional  jellium  model, 
we  try  the  stabilized  jellium  model.  It  has  been 
pointed  out  that  the  normal  jellium  model  is  not  in 
mechanical  equilibrium  if  it  is  applied  to  metals 
other  than  sodium.  Methods  of  improvement  to 
take  account  mechanical  equilibrium  have  been 
presented  by  Perdew  et  al.  [5]  and  Shore  et  al.  [6]. 


By  following  Shore  we  add 

^0  ^  ~  (2.21) 

to  the  chemical  potential  inside  the  jellium,  where 
Cjeii(n)  is  the  energy  per  electron  in  the  uniform 
electron  gas  of  density  n  and  is  the  chemical 
potential  of  the  uniform  electron  gas. 

Using 

(2.22) 

d 

P'jell(^)  =  (2,23) 

and  the  local  density  functional  of  VWN  [11],  we 
obtain  Vq  =  -0.024(a.u.)  for  silver  [n  =  3/(4:7rr^^), 
r^g  =  3.02(a.u.)].  In  the  actual  calculation  the  step 
function  causes  instability  in  the  calcula¬ 
tion,  Therefore  we  broaden  it  by  using  the 
Woods-Saxon  potential  form  (2.9)  with  Vq  =  Vq, 

^0  ^ws* 


III.  Results  and  Discussion 

We  apply  the  jellium  model  and  shell  correction 
method  calculations  to  the  fission  Ag2^  ^  ^Si2  + 
Agj^^.  In  Figure  1  shapes  of  the  cluster  at  various 
values  of  parameters  B  and  C  are  depicted.  Pa¬ 
rameter  C  describes  elongation  and  B  describes 
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FIGURE  1 .  Shapes  of  the  cluster  at  various  values  of  B 
and  C  for  Ag+ +  Ag^"^  (a  =  0.0116). 
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neck  formation  and  scission.  The  liquid  drop  en¬ 
ergy  surface  is  shown  in  Figure  2.  This  surface  is 
very  smooth  and  does  not  have  much  structure.  In 
Figure  3  the  total  energy  surface  obtained  by  the 
shell  correction  method  is  depicted.  This  surface  is 
rich  in  structure  due  to  the  shell  correction.  There 
is.  a  local  minimum  at  C  =  1.5,  B  =  0.5.  A  ridge 
appears  around  B  =  0.52,  2  <  C  <  3.  There  are 
two  saddles  for  fission:  One  of  them  is  around 
C  =  2.6,  B  =  0.25  and  the  other  is  around  C  =  1.5, 
B  =  1.2.  Usually  only  one  saddle  dominates  [2], 
but  for  this  fission  process,  there  are  two  compara¬ 
ble  saddles  for  fission. 

Now  we  compare  total  energies  obtained  by  the 
shell  correction  method  and  the  jellium  models 
along  two  fission  paths  shown  in  Figure  2.  These 
two  paths  pass  near  the  two  saddles  for  fission  by 
the  shell  correction  method  estimate.  The  frag¬ 
ments  are  prolatelike  along  path  1,  while  the  frag¬ 
ments  are  oblatelike  along  path  2.  It  is  predicted 
that  path  1  produces  low-spin  fragments,  AgJ'iCS 
=  0)  and  Agj2(S  =  i),  while  path  2  produces  the 
high-spin  fragments,  AgJjCS  =  1)  and  Ag^2^^  ^  f) 
[15]. 


1  1.5  2  2.5  3 


c 

FIGURE  2.  Contour  plot  of  the  liquid  drop  energy  for 
Agis Ag+2+  Agi%  The  zero  of  the  energy  is  at  C  =  1 , 
6  =  0.  The  contours  are  from  1  to  4  eV  with  spacing  of  1 
eV.  The  gray  levels  of  the  shading  run  from  black  to 
white  with  increasing  energy.  The  lines  denoted  by  1 
and  2  are  path  1,6  =  0.5(C  —  1),  1  <C<1.5  and 
6  =  0.25,  1 .5  <  C  <  3;  and  path  2,  6  =  6(C  -  1),  1  <  C 
<  1.2  and  6  =  1.2,  1.2  <  C  <  1.75,  respectively.  The 
sold  curve  is  the  fission  line. 


The  parameters  for  the  jellium  calculations  are 
^max  ”  35.0(a.U.),  35.0(a.U.),  Pmax 

25.0(a.u.),  N,  =  50,  and  =  30.  The  convergence 
problem  of  self-consistent  calculation  caused  by 
near-degenerate  levels  are  handled  by  broadening 
the  occupation  numbers  using  the  finite  tempera¬ 
ture  distribution.  The  temperature  is  set  to  500  K. 
It  is  seen  in  the  calculation  that  if  we  use  a  much 
higher  temperature  than  500  K,  the  energy  barrier 
is  reduced  but  it  does  not  change  much  by  raising 
the  temperature  up  to  1000  K.  Therefore  the  broad¬ 
ening  by  500  K  will  not  change  the  barrier  signifi¬ 
cantly. 

In  Figure  4  the  total  energy  and  shell  correction 
along  path  1,  B  =  0.5(C  —  1),  1  <  C  <  1.5;  B  = 
0.25,  1.5  <  C  <  3,  are  depicted.  The  cluster  first 
elongates  and  then  disintegrates  along  this  path. 
Shell  corrections  are  qualitatively  the  same  irre¬ 
spective  of  the  method  as  is  seen  in  the  bottom 
figure  of  Figure  4.  The  abrupt  jump  at  C  =  2.75 
seen  in  the  shell  correction  energy  by  the 
Balian-Bloch  formula  is  an  artifact.  This  comes 
from  the  difference  in  the  accuracy  of  the 
Balian-Bloch  formula  for  a  single  cluster  and  for 
fragments.  Because  the  number  of  atoms  in  the 
single  cluster  and  its  fragments  are  different,  the 


1  1.5  2  2.5  3 


c 

FIGURE  3.  Contour  plot  of  the  total  energy  for  Agfa  ^ 
Ag4+  Agi%  The  shell  correction  is  calculated  with  the 
cavity  potential,  y  =  0.15(a.u.).  The  zero  of  the  energy  is 
the  liquid  drop  energy  at  C  =  1 ,  6  =  0.  The  contours  are 
from  0  to  6  eV  with  spacing  of  1  eV.  The  gray  levels  of 
the  shading  run  from  black  to  white  with  increasing 
energy. 
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FIGURE  4.  The  total  energy  and  shell  correction  along 
path  1 .  Top:  Total  energies  by  the  shell  correction  method 
with  the  cavity  potential,  7  =  0.15(a.u.)  (solid  line),  by  the 
shell  correction  method  with  the  Woods -Saxon  potential, 
y=0.15(a.u.)  (dotted  line),  by  the  stabilized  jellium 
model  (thick  solid  line),  and  by  the  normal  jellium  model 
(dash-dot  line).  Middle:  The  shape  of  the  cluster  at 
C  =  0,  1.5,  2,  2.5,  and  3.  Bottom:  Shell  corrections  by 
the  shell  correction  methods  with  the  cavity  potential, 
7=  0.15(a.u.)  (solid  line),  with  the  cavity  potential,  7  = 
0.35(a.u.)  (dash-dot  line),  with  the  Woods -Saxon 
potential,  7=  0.15(a.u.)  (dotted  line),  and  with  the  cavity 
potential  using  the  Ballan -Bloch  formula  (thick  solid 
line). 


accuracy  of  the  Balian-Bloch  formula  is  different. 
This  difference  will  not  be  significant  for  larger 
clusters,  but  it  causes  a  sudden  jump  of  the  shell 
correction  for  the  cluster  size  considered  here. 

The  smoothed  density  of  states  obtained  by  Eq. 
(2.6)  does  not  have  this  problem.  The  smoothing 
parameter  7  is  of  the  order  of  typical  spacing  of 
energy  levels.  The  error  of  the  calculated  shell 
correction  may  be  minimized  by  finding  7  that 
satisfies  the  stationary  condition 


^^shell 

8y 


=  0. 


(3.1) 


We  first  tried  to  find  the  stationary  7  for  each 
cluster  shape,  but  we  could  not  find  a  stationary 
value  for  some  shapes.  Therefore  we  decided  to 
keep  using  the  constant  7  value  through  a  fission 
path,  where  the  value  is  an  average  that  satisfy  the 
stationary  condition  near  spherical  shapes.  We  use 
two  values  for  7:  7  =  0.15(a.u.)  and  7  =  0.35(a.u.), 
where  the  former  is  in  the  order  of  typical  spacing 
of  levels  and  the  latter  is  almost  the  same  as  the 
energy  difference  between  the  ground  level  and 
the  Fermi  level.  The  shell  correction  obtained  by 
using  7  =  0.35(a.u.)  is  very  similar  to  the  one  ob¬ 
tained  by  the  Balian-Bloch  formula,  but  it  gives  a 
higher  threshold  energy  for  fission  than  7  = 
0.15(a.u.).  It  seems  that  7  =  0.35(a.u.)  is  too  large 
and  7  =  0.15(a.u,)  is  a  better  value. 

A  better  shell  potential  may  be  the 
Woods-Saxon-type  potential.  This  potential  allows 
the  spill-out  of  the  electron  wave  functions.  We 
use  7  =  0.15(a.u.)  as  the  smoothing  parameter.  Due 
to  the  spill-out,  it  is  seen  in  the  shell  correction  by 
the  Woods- Saxson  potential  that  the  local  mini¬ 
mum  around  C  =  1.7  is  shifted  to  a  larger  C  com¬ 
pared  with  the  cavity  potential  result.  It  is  also 
seen  that  the  oscillation  of  the  shell  correction 
along  the  fission  path  is  somewhat  reduced. 

The  total  energy  is  compared  in  the  top  figure 
of  Figure  4.  The  improvement  made  by  the  stabi¬ 
lized  jellium  is  significant.  The  conventional  jel¬ 
lium  calculation  predicts  much  less  fission  barrier 
than  the  stabilized  jellium,  although  their  qualita¬ 
tive  features  are  the  same.  The  total  energies  ob¬ 
tained  by  shell  correction  methods  using  the  cavity 
potential  and  the  Wood-Saxon  potential  give  al¬ 
most  the  same  fission  barrier. 

The  position  of  the  local  minimum  around  C  = 
1.75  by  the  jellium  calculation  is  closer  to  the 
minimum  position  obtained  by  the  Woods-Saxon 
potential  calculation  than  to  that  obtained  by  the 
cavity  potential  calculation.  Therefore  the  spill-out 
is  necessary  to  predict  the  right  shape  for  the  local 
minimum. 

The  fission  barrier  obtained  by  the  shell  correc¬ 
tion  method  is  higher  than  that  obtained  by  the 
stabilized  jellium  model.  Because  the  shell  correc¬ 
tions  are  not  so  different  among  the  various  meth¬ 
ods,  this  difference  is  probably  due  to  the  inaccu¬ 
racy  in  the  model  of  calculating  the  liquid  drop 
energy.  The  quantitative  agreement  between  the 
stabilized  jellium  result  and  the  shell  correction 
results  are  very  good  for  clusters  with  near  spheri¬ 
cal  shapes,  although  the  agreement  is  poor  for 
those  with  large  deformations. 
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FIGURE  5.  The  total  energy  and  shell  correction  along 
path  2.  Top:  Same  as  the  top  of  Figure  4  but  for  the  path 
2.  Middle:  The  shape  of  the  cluster  at  C  =  0,  1.2,  1.4, 
and  1 .6.  Bottom:  Same  as  the  bottom  of  Figure  4  but  for 
the  path  2. 

In  Figure  5,  the  total  energy  and  shell  correction 
along  path  2,  B  =  6(C  —  1),  1  <  C  <  1.2;  B  =  1.2, 
1.2  <  C  <  1.5,  are  shown.  The  cluster  forms  the 
neck  without  substantial  elongation  along  this 
path.  The  qualitative  feature  of  the  stabilized  jel- 


lium  result  and  the  shell  correction  method  using 
the  Woods-Saxson  potential  are  the  same. 

In  order  to  improve  the  quantitative  prediction 
by  the  shell  correction  method,  the  model  for  cal¬ 
culating  the  liquid  drop  energy  must  be  improved. 
But  even  with  the  present  model,  it  is  expected 
that  the  accuracy  of  the  shell  correction  method 
estimate  will  increase  as  the  size  of  the  cluster 
increases  since  the  cluster  energy  will  approach  to 
the  bulk  value  as  the  cluster  size  increases. 
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ABSTRACT _ _ _ 

Various  rutrogen  clusters,  N^,  are  selected  for  the  present  theoretical  study.  The  number 
of  nitrogen  atoms  chosen  in  this  work  varies  from  x  =  8  to  x  =  32.  PM3,  which  is  known 
as  one  of  the  best  semiempirical  methods,  is  selected  for  the  self-consistent  molecular 
orbital  calculations.  The  geometrical  optimization,  vibrational  frequencies,  and 
thermochemical  computations  are  all  involved  for  various  types  of  molecular  nitrogen 
clusters.  The  results  show  that  all  N^'s  belong  to  the  category  of  stable  high-energy 
compounds.  Comparison  of  average  bond  energy  and  delocalization  energy  of  all  cases 
reveals  that  N2o(4  symmetry)  is  the  most  stable  molecule  among  all  the  nitrogen  clusters 
studied.  In  addition,  our  results  show  five-membered  rings  are  the  most  favored  in  the 
structures  of  nitrogen  clusters  (N^).  ©  1996  John  Wiley  &  Sons,  Inc. 


Introduction 

In  the  period  from  1990  to  1992,  a  hypothetical 
molecule  Ng  having  symmetry  was  intro¬ 
duced  and  developed  by  Lauderdale  et  al.  [1]  and 
Engelke  and  Stine  [2]  using  ab  initio  calculations. 
They  proposed  the  cubane-type  Ng  molecule  to  be 
a  stable  high-energy  structure.  Recently,  N20  with 
4  symmetry  [3-6]  and  N28  with  symmetry  [7] 
have  been  successfully  developed  and  reported  in 
the  literature,  with  both  ab  initio  as  well  as 

*To  whom  correspondence  should  be  addressed. 


semiempirical  theoretical  predictions.  We  have  ob¬ 
served  in  the  present  study  that  I^Aype  N20  and 
T^-type  N28  are  both  similar  to  the  previously 
mentioned  Ng.  All  the  molecules  are  more  stable 
than  nitrogen  atoms  and  have  higher  energy  than 
N2(g)  molecules,  thus  belonging  to  the  category  of 
stable  high-energy  compounds.  In  order  to  exam¬ 
ine  the  comparative  stability  among  nitrogen  clus¬ 
ters  in  general,  we  studied  several  (x  =  8-32) 
structures.  According  to  both  experimental  and 
theoretical  results,  the  midsize  clusters  of  the  pure 
element  with  covalent  bonds  usually  exist  with 
highly  symmetric  structures.  The  well-known 
molecule  buckminsterfullerene  (C^g)  is  formed 
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with  symmetry  [8-10],  and  even  the  smallest 
fullerene  cluster  (C28)  with  a  ^^2  open-shell  ground 
state  still  belongs  to  the  point  group  [11].  Since 
both  the  nitrogen  and  carbon  clusters  are  con¬ 
structed  with  equivalent  covalent  bonds  and  the 
same  kind  of  atoms,  similar  to  the  structures  of 
carbon  clusters,  we  select  the  highly  symmetric 
structure  for  each  of  nitrogen  clusters.  In  addi¬ 
tion  to  the  behavior  of  symmetry,  we  should  also 
pay  attention  to  the  structure  of  local  rings  in  each 
cluster.  When  nitrogen  atoms  are  forming  a  three- 
dimensional  cluster,  three  equivalent  single  bonds 
are  required  for  each  atom.  These  three  bonds  are 
usually  formed  with  the  polarized  sp^  type  or 
delocalized  sp^  type  of  structure.  The  favored  bond 
angles  of  these  bonds  in  each  atom  are  usually 
more  or  less  near  to  the  tetrahedral  angle  (109°X 
The  self-consistent  field  type  optimized  rings  in 
the  cluster  of  molecular  orbital  (MO)  calculation 
are  usually  in  favor  of  planar  or  near  to  the  planar 
local  ring  structure  due  to  the  delocalization  effect. 
In  the  planar  regular  polygonal  rings,  the  inside 
bond  angles  of  a  triangle  is  60°,  of  a  square  is  90°, 
of  a  five-membered  ring  is  108°,  of  a  six-membered 
is  120°,  of  a  seven-membered  is  129°  and  of  a 
eight-membered  is  135°.  If  one  compares  these 
angles  with  the  sp^  type  of  angle  as  shown  above, 
it  is  not  difficult  to  find  out  that  108°  of  five- 
membered  ring  is  the  most  suitable  local  structure 
for  the  nitrogen  clusters.  Consequently,  for  the 
planar  or  nearly  planar  rings,  small  rings  with  a 
size  of  3  or  4  get  the  prominent  negative  bond 
angular  deviation  from  the  sp^  angle  (A0  =  60°  - 
109°  =  —49°  for  the  size  3  and  AO  =  90°  —  109°  = 
— 19°  for  the  size  of  4);  large  rings  with  sizes  of  7 
or  8  get  the  prominent  positive  bond  angular  devi¬ 
ation  from  the  sp^  angle  (AO  =  135°  -  109°  =  26° 
for  the  size  of  8  and  =  129°  -  109°  =  20°  for 
the  size  of  7).  Both  of  these  negative  and  positive 
deviation  from  sp^  angle  will  cause  the  large  an¬ 
gular  strain  for  the  single-bond-based  nitrogen 
clusters.  For  nonplanar  large  rings  with  the  sizes  of 
7  or  8  case,  bond  angle  may  be  shrunk  by  changing 
dihedral  angle.  However,  the  symmetry  of  a 
three-dimensional  system  of  clusters  will  be  dis¬ 
torted,  and  the  small  rings  of  size  3  or  4  may 
occur,  and  the  whole  cluster  will  be  less  compact 
than  the  original  regular  polygonal  system.  The 
instability  and  another  kind  of  angular  strain  may 
happen  again  due  to  nonplanar  distortion  of  large 
rings.  Therefore,  for  the  single-bond-based  three- 
dimensional  nitrogen  cluster  in  this  work,  we  de- 


;  12  five-membered  rings;  30  bonds  ;  2  six-membered  rings;  36  bonds 


3  six-membered  rings;  39  bonds  Nj,{TJ;  4  six-membered  rings;  42  bonds 


5  six-membered  rings;  45  bonds 


NjjCDj);  6  six-membered  rings;  48  bonds 

FIGURE  1.  The  geometry  of  clusters  (which 
consisted  of  five  and  six-membered  rings  for  x  >  2). 
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fined  that  both  the  negative  and  positive  deviation 
of  angles  will  be  increasing  angular  strain  and 
decreasing  the  stability  of  the  cluster  formation 
whenever  the  size  of  the  ring  is  smaller  or  larger 
than  5.  The  introductory  idea  of  this  angular 
strain-type  argument  was  first  introduced  by  us  in 
our  former  work  of  the  semiempirical  study  of 
N2o(4)  [4].  If  the  above  argument  is  true,  the 
priority  of  our  cluster  structures  assignment  of  this 
work  are  selected  according  to  the  following  two 
principles.  First  of  all,  the  five-membered  rings  are 
assigned  as  much  as  possible  for  the  cluster  in 
order  to  decreasing  its  angular  strain.  The  second 
important  principle  is  to  assign  the  structure  of 
clusters  according  to  their  highly  symmetric  prop¬ 
erty.  Under  these  two  principles  of  assignment, 
N2o(4X  N24(D4,),  N26(D3^),  a^d 

NsalDs)  as  shown  in  Figure  1  and  NgfO;,), 
Nio(D3„),  N,2(D6,,),  Ni4(D3„),  and  NJO,,)  as 
shown  in  Figure  2  are  selected  for  this  work.  In 
order  to  testify  about  stability  and  angular  strain 
of  various  N^'s  with  the  same  number  of  nitrogen 
atoms  ix),  we  also  select  the  high  strained 
N,4(D7,,),  NigfDg,,),  N24(0,),  N^sWjdl  and 
N32(Oft)  as  shown  in  Figure  3  to  compare  them 
with  the  related  less  strained  N^.  (x  =  14,  16,  24, 
28,  32)  of  Figures  1  and  2.  With  this  type  of 
comparison,  the  real  evidence  of  the  angular  strain 
of  various  sized  localized  rings  in  nitrogen  clusters 
may  be  testified  as  shown  in  the  following  section. 

The  MO  method  used  in  this  work  is  the  PM3 
method  which  is  known  to  be  one  of  the  best 
semiempirical  methods  so  far  [12].  As  shown  in 
our  previous  study  on  N20  [6],  the  optimized  struc¬ 
tures,  the  calculated  first  ionization  potentials,  and 
thermochemical  results  of  both  the  semiempirical 
PM3-MO  as  well  as  ab  initio  type  6-31G  are  all 
very  close  to  each  other.  Consequently,  we  believe 
that  using  the  semiempirical  type  PM3-MO  method 
as  an  instrument  for  the  theoretical  study  of  nitro¬ 
gen  clusters  will  be  very  effective,  efficient,  and 
reliable  in  the  present  comparative  analysis. 


Calculation 

The  versions  of  PM3  in  Gaussian  92  [13]  and 
HyperChem  [14]  were  both  used  in  the  present 
work.  For  various  nitrogen  clusters,  different  cal¬ 
culations  of  the  same  molecule  by  these  two  pro¬ 
grams  usually  yield  the  identical  results.  The  ver¬ 


sion  of  PM3  in  the  Gaussian  package  was  used 
since  it  provided  better  degeneracies  for  both  MOs 
and  vibrational  frequencies  in  the  cases  of  high 
symmetry  point  groups.  The  version  of  PM3  in 
HyperChem  usually  yielded  the  calculated  bind¬ 
ing  enthalpy  and  enthalpy  of  formation  directly  for 
the  following  reactions: 

xN(^)  =  N,(g),  (1) 

x/2N2(g)  =  (2) 

For  comparing  the  stability  of  the  bonding  behav¬ 
ior  for  each  cluster,  similar  to  the  average  bond 
energy  calculations  as  reported  earlier  [3,  7],  we 
calculated  the  average  bond  energies  (BE)  with 
enthalpy  of  atomization,  i.e.,  a  negative  value  of 
binding  enthalpy  divided  by  the  number  of  bonds 
(n)  in  the  molecule  as 

BE  =  AHJn.  (3) 

In  addition  to  these  quantities,  the  newly  de¬ 
fined  "delocalization  energy"  (DE)  of  a  single 
bonded  structure  of  C24N4  cluster  [15]  is  also  ap¬ 
plied  in  this  work: 

DE  =  AH„  -  Ln,.(AHbond,)^  (4) 

i 

where  n,  and  are  the  number  of  bonds 

and  bond  energy  of  ith  type  of  bond.  For  the 
clusters  in  this  work,  all  bonds  are  referenced  to 
the  N-N  single  bond 

DE  =  AH„  —  n  AHi,ond(M-N)' 

where  n  is  the  number  of  N-N  bonds  in  the 
cluster;  AHbond(N-N)  =  163  kj  moP^  is  assigned 
from  a  recent  textbook  of  physical  chemistry  [16]. 

Eor  the  purpose  of  comparison  between  various 
Nj.,  the  average  delocalization  energy  of  each  bond 
can  be  easily  evaluated  as 

DE//7  AH^/TI  AFIbond(N-N) 

=  BE  -  163kJmor^  (6) 

The  cluster  type  of  molecules  calculated  in  this 
work  are  all  assumed  to  consist  of  nitrogen  atoms. 
The  size  of  the  molecules  assigned  here  lies  in  the 
range  between  8  and  32  atoms.  The  assignment  of 
cluster  structures  was  according  to  the  principle  of 
less  angular  strain  and  their  symmetric  behavior  of 
the  assigned  symmetry  group.  Hence  we  first  con- 
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NioCDjk);  2  five-membered  rings; 

5  four-merobeied  rings;  15  bonds 


1.494 


N,2(D^;  2  slx-membered  rings; 

6  four-membered  rings;  18  bonds 


6  five-membered  rings; 

3  four-membered  rings;  21  bonds 


NmCDjk);  8  five-membered  rings; 

2  four-menibered  rings;  21  bonds 

FIGURE  2.  The  geometry  of  nitrogen  clusters  (which  consisting  of  four,  five  or  six~membered  rings  for  x  <  20). 
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7  four-membeied  rings;  21  bonds 


6  four-membered  rings; 

8  six-membered  rings;  36  bonds 


N,5(D  J;  2  eight-membered  rings; 

8  four-membered  rings;  24  bonds 


2  seven-membered  rings; 

14  five-membered  rings;  42  bonds 


NjjCOJ;  12  six<membered  rings; 

6  four-membered  rings;  48  bonds 


FIGURES.  clusters  (x  =  14,  16,  24,  28,  32)  with  comparative  larger  angular  strain. 
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structed  geometries  consisting  of  five-  and  six- 
membered  rings.  The  4  ^20  constructed 

with  12  regular  five-membered  rings.  For  x  >  20, 
N24(D6^)  has  2  six-membered  rings,  the  N26(D3;^) 
has  3  six-membered  rings,  the  N28(T^)  has  4  six- 
membered  rings,  has  five  six-membered 

rings,  and  N32(D3)  has  6  six-membered  rings,  in 
addition  to  12  five-membered  rings  in  each  case,  as 
shown  in  Figure  1.  For  cluster  sizes  less  than  20, 
structures  formed  with  four-  and  five-membered 
rings  or  four-  and  six-membered  rings  were  cho¬ 
sen,  including  the  cubane-like  NgCO;^), 

^14(03;,)  with  3  four-membered  and  6 
five-membered  rings,  and  "^ith  2  four- 

membered  and  8  five-membered  rings  as  shown  in 
Figure  2.  In  order  to  verify  the  principle  of  large 
angular  strain  in  the  cases  of  large  (7  or  8)  or  small 
(3  or  4)  sizes  of  rings,  we  also  selected  clusters 
with  seven-  or  eight-membered  rings,  such  as 
^14(^7/^)/  along  with  an 

analogous  larger  cluster  with  four-membered  rings 
such  as  N24(0;^)  and  N32(0;^)  as  shown  in  Figure  3, 
for  the  purpose  of  comparison.  Thus,  a  total  of  16 
nitrogen  cluster  structures  (N^,  x  varying  from  8 
to  32)  have  been  evaluated  in  this  work. 

Both  the  geometry  optimization  and  vibrational 
frequency  calculations  of  all  the  clusters  men¬ 
tioned  above  were  successfully  computed.  All  the 
calculated  ground-state  electronic  configurations 
are  closed-shell  singlets.  N^gfDg;,)  exhibited  two 
imaginary  vibrational  frequencies  and  proved  not 
to  be  a  local  minimum,  whereas  the  vibrational 
frequencies  of  all  the  remaining  15  structures  were 
found  to  be  positive  real  values  lying  between  89 
and  1108  cm"k  Consequently,  they  are  stable 
structures  on  their  respective  potential  surfaces. 


Result  and  Discussion 

COMPARISOrV  BETWEEN  LESSER  OR 

LARGER  ANGULAR  STRAIN  N, 

In  order  to  demonstrate  the  significant  effect  of 
angular  strain  on  the  molecules,  we  initially 
selected  Nj4,  N24,  N2g,  and  N32  consisting  of 
comparably  lesser  angular  strain  structures  of  Fig¬ 
ures  1  and  2  and  compared  them  with  the  related 
five  larger  strain  molecules,  as  shown  in  Figure  3. 
We  selected  the  calculated  result  of  the  first  ioniza¬ 
tion  potentials,  i.e,,  the  energy  gap  between 
the  highest  occupied  molecular  orbital  (HOMO) 
and  lowest  unoccupied  molecular  orbital  (LUMO), 
i.e.,  Ae  =  ^lumo  “  -^homo  ^he  average  bond  ener¬ 
gies  from  Eq.  (3),  and  heat  of  formations,  in  order 
to  compare  the  stability  of  the  five  pairs  of 
(x  =  14,  16,  24,  28,  32).  The  calculated  values  of 
PM3  are  listed  in  Table  I  for  comparison.  From  the 
results  in  this  table  we  found  that  for  with 
identical  x,  4^^,  As,  and  BE  of  molecules  having 
lesser  strain  are  larger  than  the  corresponding  val¬ 
ues  of  molecules  having  larger  strain.  The  A  HJ  of 
lesser  strain  cases  is  lower  than  the  A  of  larger 
strain  cases.  All  these  results  imply  that  nitrogen 
clusters  consisting  of  five-  or  six-membered  rings 
are  favored  over  the  structures  which  containing 
seven-,  eight-,  or  four-membered  rings. 

COMPARISON  OF  THE  MOST  STABLE  N^ 

(x  =  8  TO  32) 

After  the  geometrical  optimization  and  vibra¬ 
tion  frequency  calculations  of  various  in  Fig- 


TABLE  I _ 

Comparisons  of  the  angular  strain  effect  in  (x  =  14, 16,  24,  28,  32). 


Nx 

Sym. 

Group 

Geometric 

Structure 

/ls. 

(eV) 

As 

(eV) 

BE 

(kJ  bond” '') 

ah; 

(kJ  mol” 

Ni4 

^3/j 

Fig.  2 

13.15 

10.15 

195.09 

2522.2 

Dj, 

Fig.  3 

12.40 

9.32 

175.57 

2932.0 

Ni6 

Fig.  2 

13.21 

10.01 

197.66 

2820.9 

^eh 

Fig.  3 

12.50 

8.84 

169.29 

3501 .6 

N24 

Fig.  1 

12.95 

8.90 

199.23 

4174.9 

0, 

Fig.  3 

12.90 

8.87 

190.54 

4487.6 

^28 

Td 

Fig.  1 

12.98 

8.62 

197.53 

4942.0 

Djd 

Fig.  3 

12.60 

8.55 

196.49 

4985.4 

^32 

Ds 

Fig.  1 

12.85 

8.00 

195.68 

5737.0 

On 

Fig.  3 

12.73 

7.51 

188.34 

6089.1 
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ures  1  and  2,  all  the  calculated  bond  distances  and 
angles  are  marked  in  the  related  figures  and  all  the 
other  results  are  list  in  Tables  II  and  IIL 

and  HOMO-LUMO  Energy  Gap 

As  shown  in  Figure  4  and  Table  II,  the  calcu¬ 
lated  first  ionization  potentials  (list)  various 
are  very  close  to  each  other;  their  Iig^  are  all  close 
to  13  eV.  The  HOMO-LUMO  energy  gap  (A^)  are 
decreased  whenever  the  number  of  nitrogen  (x) 
increased.  The  tendency  of  decreasing  in  more 
prominent  for  larger  cluster  whenever  x  >  20  than 
in  the  cases  of  smaller  cluster  {x  <  20).  If  one 


compares  the  observed  =  15.58  eV  and  PM3 
calculated  HOMO-LUMO  energy  gap  A^r  =  14.36 
eV  of  N2  molecule,  it  is  found  that  both  and 
energy  gap  A^  of  various  cluster  molecules  are  all 
less  than  the  related  values  of  N2;  that  means  the 
large  nitrogen  clusters  are  more  reactive  than  N2 
from  the  viewpoint  of  charge  transfer  processes. 

Vibration  Frequency  Calculation 

All  the  vibrational  frequencies  of  these  11  nitro¬ 
gen  clusters  are  positive  between  158  and  1108 
cm“h  The  largest  frequencies  of  them  are  all  less 
than  1200  cm"^  due  to  a  lack  of  stretching-type 


Comparisons  of  the  1^^^,  energy  gap  (Ae),  average  bond  energies  (BE),  delocalization  energies  (DE/n),  and 
heat  of  formations  (AH/'  /  nHz)  in  =  8  to  32).  _ 


N,v 

Symmetry 

Group 

^Ist 

(eV) 

A^ 

(eV) 

BE 

(kJ  bond“^) 

DE/n  (kJ  mol 
(n :  No  of  N-N  bonds) 

AH;/nN2 

[kJ(N2)-1 

Ns 

Oh 

12.99 

11.03 

173.59 

10.59 

424.82 

N10 

Dsh 

13.33 

11.18 

183.22 

20.23 

395.91 

Ni2 

Deh  Fig-  2 

12.82 

9.70 

180.64 

17.64 

403.67 

Ni4 

^3h 

13.15 

10.15 

195.09 

32.09 

360.32 

Ni6 

13.21 

10.01 

197.66 

34.66 

352.61 

N20 

Ih 

13.17 

9.63 

201 .55 

38.55 

340.95 

N24 

Ded 

12.95 

8.90 

199.22 

36.23 

347.91 

N26 

12.82 

8.60 

198.45 

35.45 

350.24 

N28 

“T 

CG 

12.98 

8.62 

197.53 

34.53 

353.00 

N30 

Dsh 

12.79 

8.28 

197.47 

34.47 

353.16 

N32 

Ds 

12.85 

7.96 

195.68 

32.68 

358.56 

TABLE  III 


Comparisons  of  the  average  bond  lengths  (in  A),  the  longest  (or  weakest)  bond  length  (in  A),  vibrational 
frequency  ranges,  and  the  difference  between  the  highest  frequency  and  the  lowest  frequency  in  (x  =  8  to  32). 


N. 

Symmetry 

Group 

Ave.  Bond 
Length 

Longest  Bond 
Length 

Vib.  Freq.  Range 
(cm 

AP 

(cm  "  ■') 

Ns 

Oh 

1.510 

1.510 

537  - 

1108 

571 

N10 

Dsh 

1.505 

1.508 

395 

1019 

624 

Ni2 

^6h 

Fig.  2 

1.498 

1.505 

274  - 

1003 

729 

Ni4 

^3h 

1.498 

1.506 

370  ~ 

946 

576 

Ni6 

^4d 

1.497 

1.520 

251  - 

965 

714 

N20 

Ih 

1.491 

1.491 

356  - 

947 

591 

N24 

Ded 

1.488 

1.532 

292  ~ 

997 

705 

N26 

Dsh 

1.485 

1.524 

273  ~ 

1024 

751 

N28 

Td 

Fig.  1 

1.483 

1.493 

287  - 

1000 

713 

N30 

Dsh 

1.483 

1.511 

158  - 

1093 

935 

N32 

D3 

1.480 

1.506 

228  - 

1041 

813 
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12.85 

— the  first  ionization  I 
potentials 


-■-the  energy  gap 
(HOMO-LUMO) 


7.96 


N8  NIO  N12  N14  N16  N20  N24  N26  N28  N30  N32 

(Oh)  (D5d)  (D6h)  (D3h)  (D4d)  (Di)  (D6d)  (D3h)  (Td)  (D5h)  (D3) 

Nx  Compounds 

FIGURE  4,  Comparisons  of  the  first  ionization  potentials  and  the  energy  gap  in  (x  =  8  to  32). 


vibrations  of  the  special  functional  group.  This 
skeleton  vibrational  frequencies  of  clusters  are 
usually  very  small.  The  largest  frequencies  of  vari¬ 
ous  nitrogen  clusters  are  more  or  less  around  1000 
cm  “  ^ .  For  the  sake  of  simplicity,  only  the  range  of 
frequencies  and  difference  between  largest  and 
smallest  frequencies,  (Ai' = 

are  collected  and  listed  in  Table  III  for  comparison. 
From  Av's  of  the  various  clusters,  one  may  easily 
identify  the  clusters  with  comparable  smaller  Av 
(Ng,  N^4,  and  N20)/  structures  which  are  more 
compact  or  more  symmetric  than  other  clusters. 
The  larger  Av  cases,  such  as  the  clusters  of  N30 
and  N32,  are  relatively  larger  and  more  flexible 
than  the  other  clusters.  Both  the  anomalous  large 
Az^  and  small  HOMO-LUMO  gap  of 
show  that  the  hexagonal  type  N|2  is  a  less  rigid 
and  less  stable  nitrogen  cluster  compared  to  the 
pentagonal-type  cluster  with  five-mem- 

bered  rings. 

Bond  Distance 

Most  of  the  calculated  bond  distances  shown  in 
Figures  1  and  2  are  less  than  the  standard  N-N 
single  bond  distance  1.45  A  assigned  by  Pople  and 
Beveridge  [17].  In  order  to  compare  the  geometri¬ 


cal  size,  the  average  bond  dista  'ice  of  each  cluster 
is  calculated: 

{d}avc.=  (7) 

Ln, 

i 

and  listed  in  Table  III  and  Figure  5.  In  Eq.  (7) 
and  are  number  and  length  of  the  ith  type  of 
bond.  The  range  of  average  bond  distance  lies 
between  1.51  and  1.48  A.  The  larger  cluster  gives 
the  shorter  average  distance  due  to  delocalization 
effect  of  this  single-bond-based  system  of  larger 
cluster. 

In  the  simplest  sense,  the  bond  will  be  strong 
when  the  bond  distance  is  short.  From  the  gross 
point  of  view,  the  larger  cluster  with  shorter  aver¬ 
age  bond  length  will  be  stronger  and  more  diffi¬ 
cult  to  break  than  the  other  smaller  clusters.  How¬ 
ever,  the  bond  breaking  is  not  a  gross  problem,  but 
it  is  a  problem  of  local  region  or  local  bond.  There¬ 
fore,  the  weakest  bond  or  longest  bond  distance  in 
each  cluster  should  be  selected  for  comparison.  If 
one  looks  at  Figure  5,  the  minimum  in  this  curve 
belongs  to  the  N2o(4)  cluster.  This  fact  means  that 
all  bonds  in  the  highly  symmetric  and  five-mem- 
bered  ring  systems  are  all  equivalent  and  balanced 
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N8  NIO  N12  N14  N16  N20  N24  N26  N28  N30  N32 

Nx  Compounds 


FIGURE  5.  Comparisons  of  the  average  bond  length  and  the  longest  bond  length  in  (x  =  8  to  32). 


local  parts  in  the  N20  cluster,  and  there  is  no  weak 
point  (or  special  long  bond  distance)  like  the  other 
nonequivalent  bond  systems  of  the  other  clusters. 

BE,  (DE)/ii,  and  AH^/(no.  of  1X2) 

The  last  and  the  most  important  problem  is  the 
comparison  of  the  energy  differences  between 
N^(g)  and  xN(g)  or  N^(g)  and  x/2N2(g).  In 
addition  to  the  average  bond  energy  (BE)  and 
average  delocalization  energy  (DE/n),  we  also  cal¬ 
culate  AH^/(no.  of  N2)  for 

reaction  in  which 

AH/(no.  of  N2)  =  AH/(x/2).  (8) 

These  results  are  shown  in  Table  11.  Because  DE/n 
may  be  evaluated  by  Eq.  (6)  and  which  is  calcu¬ 
lated  by  BE-163  (kj  mol“^).  The  reason  we  calcu¬ 
late  DE /n  in  this  work  is  to  show  the  delocaliza¬ 
tion  effect  related  to  N-N  single  bond.  Although, 
in  the  bond  distance  calculation  of  the  last  para¬ 
graph,  most  calculated  larger  than  the 

standard  single-bond  distance  (1.45  A),  all  the  cal¬ 
culated  DE  and  DE/n  are  positive  values  for  all 
clusters.  This  result  shows  that  all  the  N-N  bonds 
in  various  N^  clusters  are  stronger  than  the  ordi¬ 
nary  N-N  single  bond.  If  one  compares  the  BE, 


DE/n,  and  AH^/(no.  of  N2)  for  all  clusters,  all  the 
evidences  prove  that  N2o(4)  I^e  most  stable 
nitrogen  cluster.  If  one  compares  the  BE,  DE/n  of 
the  small  clusters  Ng(0^)  and 

the  most  stable  cluster  among  these  three  clusters 
is  N|o(D5;j).  The  idea  of  angular  strain  of  this  work 
shown  above  is  strongly  supported  by  these  re¬ 
sults.  The  less  angular  strained  five-membered 
rings  is  the  fundamental  unit  for  the  stable  nitro¬ 
gen  clusters  formation. 

Conclusion 

From  the  above  results  we  conclude: 

1.  For  nitrogen  clusters,  five-  and  six-membered 
rings  have  less  angular  strain  and  are  fa¬ 
vored  over  four-,  seven-,  and  eight-mem- 
bered  rings,  just  as  expected  for  analogous 
carbon  structures. 

2.  From  both  energy  and  largest  bond  distance 

comparisons,  N2o(4)  most  stable  N^ 

cluster  in  this  work. 

3.  From  the  energy  viewpoint,  is  the 

most  stable  structure  among  Ng,  N^q,  and 
Ni2/  which  is  the  additional  evidence  to  show 
five-membered  rings  are  preferred  also  in 
cases  of  small  nitrogen  clusters. 

4.  In  the  case  of  carbon  clusters,  such  as 
graphite,  and  C70,  their  six-membered 
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ring  part  is  usually  more  important  than  the 
five-membered  ring  part.  In  this  work,  all  the 
PM3  results  show  that  the  five-membered 
ring  is  the  most  favored  fundamental  struc¬ 
tural  unit  in  the  nitrogen  cluster  molecules, 
which  may  be  quite  different  from  those  of 
carbon  clusters.  The  idea  of  smallest  angular 
strain  of  the  five-membered  ring  system  is 
strongly  verified  by  all  the  above-mentioned 
results  of  this  work. 
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ABSTRACT _ 

We  carried  out  variational  model  calculations  for  the  assessment  of  the  combined  effect 
of  the  nonparabolicity  of  the  electron  effective  mass  and  the  screening  of  the  donor  ion  by 
the  valence  electrons  of  GaAs  for  a  donor  placed  at  the  center  of  a  spherical  quantum  dot. 
We  considered  finite  confining  potentials  between  the  GaAs  QD  and  the  surrounding 
Ga|_^A4As  matrix.  We  found  that  the  combined  effect  becomes  more  pronounced  as 
the  radius  of  the  quantum  dot  decreases.  ©  1996  John  Wiley  &  Sons,  Inc.  Int  J  Quant  Chem 
60:  507-510,  1996 


Introduction 

Reduced  dimensional  structures,  such  as  a 
quantum  well  (QW),  a  quantum-well  wire 
(QWW),  or  a  quantum  dot  (QD),  are  man-made 
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structures  that  can  be  fabricated  by  chemical  vapor 
deposition  (CVD)  [1,  2]  and  molecular-beam  epi¬ 
taxy  (MBE)  [3-5].  Donor  atoms  in  these  structures 
are  expected  to  have  different  binding  energies 
than  have  the  same  donor  atoms  in  the  bulk  semi¬ 
conductor. 

Each  of  these  reduced  dimensional  structures 
has  received  an  extraordinary  amount  of  attention 
in  the  literature.  For  this  reason,  we  shall  severely 
curtail  the  number  of  references  in  the  present 
article  and  refer  only  to  publications  that  we  deem 
essential. 

Much  studied  hetero  structures  are  the  follow¬ 
ing:  the  quasi-two-dimensional  QW,  such  as  a 
GaAs  slab  sandwiched  between  two  semi-infinite 
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Gai_^Al^As  blocks,  the  quasi  one-dimensional 
QWW,  such  as  a  GaAs  wire  placed  in  an  infinite 
Gai_^Al^As  matrix,  and  the  quasi  zero-dimen¬ 
sional  QD,  such  as  a  GaAs  sphere  placed  in  a 
Ga^  .^Al^As  bulk. 

The  finding  energy  of  a  hydrogenic  donor  in  a 
Gai_^  A4As/GaAs/Ga3^_^Al^  As  QW  was  calcu¬ 
lated  by  Bastard  [6,  7],  who  applied  a  variational 
approach.  The  binding  energy  in  a  GaAs / 
Ga^-^Al^As  QWW  of  a  circular  cross  section  was 
calculated  by  Brown  and  Spector  [8],  also  by  a 
variational  approach.  The  effect  of  changing  the 
shape  of  the  QWW  was  studied  by  Bryant  [9].  The 
binding  energy  of  a  hydrogenic  donor  in  the  center 
of  a  GaAs/Ga^_^Al^  As  QD  was  studied  by  Por- 
ras-Montenegro  and  Perez-Merchancano  [10],  Zhu 
et  al.  [11],  and  Zhu  [12].  The  binding  energy  of  an 
off-center  donor  in  a  spherical  QD  has  also  re¬ 
ceived  the  attention  of  Zhu  and  Chen  [13,  14]. 

The  screening  of  a  donor  ion  by  the  valence 
electrons  of  GaAs  in  a  Gai_^Al^As/GaAs/ 
Ga|_^Al^As  QW  was  considered  by  Csavinszky 
and  Elabsy  [15],  who  used  an  analytical  screening 
function  obtained  by  Resta  [16].  The  screening  of  a 
donor  ion  by  the  valence  electrons  of  GaAs  in  a 
GaAs/Ga|_^  Al^  As  QWW  of  circular  cross  section 
was  studied  by  Csavinszky  and  Oyoko  [17,  18]. 
Recently,  Elabsy  and  Csavinszky  [19]  considered 
the  screening  of  a  donor  ion  by  the  valence  elec¬ 
trons  of  GaAs  in  a  spherical  GaAs/Gai_^  Al^  As 
QD. 

In  all  the  above  calculations,  the  donor  electron 
was  described  by  a  constant  scalar  effective  mass. 
It  is,  however,  known  that  the  conduction  band 
(CB)  of  GaAs  deviates  from  the  parabolic  form. 
The  effect  of  the  nonparabolicity  of  the  CB  on  the 
binding  energy  of  a  donor  in  a  Ga^_^Al^As/ 
GaAs/Gaj_^Al^  As  QW  was  studied  by  Chaud- 
huri  and  Bajaj  [20]  who  considered  a  finite  confin¬ 
ing  potential.  A  similar  calculation  was  performed 
by  Csavinszky  and  Elabsy  [21].  Both  of  these  calcu¬ 
lations  made  use  of  an  energy-dependent  effective 
mass  for  the  calculation  of  the  binding  energies  of 
hydrogenic  donors  placed  in  on-center  [18]  and 
off-center  [19]  positions  in  the  Ga^.^Al^As/ 
GaAs/Gai_^  Al^As  QW.  Recently,  the  effect  of  the 
CB  nonparabolicity  on  the  binding  energy  of  an 
on-center  donor  in  a  GaAs/Gai_^Al^  As  QD  has 
been  considered  by  Elabsy  and  Csavinszky  [22], 

In  this  article,  we  study  the  combined  effect  of 
screening  and  nonparabolicity  on  the  binding  en¬ 
ergy  of  a  donor  located  at  the  center  of  a  spherical 


Gai_^Al^As/GaAs  QD.  We  consider  a  finite  bar¬ 
rier  confining  potential.  Our  approach  is  a  varia¬ 
tional  approach,  and  our  imits  are  atomic  units 
(imit  of  energy  is  the  Hartree,  unit  of  length  is  the 
bohr). 


Theory 

In  this  section,  we  restrict  ourselves  to  a  very 
brief  presentation  of  the  theory  of  the  combined 
effect  of  screening  and  the  nonparabolicity  of  the 
electron  effective  mass. 

The  Hamiltonian,  for  a  donor  in  the  center  of  a 
spherical  QD  of  radius  R,  is  given  by 


H=  - 


1 

€i(r)r 


+  n(^)- 


(1) 


In  Eq.  (1),  denotes  the  energy-dependent 
effective  mass  (hence,  the  subscript  £)  of  Hrivnak 
[23],  given  by 


r  £ 

^  H - .  (2) 

y 

In  Eq.  (2),  m*  ^  is  the  electron  effective  mass  at  the 
T-point,  =  0.0665  tHq,  E  is  the  electron  energy 
expressed  in  eV  units,  and  y  =  9.41  eV. 

Also  in  Eq,  (1),  denotes  Resta's  [16]  screen¬ 
ing  fimction,  obtained  from  a  linearized  differen¬ 
tial  equation  (hence,  the  subscript  /),  and  V^^(r)  is 
the  confining  potential  given  by 


0  r  <  R 
V,  r>R’ 


(3) 


By  assuming  that  60%  of  the  energy  band  gap 
discontinuity  is  related  to  the  conduction  band, 
Batey  and  Wright  [24]  obtained 

Vo  =  0.60  A  E[,  (4) 

where  A  Eg  is  the  difference  in  band  gaps  of 
Ga|_^A4As  and  GaAs  at  the  T-point.  This  quan¬ 
tity  is  then 


A^fev)  =  1.247X,  x  <  0.45.  (5) 
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Our  variational  trial  function  is  of  the  form 

r  <  R 


ipir)  = 


NsiniUr) 


N- e  r>R 
r 


(6) 


In  Eq.  (6),  N  is  a  normalization  constant,  given 
in  [15],  and  A  is  a  variational  parameter.  Finally, 
the  quantities  and  are  defined  by 

=  [2m*iV,  -  (7) 

and 

=  [2m*E,oV^\  (8) 

The  binding  energy,  E^fr),  described  by  the 

Hamiltonian  in  Eq.  (1),  is  defined  as 

E,(R)  =  Eio(R)  -  (9) 

In  Eq.  (9),  is  obtained  by  solving  the  transcen¬ 
dental  equation  of  Porras-Montenegro  and  Perez- 
Merchancano  [10].  By  extremalizing  the  expecta¬ 
tion  value  of  the  energy,  f(R),  with  respect  to  A, 
one  obtains 

The  results  of  our  variational  calculations,  for 
X  =  0.30  and  x  =  0.45,  are  given  in  Figures  1-4. 

Figure  1  shows  the  dependence  of  E^  on  R  for 
X  =  0.30  calculated  for  a  nonparabolic  conduction 
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FIGURE  3.  Dependence  of  on  R  when  the  CB 
nonparabolicity  is  considered  simultaneously  with 
valence  dielectric  screening  (x  =  0.30). 


band  for  a  screened  donor  with  the  relevant  data 
for  a  parabolic  conduction  band.  It  is  noted  here 
that  while  we  have  calculated  E^  usings  atomic 
units  the  graphs  are  given  in  meV  and  A,  since 
these  units  are  most  often  used  for  data  presen¬ 
tation  in  the  literature.  Figure  2  is  similar  to  Fig¬ 
ure  1,  except  that  it  is  obtained  for  x  =  0.45. 
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FIGURE  4.  Dependence  of  on  R  when  the  CB 
nonparabollcity  Is  considered  simultaneously  with 
valence  dielectric  screening  (x  =  0.45). 


Figure  3  compares  vs.  R,  calculated  for  a 
nonparabolic  conduction  band  for  both  a  screened 
and  unscreened  donor  in  a  QD  of  aluminum  frac¬ 
tion  X  =  0.30.  Figure  4  is  similar  to  Figure  3, 
except  that  is  obtained  for  x  =  0.45.  Finally,  in 
Figures  1  4,  the  numerical  value  of  the  screening 
radius  is  also  displayed. 


Discussion 

Both  Figure  1  (x  =  0.30)  and  Figure  2  (x  =  0.45) 
show  that  the  CB  nonparabolicity  of  GaAs  has  a 
significant  effect  on  the  binding  energy  of  a 
screened  donor  placed  at  the  center  of  a  spherical 
QD  compared  to  that  obtained  using  a  parabolic 
CB  of  GaAs.  It  is  also  seen  from  Figures  1  and  2 
that  the  effect  increases  as  the  radius  of  the  QD 
decreases.  It  is  further  evident  from  the  figures 
that  the  binding  energy,  both  for  the  parabolic  and 
for  the  nonparabolic  cases,  increases  at  first  as  the 
QD  radius  R  decreases,  reaches  a  maximum  at  a 
certain  R  value,  and  then  starts  decreasing  as  the 


QD  radius  R  decreases  further.  Figures  1  and  2 
permit  yet  another  observation:  They  show  that 
the  maxima  in  the  binding  energy  do  not  occur  at 
the  same  value  of  R  for  the  parabolic  and  non¬ 
parabolic  cases. 

Figures  3  (x  =  0.30)  and  4  (x  =  0.45)  show  the 
dependence  of  on  R  when  the  conduction  band 
nonparabolicity  is  considered  together  with  both 
screening  and  nonscreening  of  a  donor  placed  at 
the  center  of  the  QD.  Consideration  of  screening 
enhances  the  binding  energy  E^. 
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